Re-enabled FAH for COVID, GPU engages then PC crashes

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

Post Reply
thak
Posts: 1
Joined: Sat Mar 14, 2020 9:32 pm

Re-enabled FAH for COVID, GPU engages then PC crashes

Post by thak »

Hi all - here's my logs. I just update the nVidia drivers last night.

As soon as the GPU picked up a WU, my PC hard crashed, like the power cut out. I have had occasional similar problems with games running with extremely high framerates (120+) depending on the game, but was very surprised that FAH did this. Any ideas?

Code: Select all

*********************** Log Started 2020-03-14T21:19:15Z ***********************
21:19:15:************************* Folding@home Client *************************
21:19:15:        Website: https://foldingathome.org/
21:19:15:      Copyright: (c) 2009-2018 foldingathome.org
21:19:15:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:19:15:           Args: --open-web-control
21:19:15:         Config: <none>
21:19:15:******************************** Build ********************************
21:19:15:        Version: 7.5.1
21:19:15:           Date: May 11 2018
21:19:15:           Time: 13:06:32
21:19:15:     Repository: Git
21:19:15:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
21:19:15:         Branch: master
21:19:15:       Compiler: Visual C++ 2008
21:19:15:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
21:19:15:       Platform: win32 10
21:19:15:           Bits: 32
21:19:15:           Mode: Release
21:19:15:******************************* System ********************************
21:19:15:            CPU: Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz
21:19:15:         CPU ID: GenuineIntel Family 6 Model 158 Stepping 10
21:19:15:           CPUs: 12
21:19:15:         Memory: 15.93GiB
21:19:15:    Free Memory: 6.19GiB
21:19:15:        Threads: WINDOWS_THREADS
21:19:15:     OS Version: 6.2
21:19:15:    Has Battery: false
21:19:15:     On Battery: false
21:19:15:     UTC Offset: -4
21:19:15:            PID: 2668
21:19:15:            CWD: C:\Users\keller\AppData\Roaming\FAHClient
21:19:15:             OS: Windows 10 Enterprise
21:19:15:        OS Arch: AMD64
21:19:15:           GPUs: 0
21:19:15:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:6.1 Driver:10.2
21:19:15:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:442.59
21:19:15:  Win32 Service: false
21:19:15:***********************************************************************
21:19:15:<config>
21:19:15:  <!-- Folding Slots -->
21:19:15:</config>
21:19:15:Connecting to assign1.foldingathome.org:8080
21:19:15:Updated GPUs.txt
21:19:15:Read GPUs.txt
21:19:15:Trying to access database...
21:19:15:Successfully acquired database lock
21:19:15:Enabled folding slot 00: PAUSED cpu:10 (not configured)
21:19:15:Enabled folding slot 01: PAUSED gpu:0:GP102 [GeForce GTX 1080 Ti] 11380 (not configured)
21:19:17:8:127.0.0.1:New Web connection
21:19:23:Set client configured
21:19:24:WU00:FS00:Connecting to 65.254.110.245:8080
21:19:24:WU01:FS01:Connecting to 65.254.110.245:8080
21:19:24:WU00:FS00:Connecting to 65.254.110.245:8080
21:19:24:WU01:FS01:Connecting to 65.254.110.245:8080
21:19:24:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:19:24:WU00:FS00:Connecting to 18.218.241.186:80
21:19:24:WU01:FS01:Assigned to work server 155.247.166.220
21:19:24:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce GTX 1080 Ti] 11380 from 155.247.166.220
21:19:24:WU01:FS01:Connecting to 155.247.166.220:8080
21:19:24:WARNING:WU00:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
21:19:24:ERROR:WU00:FS00:Exception: Could not get an assignment
21:19:25:WU00:FS00:Connecting to 65.254.110.245:8080
21:19:25:ERROR:WU01:FS01:Exception: 10001: Server responded: HTTP_SERVICE_UNAVAILABLE
21:19:25:WU01:FS01:Connecting to 65.254.110.245:8080
21:19:25:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:19:25:WU00:FS00:Connecting to 18.218.241.186:80
21:19:25:WU01:FS01:Assigned to work server 155.247.166.220
21:19:25:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce GTX 1080 Ti] 11380 from 155.247.166.220
21:19:25:WU01:FS01:Connecting to 155.247.166.220:8080
21:19:25:WARNING:WU00:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
21:19:25:ERROR:WU00:FS00:Exception: Could not get an assignment
21:19:25:ERROR:WU01:FS01:Exception: 10001: Server responded: HTTP_SERVICE_UNAVAILABLE
21:20:16:Saving configuration to config.xml
21:20:16:<config>
21:20:16:  <!-- Folding Slots -->
21:20:16:  <slot id='0' type='CPU'/>
21:20:16:  <slot id='1' type='GPU'/>
21:20:16:</config>
21:20:25:WU00:FS00:Connecting to 65.254.110.245:8080
21:20:25:WU01:FS01:Connecting to 65.254.110.245:8080
21:20:25:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:20:25:WU00:FS00:Connecting to 18.218.241.186:80
21:20:25:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:20:25:WU01:FS01:Connecting to 18.218.241.186:80
21:20:25:WARNING:WU00:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
21:20:25:ERROR:WU00:FS00:Exception: Could not get an assignment
21:20:26:WU01:FS01:Assigned to work server 155.247.166.220
21:20:26:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce GTX 1080 Ti] 11380 from 155.247.166.220
21:20:26:WU01:FS01:Connecting to 155.247.166.220:8080
21:20:26:ERROR:WU01:FS01:Exception: 10001: Server responded: HTTP_SERVICE_UNAVAILABLE
21:22:02:WU00:FS00:Connecting to 65.254.110.245:8080
21:22:02:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:22:02:WU00:FS00:Connecting to 18.218.241.186:80
21:22:02:WU01:FS01:Connecting to 65.254.110.245:8080
21:22:02:WARNING:WU00:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
21:22:02:ERROR:WU00:FS00:Exception: Could not get an assignment
21:22:02:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:22:02:WU01:FS01:Connecting to 18.218.241.186:80
21:22:03:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
21:22:03:ERROR:WU01:FS01:Exception: Could not get an assignment
21:23:36:35:127.0.0.1:New Web connection
21:24:00:WU00:FS00:Connecting to 65.254.110.245:8080
21:24:00:WU01:FS01:Connecting to 65.254.110.245:8080
21:24:00:WU00:FS00:Assigned to work server 128.252.203.10
21:24:00:WU00:FS00:Requesting new work unit for slot 00: READY cpu:10 from 128.252.203.10
21:24:00:WU00:FS00:Connecting to 128.252.203.10:8080
21:24:01:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:24:01:WU01:FS01:Connecting to 18.218.241.186:80
21:24:01:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
21:24:01:ERROR:WU01:FS01:Exception: Could not get an assignment
21:24:03:ERROR:WU00:FS00:Exception: Server did not assign work unit
21:25:21:Saving configuration to config.xml
21:25:21:<config>
21:25:21:  <!-- Folding Slot Configuration -->
21:25:21:  <cause v='CANCER'/>
21:25:21:
21:25:21:  <!-- Folding Slots -->
21:25:21:  <slot id='0' type='CPU'/>
21:25:21:  <slot id='1' type='GPU'/>
21:25:21:</config>
21:25:37:WU00:FS00:Connecting to 65.254.110.245:8080
21:25:37:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
21:25:37:WU00:FS00:Connecting to 18.218.241.186:80
21:25:37:WARNING:WU00:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
21:25:37:ERROR:WU00:FS00:Exception: Could not get an assignment
21:25:37:WU01:FS01:Connecting to 65.254.110.245:8080
21:25:38:WU01:FS01:Assigned to work server 155.247.166.220
21:25:38:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce GTX 1080 Ti] 11380 from 155.247.166.220
21:25:38:WU01:FS01:Connecting to 155.247.166.220:8080
21:25:38:WU01:FS01:Downloading 5.79MiB
21:25:38:WU01:FS01:Download complete
21:25:38:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:14310 run:534 clone:53 gen:0 core:0x21 unit:0x000000000002894c5e6cfafac4f48194
21:25:38:WU01:FS01:Downloading core from http://cores.foldingathome.org/v7/win/64bit/nvidia/Core_21.fah
21:25:38:WU01:FS01:Connecting to cores.foldingathome.org:80
21:25:38:WU01:FS01:FahCore 21: Downloading 3.47MiB
21:25:39:WU01:FS01:FahCore 21: Download complete
21:25:39:WU01:FS01:Valid core signature
21:25:39:WU01:FS01:Unpacked 11.80MiB to cores/cores.foldingathome.org/v7/win/64bit/nvidia/Core_21.fah/FahCore_21.exe
21:25:39:WU01:FS01:Starting
21:25:39:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\keller\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/nvidia/Core_21.fah/FahCore_21.exe -dir 01 -suffix 01 -version 705 -lifeline 2668 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
21:25:39:WU01:FS01:Started FahCore on PID 24764
21:25:39:WU01:FS01:Core PID:18460
21:25:39:WU01:FS01:FahCore 0x21 started
21:25:40:WU01:FS01:0x21:*********************** Log Started 2020-03-14T21:25:39Z ***********************
21:25:40:WU01:FS01:0x21:Project: 14310 (Run 534, Clone 53, Gen 0)
21:25:40:WU01:FS01:0x21:Unit: 0x000000000002894c5e6cfafac4f48194
21:25:40:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
21:25:40:WU01:FS01:0x21:Machine: 1
21:25:40:WU01:FS01:0x21:Reading tar file core.xml
21:25:40:WU01:FS01:0x21:Reading tar file integrator.xml
21:25:40:WU01:FS01:0x21:Reading tar file state.xml
21:25:40:WU01:FS01:0x21:Reading tar file system.xml
21:25:40:WU01:FS01:0x21:Digital signatures verified
21:25:40:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
21:25:40:WU01:FS01:0x21:Version 0.0.20
21:25:45:WU01:FS01:0x21:Completed 0 out of 25000000 steps (0%)
21:25:45:WU01:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
X-Wing
Posts: 54
Joined: Sat Apr 27, 2019 11:43 pm

Re: Re-enabled FAH for COVID, GPU engages then PC crashes

Post by X-Wing »

Are you overclocked? Also, do you have adequate cooling? If it is like the power cut out, is your power supply able to provide enough power?
Rig: i3-8350K, GTX 1660Ti, GTX 750Ti, 16GB DRR4-3000MHz.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Re-enabled FAH for COVID, GPU engages then PC crashes

Post by bruce »

Two obvious possibilities.

You don't have a big enough power supply to run everything at 100%.
Your system doesn't have enough airflow to cool everything when running at 100%

FAH often pushes a GPU into higher performance levels that games do.

You might investigate the obscure configuration settings to limit your GPU to what the rest of your system can handle.

As far as FAHClient is concerned, it will always try to maximize it's use of resources.

If it's heat, try limiting the number of CPU cores that are folding.
atlr
Posts: 9
Joined: Sat Mar 14, 2020 10:14 pm

Re: Re-enabled FAH for COVID, GPU engages then PC crashes

Post by atlr »

That sounds like a power supply over current restart. I have not tested a 1080Ti but it would not surprise me if it peaks to over 50A (600 W) on 12V. I have measured 100 ms peaks like that on a Radeon VII using a sampling multimeter with 40 ns sampling period. You could Remove slot0 (cpu) in the FAH Advanced Control-Configure-Slots to lighten the load on the power supply.
Post Reply