GPU gets WU only if its slots get removed and added again

TomasNovak4200
Posts: 6
Joined: Wed Apr 22, 2020 10:33 am

GPU gets WU only if its slots get removed and added again

Post by TomasNovak4200 »

Good day,
I have noticed that my GPU gets a few WUs a day and then stops getting new ones.
I can work around this by removing the GPU slot in the Configure - Slot tab, pressing Save, and finally adding the GPU again.
The GPU then gets a WU almost immediately. Is there any way to resolve this issue?
Thank you. Here is my log from after the GPU slot is re-added and successfully gets a WU:

10:26:23:Adding folding slot 01: READY gpu:0:TU106 [GeForce RTX 2070] M 6497
10:26:23:Removing old file 'configs/config-20200409-104704.xml'
10:26:23:Saving configuration to config.xml
10:26:23:<config>
10:26:23:  <!-- Folding Core -->
10:26:23:  <core-priority v='low'/>
10:26:23:
10:26:23:  <!-- Network -->
10:26:23:  <proxy v=':8080'/>
10:26:23:
10:26:23:  <!-- Slot Control -->
10:26:23:  <power v='full'/>
10:26:23:
10:26:23:  <!-- User Information -->
10:26:23:  <passkey v='********************************'/>
10:26:23:  <team v='49658'/>
10:26:23:  <user v='TomasNovak4200'/>
10:26:23:
10:26:23:  <!-- Folding Slots -->
10:26:23:  <slot id='0' type='CPU'/>
10:26:23:  <slot id='1' type='GPU'/>
10:26:23:</config>
10:26:23:FS00:Shutting core down
10:26:23:WU02:FS00:0xa7:WARNING:Console control signal 1 on PID 6516
10:26:23:WU02:FS00:0xa7:Exiting, please wait. . .
10:26:23:WU02:FS00:0xa7:Folding@home Core Shutdown: INTERRUPTED
10:26:31:WU00:FS01:Connecting to 65.254.110.245:8080
10:26:31:WU02:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
10:26:32:WARNING:WU00:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:26:32:WU00:FS01:Connecting to 18.218.241.186:80
10:26:33:WARNING:WU00:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:26:33:ERROR:WU00:FS01:Exception: Could not get an assignment
10:26:33:WU00:FS01:Connecting to 65.254.110.245:8080
10:26:33:WARNING:WU00:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:26:33:WU00:FS01:Connecting to 18.218.241.186:80
10:26:34:WU00:FS01:Assigned to work server 140.163.4.241
10:26:34:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:TU106 [GeForce RTX 2070] M 6497 from 140.163.4.241
10:26:34:WU00:FS01:Connecting to 140.163.4.241:8080
10:27:16:WU02:FS00:Starting
10:27:16:WARNING:WU02:FS00:Changed SMP threads from 4 to 3 this can cause some work units to fail
10:27:16:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\tomas\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe -dir 02 -suffix 01 -version 705 -lifeline 12936 -checkpoint 15 -np 3
10:27:16:WU02:FS00:Started FahCore on PID 17308
10:27:16:WU02:FS00:Core PID:15796
10:27:16:WU02:FS00:FahCore 0xa7 started
10:27:17:WU02:FS00:0xa7:*********************** Log Started 2020-04-22T10:27:16Z ***********************
10:27:17:WU02:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
10:27:17:WU02:FS00:0xa7:       Type: 0xa7
10:27:17:WU02:FS00:0xa7:       Core: Gromacs
10:27:17:WU02:FS00:0xa7:       Args: -dir 02 -suffix 01 -version 705 -lifeline 17308 -checkpoint 15 -np
10:27:17:WU02:FS00:0xa7:             3
10:27:17:WU02:FS00:0xa7:************************************ CBang *************************************
10:27:17:WU02:FS00:0xa7:       Date: Oct 26 2019
10:27:17:WU02:FS00:0xa7:       Time: 01:38:25
10:27:17:WU02:FS00:0xa7:   Revision: c46a1a011a24143739ac7218c5a435f66777f62f
10:27:17:WU02:FS00:0xa7:     Branch: master
10:27:17:WU02:FS00:0xa7:   Compiler: Visual C++ 2008
10:27:17:WU02:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
10:27:17:WU02:FS00:0xa7:   Platform: win32 10
10:27:17:WU02:FS00:0xa7:       Bits: 64
10:27:17:WU02:FS00:0xa7:       Mode: Release
10:27:17:WU02:FS00:0xa7:************************************ System ************************************
10:27:17:WU02:FS00:0xa7:        CPU: Intel(R) Core(TM) i5-7600K CPU @ 3.80GHz
10:27:17:WU02:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 158 Stepping 9
10:27:17:WU02:FS00:0xa7:       CPUs: 4
10:27:17:WU02:FS00:0xa7:     Memory: 31.95GiB
10:27:17:WU02:FS00:0xa7:Free Memory: 18.67GiB
10:27:17:WU02:FS00:0xa7:    Threads: WINDOWS_THREADS
10:27:17:WU02:FS00:0xa7: OS Version: 6.2
10:27:17:WU02:FS00:0xa7:Has Battery: false
10:27:17:WU02:FS00:0xa7: On Battery: false
10:27:17:WU02:FS00:0xa7: UTC Offset: 2
10:27:17:WU02:FS00:0xa7:        PID: 15796
10:27:17:WU02:FS00:0xa7:        CWD: C:\Users\tomas\AppData\Roaming\FAHClient\work
10:27:17:WU02:FS00:0xa7:******************************** Build - libFAH ********************************
10:27:17:WU02:FS00:0xa7:    Version: 0.0.18
10:27:17:WU02:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
10:27:17:WU02:FS00:0xa7:  Copyright: 2019 foldingathome.org
10:27:17:WU02:FS00:0xa7:   Homepage: https://foldingathome.org/
10:27:17:WU02:FS00:0xa7:       Date: Oct 26 2019
10:27:17:WU02:FS00:0xa7:       Time: 01:52:30
10:27:17:WU02:FS00:0xa7:   Revision: c1e3513b1bc0c16013668f2173ee969e5995b38e
10:27:17:WU02:FS00:0xa7:     Branch: master
10:27:17:WU02:FS00:0xa7:   Compiler: Visual C++ 2008
10:27:17:WU02:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
10:27:17:WU02:FS00:0xa7:   Platform: win32 10
10:27:17:WU02:FS00:0xa7:       Bits: 64
10:27:17:WU02:FS00:0xa7:       Mode: Release
10:27:17:WU02:FS00:0xa7:************************************ Build *************************************
10:27:17:WU02:FS00:0xa7:       SIMD: avx_256
10:27:17:WU02:FS00:0xa7:********************************************************************************
10:27:17:WU02:FS00:0xa7:Project: 16425 (Run 1898, Clone 4, Gen 14)
10:27:17:WU02:FS00:0xa7:Unit: 0x00000010a8f5c67d5e913faa3b14cb44
10:27:17:WU02:FS00:0xa7:Digital signatures verified
10:27:17:WU02:FS00:0xa7:Calling: mdrun -s frame14.tpr -o frame14.trr -x frame14.xtc -cpi state.cpt -cpt 15 -nt 3
10:27:17:WU02:FS00:0xa7:Steps: first=14000000 total=1000000
10:27:17:WU02:FS00:0xa7:Completed 623722 out of 1000000 steps (62%)
10:28:00:WU00:FS01:Downloading 11.98MiB
10:28:06:WU00:FS01:Download 44.34%
10:28:12:WU00:FS01:Download 88.17%
10:28:13:WU00:FS01:Download complete
10:28:13:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:11741 run:0 clone:7105 gen:15 core:0x22 unit:0x0000001b8ca304f15e6bc57c30d70a34
10:28:13:WU00:FS01:Starting
10:28:13:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\tomas\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 00 -suffix 01 -version 705 -lifeline 12936 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
10:28:13:WU00:FS01:Started FahCore on PID 392
10:28:13:WU00:FS01:Core PID:21660
10:28:13:WU00:FS01:FahCore 0x22 started
10:28:14:WU00:FS01:0x22:*********************** Log Started 2020-04-22T10:28:14Z ***********************
10:28:14:WU00:FS01:0x22:*************************** Core22 Folding@home Core ***************************
10:28:14:WU00:FS01:0x22:       Type: 0x22
10:28:14:WU00:FS01:0x22:       Core: Core22
10:28:14:WU00:FS01:0x22:    Website: https://foldingathome.org/
10:28:14:WU00:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
10:28:14:WU00:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
10:28:14:WU00:FS01:0x22:             <rafal.wiewiora@choderalab.org>
10:28:14:WU00:FS01:0x22:       Args: -dir 00 -suffix 01 -version 705 -lifeline 392 -checkpoint 15
10:28:14:WU00:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
10:28:14:WU00:FS01:0x22:             0 -gpu 0
10:28:14:WU00:FS01:0x22:     Config: <none>
10:28:14:WU00:FS01:0x22:************************************ Build *************************************
10:28:14:WU00:FS01:0x22:    Version: 0.0.2
10:28:14:WU00:FS01:0x22:       Date: Dec 6 2019
10:28:14:WU00:FS01:0x22:       Time: 21:30:31
10:28:14:WU00:FS01:0x22: Repository: Git
10:28:14:WU00:FS01:0x22:   Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
10:28:14:WU00:FS01:0x22:     Branch: HEAD
10:28:14:WU00:FS01:0x22:   Compiler: Visual C++ 2008
10:28:14:WU00:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
10:28:14:WU00:FS01:0x22:   Platform: win32 10
10:28:14:WU00:FS01:0x22:       Bits: 64
10:28:14:WU00:FS01:0x22:       Mode: Release
10:28:14:WU00:FS01:0x22:************************************ System ************************************
10:28:14:WU00:FS01:0x22:        CPU: Intel(R) Core(TM) i5-7600K CPU @ 3.80GHz
10:28:14:WU00:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 158 Stepping 9
10:28:14:WU00:FS01:0x22:       CPUs: 4
10:28:14:WU00:FS01:0x22:     Memory: 31.95GiB
10:28:14:WU00:FS01:0x22:Free Memory: 18.63GiB
10:28:14:WU00:FS01:0x22:    Threads: WINDOWS_THREADS
10:28:14:WU00:FS01:0x22: OS Version: 6.2
10:28:14:WU00:FS01:0x22:Has Battery: false
10:28:14:WU00:FS01:0x22: On Battery: false
10:28:14:WU00:FS01:0x22: UTC Offset: 2
10:28:14:WU00:FS01:0x22:        PID: 21660
10:28:14:WU00:FS01:0x22:        CWD: C:\Users\tomas\AppData\Roaming\FAHClient\work
10:28:14:WU00:FS01:0x22:         OS: Windows 10 Pro
10:28:14:WU00:FS01:0x22:    OS Arch: AMD64
10:28:14:WU00:FS01:0x22:********************************************************************************
10:28:14:WU00:FS01:0x22:Project: 11741 (Run 0, Clone 7105, Gen 15)
10:28:14:WU00:FS01:0x22:Unit: 0x0000001b8ca304f15e6bc57c30d70a34
10:28:14:WU00:FS01:0x22:Reading tar file core.xml
10:28:14:WU00:FS01:0x22:Reading tar file integrator.xml
10:28:14:WU00:FS01:0x22:Reading tar file state.xml
10:28:15:WU00:FS01:0x22:Reading tar file system.xml
10:28:16:WU00:FS01:0x22:Digital signatures verified
10:28:16:WU00:FS01:0x22:Folding@home GPU Core22 Folding@home Core
10:28:16:WU00:FS01:0x22:Version 0.0.2
10:28:29:WU02:FS00:0xa7:Completed 630000 out of 1000000 steps (63%)
10:28:31:WU00:FS01:0x22:Completed 0 out of 1000000 steps (0%)
10:28:31:WU00:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
10:29:52:WU00:FS01:0x22:Completed 10000 out of 1000000 steps (1%)
10:30:40:WU02:FS00:0xa7:Completed 640000 out of 1000000 steps (64%)
10:31:16:WU00:FS01:0x22:Completed 20000 out of 1000000 steps (2%)
10:32:44:WU00:FS01:0x22:Completed 30000 out of 1000000 steps (3%)
10:33:43:WU02:FS00:0xa7:Completed 650000 out of 1000000 steps (65%)
10:34:13:WU00:FS01:0x22:Completed 40000 out of 1000000 steps (4%)
10:35:34:WU00:FS01:0x22:Completed 50000 out of 1000000 steps (5%)
10:35:52:WU02:FS00:0xa7:Completed 660000 out of 1000000 steps (66%)
10:36:59:WU00:FS01:0x22:Completed 60000 out of 1000000 steps (6%)
10:37:51:WU02:FS00:0xa7:Completed 670000 out of 1000000 steps (67%)
10:38:19:WU00:FS01:0x22:Completed 70000 out of 1000000 steps (7%)
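
For anyone who wants to see how many assignment requests a slot makes before it gets work, a log like the one above can be summarized with a short script. This is only a rough sketch: the regular expressions are written against the line shapes visible in this particular log, and the function name `summarize_assignments` is made up for illustration, so other client versions may need different patterns.

```python
import re

# Patterns are assumptions based on the log lines shown in this thread.
FAILED = re.compile(
    r"^(\d\d:\d\d:\d\d):WARNING:WU\d+:(FS\d+):"
    r"Failed to get assignment from '([^']+)': (.*)$"
)
ASSIGNED = re.compile(
    r"^(\d\d:\d\d:\d\d):WU\d+:(FS\d+):Assigned to work server (\S+)$"
)


def summarize_assignments(lines):
    """Count failed assignment requests and list successful ones per slot."""
    summary = {}
    for raw in lines:
        line = raw.strip()
        failed = FAILED.match(line)
        if failed:
            slot = summary.setdefault(failed.group(2), {"failed": 0, "assigned": []})
            slot["failed"] += 1
            continue
        assigned = ASSIGNED.match(line)
        if assigned:
            slot = summary.setdefault(assigned.group(2), {"failed": 0, "assigned": []})
            # Store (timestamp, work server) for each successful assignment.
            slot["assigned"].append((assigned.group(1), assigned.group(3)))
    return summary


# Lines copied from the log in this post.
sample = [
    "10:26:31:WU00:FS01:Connecting to 65.254.110.245:8080",
    "10:26:32:WARNING:WU00:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration",
    "10:26:33:WARNING:WU00:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration",
    "10:26:33:ERROR:WU00:FS01:Exception: Could not get an assignment",
    "10:26:33:WARNING:WU00:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration",
    "10:26:34:WU00:FS01:Assigned to work server 140.163.4.241",
]
result = summarize_assignments(sample)
print(result)
```

Running it over a full day's log would show whether the gaps between "Failed to get assignment" lines keep growing before a WU finally arrives.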

Joe_H
Site Admin
Posts: 7937
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: GPU gets WU only if its slots get removed and added agai

Post by Joe_H »

No need to remove the slot and add it again; just pause folding on that slot for a minute or two and then restart it.

In general, there are currently more requests for GPU WUs than the servers can provide. They are working on adding capacity. In the meantime, it can take a while for a request to be accepted and a WU assigned. The client backs off by an increasing amount between requests; with the 7.5.1 client, that can end up being hours between requests. So if the time between requests gets longer than a couple of hours, pausing and restarting the slot resets the timer.
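
To illustrate why a reset helps, here is a toy model of an increasing back-off timer. It is not the client's actual code: the `RetryTimer` class, the 60-second starting delay, the doubling factor, and the 4-hour cap are all assumptions chosen only to show the general behavior.

```python
class RetryTimer:
    """Hypothetical back-off timer; all numbers are illustrative assumptions."""

    def __init__(self, base_s=60.0, factor=2.0, cap_s=4 * 3600.0):
        self.base_s = base_s    # assumed wait after the first failed request
        self.factor = factor    # assumed growth per failed request
        self.cap_s = cap_s      # assumed upper limit on the wait
        self._next_s = base_s

    def next_delay(self):
        """Return the wait before the next request, then lengthen the following one."""
        delay = self._next_s
        self._next_s = min(self._next_s * self.factor, self.cap_s)
        return delay

    def reset(self):
        """Model a pause and restart of the slot: start again from the base delay."""
        self._next_s = self.base_s


timer = RetryTimer()
delays = [timer.next_delay() for _ in range(8)]  # eight failed requests in a row
timer.reset()
after_reset = timer.next_delay()
print(delays, after_reset)
```

Under these assumed numbers, the eighth consecutive failure already means a wait of more than two hours, while the first request after a reset goes out after the short base delay again.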

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
TomasNovak4200
Posts: 6
Joined: Wed Apr 22, 2020 10:33 am

Re: GPU gets WU only if its slots get removed and added agai

Post by TomasNovak4200 »

Well, I haven't had as much success with pausing and restarting as with removing the slot and adding it again.
In fact, I had my GPU slot paused for a day or two before I resolved it the way I described.
But it has been running smoothly since I first posted this topic.
I will post an update if anything goes wrong.
Thank you for the fast answer, though :)