Failed WU's on AMD GPU's

It seems that a lot of GPU problems revolve around specific versions of drivers. Though AMD has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

FireFox-89
Posts: 72
Joined: Wed Nov 27, 2019 9:18 pm
Hardware configuration: Luna-
AMD Ryzen 5 2600 @ stock | Noctua NH-U9s with 2 Noctua B9 Redux 1600 92mm Fans
MSI B450 Tomahawk Max Revision 1.0 BIOS version 3.F0
Corsair Vengeance LPX 4x8GB DDR4-3000 | Timings 16-18-18-36
Sapphire Radeon RX 590 Nitro+ 8GB
SeaSonic M12II 850w 80+Bronze PSU

Terra-
Intel Core i5-4690K @ 4.20GHz with Vcore of 1.138v | Corsair H100i V2 with 2 Corsair SP120 120mm fans
Gigabyte Z87X-UD3H BIOS version 10b
Crucial Ballistix Sport 2x4GB DDR3-1600 | Timings 9-9-9-24
Sapphire Radeon RX 570 Pulse 4GB -100mV undervolt
OCZ Technology ZS Series 550w 80+Bronze PSU

Anubis-
Intel Core i7-5820K @3.80GHz with Vcore of 1.089v | Cooler Master Hyper 212 with Noctua NF-F12 120mm IPPC 3000 fan
Gigabyte X99-SOC Champion BIOS version F23c
G.Skill Trident Z RGB 2x8GB DDR4-3000 @ 2400 | Timings 15-15-15-36
Sapphire Radeon RX 580 Nitro+ 4GB -25mV undervolt
SeaSonic Prime Platinum 750w 80+Platinum PSU

All machines running F@H v7.6.21
Location: Lincolnshire, UK

Failed WU's on AMD GPU's

Post by FireFox-89 »

Hi Guys,

I think I have issues with my GPU drivers (AMD and annoying drivers, who would have thought it), but I'm not 100% sure that's the cause, so I need to start somewhere. The PC having the issues has a Core i7-2600K @ stock on a Gigabyte Z77X-D3H board, with a Sapphire RX 590 Nitro+ as the primary GPU and a Sapphire RX 580 Nitro+ alongside it. It's currently on driver version 20.3.1; I was on 20.2.2 before that and also tried 19.9.2, but I get the same issues on all of them. Before each driver reinstall I removed the old one with Display Driver Uninstaller to make sure no remnants of the previous install were left.

Code:

*********************** Log Started 2020-04-02T10:22:46Z ***********************
10:22:47:ERROR:FS02:OpenCL GPU not found for 'opencl-index' = 2 with vendor ID = 0x1002, please correct this by removing the manually configured 'opencl-index' option.
10:23:14:WARNING:WU00:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:23:15:WARNING:WU00:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:23:15:ERROR:WU00:FS02:Exception: Could not get an assignment
10:23:29:WU00:FS02:0x22:ERROR:exception: Illegal value for DeviceIndex: 2
10:23:30:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
10:23:30:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:23:30:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:23:30:ERROR:WU01:FS02:Exception: Could not get an assignment
10:23:31:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:23:31:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:23:31:ERROR:WU01:FS02:Exception: Could not get an assignment
10:24:52:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
10:25:13:ERROR:WU01:FS02:Exception: Failed to connect to 40.114.52.201:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
10:26:08:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:26:08:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:26:08:ERROR:WU01:FS02:Exception: Could not get an assignment
10:28:46:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:28:46:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:28:46:ERROR:WU01:FS02:Exception: Could not get an assignment
10:33:00:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:33:00:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:33:00:ERROR:WU01:FS02:Exception: Could not get an assignment
10:39:51:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:39:51:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:39:51:ERROR:WU01:FS02:Exception: Could not get an assignment
10:50:57:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:50:57:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:50:57:ERROR:WU01:FS02:Exception: Could not get an assignment
11:08:54:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
11:08:54:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
11:08:54:ERROR:WU01:FS02:Exception: Could not get an assignment
11:37:56:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
11:37:56:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
11:37:56:ERROR:WU01:FS02:Exception: Could not get an assignment
Folding on either the 590 or the 580 on its own works perfectly fine, but when both are active together one or the other fails work units, and I don't know enough about this to dig further without assistance. I don't think it's a hardware issue, because I've stressed each card individually for multiple hours with Heaven Benchmark, FurMark and the AIDA64 stability test with no artifacting, crashes or temperature weirdness.

Any help would be greatly appreciated :)
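
Most of a log like the one above is the assignment servers saying they have no work, which buries the lines that actually describe a failure. A minimal Python sketch for filtering those out follows. It assumes the line layout seen in this log and treats the "No WUs available" and "Could not get an assignment" messages as noise; the function name `core_failures` is made up for illustration and is not part of the FAH client.

```python
# Messages that only report that no work was handed out (assumed to be noise here).
NOISE = (
    "No WUs available for this configuration",
    "Could not get an assignment",
)

def core_failures(log_text):
    """Return WARNING/ERROR lines that are not assignment-server noise."""
    kept = []
    for line in log_text.splitlines():
        # Only lines flagged as WARNING or ERROR are of interest.
        if "ERROR" not in line and "WARNING" not in line:
            continue
        # Drop the repeated "no work available" messages.
        if any(msg in line for msg in NOISE):
            continue
        kept.append(line)
    return kept
```

Run over the log in this post, a filter like this would leave the `opencl-index` error, the `Illegal value for DeviceIndex: 2` exception, the `BAD_WORK_UNIT` warning and the work server connection failures.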
Neil-B
Posts: 1996
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: Failed WU's on AMD GPU's

Post by Neil-B »

Can you post the log header (200 lines or so)? This may help people identify what is occurring.
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)
Re: Failed WU's on AMD GPU's

Post by FireFox-89 »

Neil-B wrote: Can you post the log header (200 lines or so)? This may help people identify what is occurring.
Ahh no worries, hopefully this is it. Just getting to grips with the log so apologies in advance for noobness :)

Code:

*********************** Log Started 2020-04-02T14:42:12Z ***********************
14:42:12:************************* Folding@home Client *************************
14:42:12:        Website: https://foldingathome.org/
14:42:12:      Copyright: (c) 2009-2018 foldingathome.org
14:42:12:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:42:12:           Args: --open-web-control
14:42:12:         Config: C:\Users\Luna\AppData\Roaming\FAHClient\config.xml
14:42:12:******************************** Build ********************************
14:42:12:        Version: 7.5.1
14:42:12:           Date: May 11 2018
14:42:12:           Time: 13:06:32
14:42:12:     Repository: Git
14:42:12:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
14:42:12:         Branch: master
14:42:12:       Compiler: Visual C++ 2008
14:42:12:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:42:12:       Platform: win32 10
14:42:12:           Bits: 32
14:42:12:           Mode: Release
14:42:12:******************************* System ********************************
14:42:12:            CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
14:42:12:         CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
14:42:12:           CPUs: 8
14:42:12:         Memory: 15.96GiB
14:42:12:    Free Memory: 12.96GiB
14:42:12:        Threads: WINDOWS_THREADS
14:42:12:     OS Version: 6.2
14:42:12:    Has Battery: false
14:42:12:     On Battery: false
14:42:12:     UTC Offset: 1
14:42:12:            PID: 7912
14:42:12:            CWD: C:\Users\Luna\AppData\Roaming\FAHClient
14:42:12:             OS: Windows 10 Enterprise
14:42:12:        OS Arch: AMD64
14:42:12:           GPUs: 2
14:42:12:          GPU 0: Bus:2 Slot:0 Func:0 AMD:5 Ellesmere XT [Radeon RX
14:42:12:                 470/480/570/580/590]
14:42:12:          GPU 1: Bus:1 Slot:0 Func:0 AMD:5 Ellesmere XT [Radeon RX
14:42:12:                 470/480/570/580/590]
14:42:12:           CUDA: Not detected: Failed to open dynamic library 'nvcuda.dll': The
14:42:12:                 specified module could not be found.
14:42:12:
14:42:12:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:3004.8
14:42:12:OpenCL Device 1: Platform:0 Device:1 Bus:2 Slot:0 Compute:1.2 Driver:3004.8
14:42:12:  Win32 Service: false
14:42:12:***********************************************************************
14:42:12:<config>
14:42:12:  <!-- Folding Core -->
14:42:12:  <checkpoint v='10'/>
14:42:12:
14:42:12:  <!-- Network -->
14:42:12:  <proxy v=':8080'/>
14:42:12:
14:42:12:  <!-- Slot Control -->
14:42:12:  <pause-on-battery v='false'/>
14:42:12:
14:42:12:  <!-- User Information -->
14:42:12:  <passkey v='********************************'/>
14:42:12:  <team v='235495'/>
14:42:12:  <user v='Luna'/>
14:42:12:
14:42:12:  <!-- Folding Slots -->
14:42:12:</config>
14:42:12:Trying to access database...
14:42:12:Successfully acquired database lock
14:42:12:Enabled folding slot 00: READY cpu:5
14:42:12:Enabled folding slot 01: READY gpu:0:Ellesmere XT [Radeon RX 470/480/570/580/590]
14:42:12:Enabled folding slot 02: READY gpu:1:Ellesmere XT [Radeon RX 470/480/570/580/590]
14:42:13:WU00:FS00:Connecting to 65.254.110.245:8080
14:42:13:WU01:FS01:Connecting to 65.254.110.245:8080
14:42:13:WU02:FS02:Connecting to 65.254.110.245:8080
14:42:13:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WU00:FS00:Connecting to 18.218.241.186:80
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WU02:FS02:Connecting to 18.218.241.186:80
14:42:13:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WU01:FS01:Connecting to 18.218.241.186:80
14:42:13:WU00:FS00:Assigned to work server 128.252.203.4
14:42:13:WU00:FS00:Requesting new work unit for slot 00: READY cpu:5 from 128.252.203.4
14:42:13:WU00:FS00:Connecting to 128.252.203.4:8080
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:13:ERROR:WU02:FS02:Exception: Could not get an assignment
14:42:14:WU02:FS02:Connecting to 65.254.110.245:8080
14:42:14:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU01:FS01:Exception: Could not get an assignment
14:42:14:WU01:FS01:Connecting to 65.254.110.245:8080
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:14:WU02:FS02:Connecting to 18.218.241.186:80
14:42:14:WU01:FS01:Assigned to work server 140.163.4.231
14:42:14:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:Ellesmere XT [Radeon RX 470/480/570/580/590] from 140.163.4.231
14:42:14:WU01:FS01:Connecting to 140.163.4.231:8080
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:42:14:WU00:FS00:Downloading 4.36MiB
14:42:18:WU00:FS00:Download complete
14:42:18:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13840 run:0 clone:2329 gen:24 core:0xa7 unit:0x0000001980fccb045e6ee1f49451cfbb
14:42:18:WU00:FS00:Starting
14:42:18:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Luna\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 705 -lifeline 7912 -checkpoint 10 -np 5
14:42:18:WU00:FS00:Started FahCore on PID 7256
14:42:18:WU00:FS00:Core PID:5012
14:42:18:WU00:FS00:FahCore 0xa7 started
14:42:18:8:127.0.0.1:New Web connection
14:42:18:WU00:FS00:0xa7:*********************** Log Started 2020-04-02T14:42:18Z ***********************
14:42:18:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
14:42:18:WU00:FS00:0xa7:       Type: 0xa7
14:42:18:WU00:FS00:0xa7:       Core: Gromacs
14:42:18:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 705 -lifeline 7256 -checkpoint 10 -np 5
14:42:18:WU00:FS00:0xa7:************************************ CBang *************************************
14:42:18:WU00:FS00:0xa7:       Date: Oct 26 2019
14:42:18:WU00:FS00:0xa7:       Time: 01:38:25
14:42:18:WU00:FS00:0xa7:   Revision: c46a1a011a24143739ac7218c5a435f66777f62f
14:42:18:WU00:FS00:0xa7:     Branch: master
14:42:18:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
14:42:18:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:42:18:WU00:FS00:0xa7:   Platform: win32 10
14:42:18:WU00:FS00:0xa7:       Bits: 64
14:42:18:WU00:FS00:0xa7:       Mode: Release
14:42:18:WU00:FS00:0xa7:************************************ System ************************************
14:42:18:WU00:FS00:0xa7:        CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
14:42:18:WU00:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
14:42:18:WU00:FS00:0xa7:       CPUs: 8
14:42:18:WU00:FS00:0xa7:     Memory: 15.96GiB
14:42:18:WU00:FS00:0xa7:Free Memory: 12.38GiB
14:42:18:WU00:FS00:0xa7:    Threads: WINDOWS_THREADS
14:42:18:WU00:FS00:0xa7: OS Version: 6.2
14:42:18:WU00:FS00:0xa7:Has Battery: false
14:42:18:WU00:FS00:0xa7: On Battery: false
14:42:18:WU00:FS00:0xa7: UTC Offset: 1
14:42:18:WU00:FS00:0xa7:        PID: 5012
14:42:18:WU00:FS00:0xa7:        CWD: C:\Users\Luna\AppData\Roaming\FAHClient\work
14:42:18:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
14:42:18:WU00:FS00:0xa7:    Version: 0.0.18
14:42:18:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:42:18:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
14:42:18:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
14:42:18:WU00:FS00:0xa7:       Date: Oct 26 2019
14:42:18:WU00:FS00:0xa7:       Time: 01:52:30
14:42:18:WU00:FS00:0xa7:   Revision: c1e3513b1bc0c16013668f2173ee969e5995b38e
14:42:18:WU00:FS00:0xa7:     Branch: master
14:42:18:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
14:42:18:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:42:18:WU00:FS00:0xa7:   Platform: win32 10
14:42:18:WU00:FS00:0xa7:       Bits: 64
14:42:18:WU00:FS00:0xa7:       Mode: Release
14:42:18:WU00:FS00:0xa7:************************************ Build *************************************
14:42:18:WU00:FS00:0xa7:       SIMD: avx_256
14:42:18:WU00:FS00:0xa7:********************************************************************************
14:42:18:WU00:FS00:0xa7:Project: 13840 (Run 0, Clone 2329, Gen 24)
14:42:18:WU00:FS00:0xa7:Unit: 0x0000001980fccb045e6ee1f49451cfbb
14:42:18:WU00:FS00:0xa7:Reading tar file core.xml
14:42:18:WU00:FS00:0xa7:Reading tar file frame24.tpr
14:42:18:WU00:FS00:0xa7:Digital signatures verified
14:42:18:WU00:FS00:0xa7:Reducing thread count from 5 to 4 to avoid domain decomposition by a prime number > 3
14:42:18:WU00:FS00:0xa7:Calling: mdrun -s frame24.tpr -o frame24.trr -x frame24.xtc -e frame24.edr -cpt 10 -nt 4
14:42:18:WU00:FS00:0xa7:Steps: first=3000000 total=125000
14:42:21:WU00:FS00:0xa7:Completed 1 out of 125000 steps (0%)
14:43:02:WU01:FS01:Downloading 7.85MiB
14:43:08:WU01:FS01:Download 53.33%
14:43:13:Removing old file 'configs/config-20200402-091851.xml'
14:43:13:Saving configuration to config.xml
14:43:13:<config>
14:43:13:  <!-- Folding Core -->
14:43:13:  <checkpoint v='10'/>
14:43:13:
14:43:13:  <!-- Network -->
14:43:13:  <proxy v=':8080'/>
14:43:13:
14:43:13:  <!-- Slot Control -->
14:43:13:  <pause-on-battery v='false'/>
14:43:13:
14:43:13:  <!-- User Information -->
14:43:13:  <passkey v='********************************'/>
14:43:13:  <team v='235495'/>
14:43:13:  <user v='Luna'/>
14:43:13:
14:43:13:  <!-- Folding Slots -->
14:43:13:  <slot id='0' type='CPU'/>
14:43:13:  <slot id='1' type='GPU'/>
14:43:13:  <slot id='2' type='GPU'/>
14:43:13:</config>
14:43:13:WU01:FS01:Download complete
14:43:13:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:11750 run:0 clone:195 gen:14 core:0x22 unit:0x0000001e8ca304e75e6a801938779c25
14:43:13:WU01:FS01:Starting
14:43:13:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Luna\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 01 -suffix 01 -version 705 -lifeline 7912 -checkpoint 10 -gpu-vendor amd -opencl-platform 0 -opencl-device 1 -gpu 1
14:43:14:WU01:FS01:Started FahCore on PID 4144
14:43:14:WU01:FS01:Core PID:4596
14:43:14:WU01:FS01:FahCore 0x22 started
14:43:14:WU02:FS02:Connecting to 65.254.110.245:8080
14:43:14:WU01:FS01:0x22:*********************** Log Started 2020-04-02T14:43:14Z ***********************
14:43:14:WU01:FS01:0x22:*************************** Core22 Folding@home Core ***************************
14:43:14:WU01:FS01:0x22:       Type: 0x22
14:43:14:WU01:FS01:0x22:       Core: Core22
14:43:14:WU01:FS01:0x22:    Website: https://foldingathome.org/
14:43:14:WU01:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
14:43:14:WU01:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
14:43:14:WU01:FS01:0x22:             <rafal.wiewiora@choderalab.org>
14:43:14:WU01:FS01:0x22:       Args: -dir 01 -suffix 01 -version 705 -lifeline 4144 -checkpoint 10
14:43:14:WU01:FS01:0x22:             -gpu-vendor amd -opencl-platform 0 -opencl-device 1 -gpu 1
14:43:14:WU01:FS01:0x22:     Config: <none>
14:43:14:WU01:FS01:0x22:************************************ Build *************************************
14:43:14:WU01:FS01:0x22:    Version: 0.0.2
14:43:14:WU01:FS01:0x22:       Date: Dec 6 2019
14:43:14:WU01:FS01:0x22:       Time: 21:30:31
14:43:14:WU01:FS01:0x22: Repository: Git
14:43:14:WU01:FS01:0x22:   Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
14:43:14:WU01:FS01:0x22:     Branch: HEAD
14:43:14:WU01:FS01:0x22:   Compiler: Visual C++ 2008
14:43:14:WU01:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:43:14:WU01:FS01:0x22:   Platform: win32 10
14:43:14:WU01:FS01:0x22:       Bits: 64
14:43:14:WU01:FS01:0x22:       Mode: Release
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:43:14:WU01:FS01:0x22:************************************ System ************************************
14:43:14:WU02:FS02:Connecting to 18.218.241.186:80
14:43:14:WU01:FS01:0x22:        CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
14:43:14:WU01:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
14:43:14:WU01:FS01:0x22:       CPUs: 8
14:43:14:WU01:FS01:0x22:     Memory: 15.96GiB
14:43:14:WU01:FS01:0x22:Free Memory: 12.09GiB
14:43:14:WU01:FS01:0x22:    Threads: WINDOWS_THREADS
14:43:14:WU01:FS01:0x22: OS Version: 6.2
14:43:14:WU01:FS01:0x22:Has Battery: false
14:43:14:WU01:FS01:0x22: On Battery: false
14:43:14:WU01:FS01:0x22: UTC Offset: 1
14:43:14:WU01:FS01:0x22:        PID: 4596
14:43:14:WU01:FS01:0x22:        CWD: C:\Users\Luna\AppData\Roaming\FAHClient\work
14:43:14:WU01:FS01:0x22:         OS: Windows 10 Pro
14:43:14:WU01:FS01:0x22:    OS Arch: AMD64
14:43:14:WU01:FS01:0x22:********************************************************************************
14:43:14:WU01:FS01:0x22:Project: 11750 (Run 0, Clone 195, Gen 14)
14:43:14:WU01:FS01:0x22:Unit: 0x0000001e8ca304e75e6a801938779c25
14:43:14:WU01:FS01:0x22:Reading tar file core.xml
14:43:14:WU01:FS01:0x22:Reading tar file integrator.xml
14:43:14:WU01:FS01:0x22:Reading tar file state.xml
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:43:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:43:16:WU01:FS01:0x22:Reading tar file system.xml
14:43:16:WU01:FS01:0x22:Digital signatures verified
14:43:16:WU01:FS01:0x22:Folding@home GPU Core22 Folding@home Core
14:43:16:WU01:FS01:0x22:Version 0.0.2
toTOW
Site Moderator
Posts: 6359
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Failed WU's on AMD GPU's

Post by toTOW »

Everything looks right in the log you posted ...

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Re: Failed WU's on AMD GPU's

Post by Neil-B »

I am really not the expert on this, but the first log showed opencl-index set to 2, which may not exist. You may want to look at the configuration of each GPU slot (using advanced control). My gut tells me one slot should be using OpenCL index "0" and the other "1", but I wouldn't know which. It may be better to wait for more knowledgeable input than mine. Obviously, just looking (opening the editor without changing anything) probably can't do any harm :)

Argh, just seen toTOW's post, and he knows more than I do, so I may be misadvising. Sorry.
Re: Failed WU's on AMD GPU's

Post by FireFox-89 »

toTOW wrote: Everything looks right in the log you posted ...
I restarted the program, which restarted the log. I'm leaving both slots running and waiting for the errors to come back, then I'll post here again. I also noticed that the OpenCL index wasn't set to -1, so I set both slots back to that. I'm not sure what I was trying to do there, but I was getting errors before I started playing with settings that I don't fully understand yet.

EDIT: Got another error here, just found it in the log.

Code:

*********************** Log Started 2020-04-02T14:42:12Z ***********************
14:42:13:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:13:ERROR:WU02:FS02:Exception: Could not get an assignment
14:42:14:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU01:FS01:Exception: Could not get an assignment
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:43:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:44:51:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:44:52:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:44:52:ERROR:WU02:FS02:Exception: Could not get an assignment
14:47:28:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:47:50:WARNING:WU02:FS02:WorkServer connection failed on port 8080 trying 80
14:48:11:ERROR:WU02:FS02:Exception: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
17:43:29:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:43:30:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:43:30:ERROR:WU03:FS01:Exception: Could not get an assignment
17:43:30:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:43:30:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:43:30:ERROR:WU03:FS01:Exception: Could not get an assignment
17:44:31:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:44:31:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:44:31:ERROR:WU03:FS01:Exception: Could not get an assignment
17:46:08:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:46:08:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:46:08:ERROR:WU03:FS01:Exception: Could not get an assignment
17:48:45:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:48:45:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:48:45:ERROR:WU03:FS01:Exception: Could not get an assignment
17:52:59:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:52:59:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:52:59:ERROR:WU03:FS01:Exception: Could not get an assignment
17:59:51:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:00:12:WARNING:WU03:FS01:WorkServer connection failed on port 8080 trying 80
18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)
18:03:07:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
18:03:08:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:03:08:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:03:08:ERROR:WU01:FS01:Exception: Could not get an assignment
18:03:09:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:03:09:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:03:09:ERROR:WU01:FS01:Exception: Could not get an assignment
18:04:09:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:04:09:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:04:09:ERROR:WU01:FS01:Exception: Could not get an assignment
18:05:46:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:05:46:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:05:46:ERROR:WU01:FS01:Exception: Could not get an assignment
18:08:44:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
18:09:05:ERROR:WU01:FS01:Exception: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
18:12:37:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:12:38:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:12:38:ERROR:WU01:FS01:Exception: Could not get an assignment
18:19:29:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:19:29:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:19:29:ERROR:WU01:FS01:Exception: Could not get an assignment
18:30:34:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:30:35:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:30:35:ERROR:WU01:FS01:Exception: Could not get an assignment
18:48:31:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:49:14:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:49:14:ERROR:WU01:FS01:Exception: Could not get an assignment
19:17:33:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
19:17:34:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
19:17:34:ERROR:WU01:FS01:Exception: Could not get an assignment
18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)
18:03:07:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
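With this many near-identical lines it helps to tally the log by slot and by kind of failure, so the one real core failure does not get lost among the assignment retries. What follows is only a rough sketch in Python: the regular expression is inferred from the line shapes visible in this log, and the category names (`assignment`, `core_failure`, `network`) are made-up labels, not anything the client itself reports.

```python
import re
from collections import Counter

# Two line shapes appear in the log above (inferred from the excerpt):
#   client lines: HH:MM:SS:LEVEL:WUxx:FSxx:message
#   core lines:   HH:MM:SS:WUxx:FSxx:0x22:LEVEL:message
LINE = re.compile(
    r"^(?P<time>\d\d:\d\d:\d\d):"
    r"(?:(?P<level1>WARNING|ERROR):)?"
    r"(?P<wu>WU\d+):(?P<slot>FS\d+):"
    r"(?:0x[0-9a-fA-F]+:)?"
    r"(?:(?P<level2>WARNING|ERROR):)?"
    r"(?P<msg>.*)$"
)


def classify(msg):
    """Sort a message into a hypothetical category based on its text."""
    if "Could not get an assignment" in msg or "No WUs available" in msg:
        return "assignment"
    if "clEnqueueNDRangeKernel" in msg or "FahCore returned" in msg:
        return "core_failure"
    if "Failed to connect" in msg or "connection failed" in msg:
        return "network"
    return "other"


def tally(lines):
    """Count WARNING/ERROR lines per (slot, category)."""
    counts = Counter()
    for line in lines:
        m = LINE.match(line.strip())
        if not m:
            continue
        level = m.group("level1") or m.group("level2")
        if level is None:
            continue  # informational line, not a warning or error
        counts[(m.group("slot"), classify(m.group("msg")))] += 1
    return counts


# A few lines copied from the log above.
SAMPLE = [
    "18:03:08:ERROR:WU01:FS01:Exception: Could not get an assignment",
    "18:03:09:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration",
    "18:08:44:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80",
    "18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)",
    "18:03:07:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)",
]

if __name__ == "__main__":
    for (slot, kind), n in sorted(tally(SAMPLE).items()):
        print(slot, kind, n)
```

To run it against a whole log, pass an open file to `tally` instead of `SAMPLE`. Anything the pattern does not recognise is simply skipped.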

I was running dual NVIDIA Quadro K4000s and never ran into any issues. I love AMD, but I'm starting to get a little sick of them.

Just installed 20.4.1
Do you even fold bro?
Rel25917
Posts: 303
Joined: Wed Aug 15, 2012 2:31 am

Re: Failed WU's on AMD GPU's

Post by Rel25917 »

If both cards work fine alone but start failing when used together, my first thought would be to check whether the power supply is having trouble under the load of both cards. It could also be a heat issue; I'm not sure how well AMD cards handle too much heat, as I'm an NVIDIA guy myself.
ipkh
Posts: 173
Joined: Thu Jul 16, 2015 2:03 pm

Re: Failed WU's on AMD GPU's

Post by ipkh »

I have no idea why FAH keeps doing this, but GPU 0 is OpenCL 1 and vice versa. Try manually configuring the slots as Slot 1: GPU 0, OpenCL 1 and Slot 2: GPU 1, OpenCL 0.
The log seems to indicate that it is trying GPU 1 as OpenCL 1, which isn't the same GPU.
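For anyone who wants to see what that swap might look like in the client's configuration, here is a small Python sketch that prints the two slot entries. It assumes the v7 client's config.xml takes per-slot options named `gpu-index` and `opencl-index`, each with a `v` attribute; check that against your own config.xml or the slot's extra options in FAHControl before copying anything. The helper name `slot_xml` is made up for this example.

```python
import xml.etree.ElementTree as ET


def slot_xml(slot_id, gpu_index, opencl_index):
    """Build one GPU slot element (assumed option names, see note above)."""
    slot = ET.Element("slot", {"id": str(slot_id), "type": "GPU"})
    ET.SubElement(slot, "gpu-index", {"v": str(gpu_index)})
    ET.SubElement(slot, "opencl-index", {"v": str(opencl_index)})
    return ET.tostring(slot, encoding="unicode")


# The crossed mapping suggested above: (slot id, GPU index, OpenCL index).
SUGGESTED = [(1, 0, 1), (2, 1, 0)]

if __name__ == "__main__":
    for slot_id, gpu, opencl in SUGGESTED:
        print(slot_xml(slot_id, gpu, opencl))
```

Stop the client before editing config.xml by hand, and keep a copy of the original file so the change is easy to undo.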
NuovaApe
Posts: 53
Joined: Mon Jun 17, 2019 12:49 pm

Re: Failed WU's on AMD GPU's

Post by NuovaApe »

FireFox-89 wrote:

Code: Select all

*********************** Log Started 2020-04-02T14:42:12Z ***********************
14:42:13:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)
19:17:34:ERROR:WU01:FS01:Exception: Could not get an assignment
I was running dual NVIDIA Quadro K4000s and never ran into any issues. I love AMD, but I'm starting to get a little sick of them.
Just installed 20.4.1
Keep loving AMD, because these issues have nothing to do with them ;-)

COVID-19 has brought a huge surge in FAH helpers. "No WUs available" means there is no work waiting on the FAH servers, because thousands of new helpers are downloading slices of work faster than these genius boffins can upload them.

I can rarely get any CPU or GPU work to do right now. I'm stuck on the substitute bench waiting my turn, but hey - the team is winning!
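The retry times in the log earlier in the thread fit this picture: the client keeps asking, but waits longer after each refusal. Here is a quick Python sketch that measures the gaps between the failed attempts. The timestamps are copied from the ERROR lines for WU01 in that log; the function name `attempt_gaps` is just made up for the example, and it does not handle a log that runs past midnight.

```python
from datetime import datetime


def attempt_gaps(timestamps):
    """Return the seconds between consecutive HH:MM:SS timestamps."""
    times = [datetime.strptime(t, "%H:%M:%S") for t in timestamps]
    return [int((b - a).total_seconds()) for a, b in zip(times, times[1:])]


# Times of the failed assignment attempts for WU01, taken from the log.
ATTEMPTS = [
    "18:03:09", "18:04:09", "18:05:46", "18:09:05", "18:12:38",
    "18:19:29", "18:30:35", "18:49:14", "19:17:34",
]

if __name__ == "__main__":
    for start, gap in zip(ATTEMPTS, attempt_gaps(ATTEMPTS)):
        print(f"after {start}: waited {gap} s")
```

The gaps grow from about a minute to nearly half an hour, which looks like the client backing off rather than anything wrong with the hardware or driver.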
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Failed WU's on AMD GPU's

Post by bruce »

The error "Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)" is a known AMD driver bug. I wish we could tell you an ETA for when it will be fixed, but you probably know more about their history of driver updates than I do. In the meantime, our only option is to block assignments to devices that are likely to run into this error. :(

Yes, I know that's a pretty drastic step. Only Navi is working right now.
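As a side note on reading that message: the number in brackets is the status code returned by the OpenCL call named just before it, and -5 is `CL_OUT_OF_RESOURCES` in the standard OpenCL headers. The Python sketch below pulls the call name and code out of a log line. The table only covers the first few codes, and the helper name `decode_cl_error` is invented for the example. The code on its own is generic, so it does not prove which driver bug is behind it.

```python
import re

# First few status codes from the OpenCL headers (partial table).
OPENCL_ERRORS = {
    0: "CL_SUCCESS",
    -1: "CL_DEVICE_NOT_FOUND",
    -2: "CL_DEVICE_NOT_AVAILABLE",
    -3: "CL_COMPILER_NOT_AVAILABLE",
    -4: "CL_MEM_OBJECT_ALLOCATION_FAILURE",
    -5: "CL_OUT_OF_RESOURCES",
    -6: "CL_OUT_OF_HOST_MEMORY",
}

# Matches e.g. "clEnqueueNDRangeKernel (-5)".
CL_CALL = re.compile(r"(cl[A-Z]\w+) \((-?\d+)\)")


def decode_cl_error(line):
    """Return (call, code, name) for a log line, or None if no match."""
    m = CL_CALL.search(line)
    if not m:
        return None
    code = int(m.group(2))
    return m.group(1), code, OPENCL_ERRORS.get(code, "unknown")


if __name__ == "__main__":
    line = ("18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel "
            "sortShortList: clEnqueueNDRangeKernel (-5)")
    print(decode_cl_error(line))
```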
FireFox-89
Posts: 72
Joined: Wed Nov 27, 2019 9:18 pm

Re: Failed WU's on AMD GPU's

Post by FireFox-89 »

bruce wrote: The error "Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)" is a known AMD driver bug. I wish we could tell you an ETA for when it will be fixed, but you probably know more about their history of driver updates than I do. In the meantime, our only option is to block assignments to devices that are likely to run into this error. :(

Yes, I know that's a pretty drastic step. Only Navi is working right now.
Both cards seemed to play nicely last night after updating to the 20.4.1 beta, without a single failed WU, but this morning I checked the log and found a few errors that I haven't seen before.

Code: Select all

07:04:40:WARNING:WU03:FS02:FahCore returned an unknown error code which probably indicates that it crashed
07:04:40:WARNING:WU03:FS02:FahCore returned: UNKNOWN_ENUM (-1073740940 = 0xc0000374)
Since it's looking hit and miss whether shit works or not, I think the best thing I can do is keep them running, keep checking AMD's site for driver releases, and examine the changelogs.
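On the new error: the client prints the core's exit code as a signed 32-bit number alongside the hex form. On Windows, 0xC0000374 is the NTSTATUS value `STATUS_HEAP_CORRUPTION`, which suggests the core process crashed rather than returning one of its own result codes. A small Python sketch of the conversion follows; the lookup table holds just two well-known values and the function names are made up for the example.

```python
# Two well-known Windows NTSTATUS values (partial table).
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC0000374: "STATUS_HEAP_CORRUPTION",
}


def to_unsigned32(code):
    """Reinterpret a signed 32-bit exit code as unsigned."""
    return code & 0xFFFFFFFF


def describe_exit(code):
    """Return (hex string, name) for an exit code from the log."""
    value = to_unsigned32(code)
    return f"0x{value:08x}", KNOWN_NTSTATUS.get(value, "unknown")


if __name__ == "__main__":
    print(describe_exit(-1073740940))  # the UNKNOWN_ENUM line above
    print(describe_exit(114))          # BAD_WORK_UNIT, a core result code
```

The earlier `BAD_WORK_UNIT (114 = 0x72)` is a small positive number that the client recognises by name, whereas this one falls outside that list, hence `UNKNOWN_ENUM`.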
Rel25917 wrote: If both cards work fine alone but start failing when used together, my first thought would be to check whether the power supply is having trouble under the load of both cards. It could also be a heat issue; I'm not sure how well AMD cards handle too much heat, as I'm an NVIDIA guy myself.
The top card (RX 590) is mostly between 58 and 62 degrees, and the bottom card (RX 580) is usually about 4-5 degrees cooler. I also thought about the PSU, as it is getting on a bit now, but it's a SeaSonic M12II 850w 80+ Bronze. It recently had a pair of MSI GeForce GTX 780 Lightnings hanging off it and powered both of them at the same time, and those things suck the power quicker than an Irishman on Guinness :)
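For a rough sanity check on the PSU, the back-of-envelope sum can be written out in Python. Every wattage here is an assumed nominal figure (reference board power for the cards, rated TDP for the CPU, and a guess for everything else), not a measurement from this machine, and factory-overclocked cards such as the Nitro+ can draw more.

```python
# Nominal figures in watts. All of these are assumptions for illustration;
# swap in measured values if you have a wall meter.
NOMINAL_WATTS = {
    "RX 590": 225,
    "RX 580": 185,
    "i7-2600K": 95,
    "board, RAM, drives, fans": 75,
}
PSU_WATTS = 850  # rated output of the SeaSonic M12II mentioned above


def load_fraction(parts, psu_watts):
    """Fraction of the PSU rating used by the summed nominal draw."""
    return sum(parts.values()) / psu_watts


if __name__ == "__main__":
    total = sum(NOMINAL_WATTS.values())
    print(f"estimated draw: {total} W of {PSU_WATTS} W "
          f"({load_fraction(NOMINAL_WATTS, PSU_WATTS):.0%})")
```

On those assumptions the estimate is 580 W, roughly two thirds of the rating, so the supply should have headroom on paper. An ageing unit can still misbehave, which a sum like this cannot show.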

Anyway thanks guys for the assistance <3

EDIT:

All 3 slots seem to be working together nicely now without having a full blown argument, hopefully things will be better now.
