Page 1 of 1

Failed WU's on AMD GPU's

Posted: Thu Apr 02, 2020 12:01 pm
by FireFox-89
Hi Guys,

I think I have issues with my GPU drivers (AMD and annoying drivers, who would have thought it) but I'm not 100% sure that is the issue so I need to start somewhere. The PC having the issues has a Core i7-2600K @ stock on a Gigabyte Z77X-D3H board, Sapphire RX 590 Nitro+ as the primary GPU and a Sapphire RX 580 Nitro+ running on driver version 20.3.1 although I was running on 20.2.2 and also tried 19.9.2 but I'm getting the same issues. With each driver reinstall I first uninstalled it with DisplayDriverUninstaller to make sure I got all of the remnants of the previous install.

Code: Select all

*********************** Log Started 2020-04-02T10:22:46Z ***********************
10:22:47:ERROR:FS02:OpenCL GPU not found for 'opencl-index' = 2 with vendor ID = 0x1002, please correct this by removing the manually configured 'opencl-index' option.
10:23:14:WARNING:WU00:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:23:15:WARNING:WU00:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:23:15:ERROR:WU00:FS02:Exception: Could not get an assignment
10:23:29:WU00:FS02:0x22:ERROR:exception: Illegal value for DeviceIndex: 2
10:23:30:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
10:23:30:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:23:30:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:23:30:ERROR:WU01:FS02:Exception: Could not get an assignment
10:23:31:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:23:31:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:23:31:ERROR:WU01:FS02:Exception: Could not get an assignment
10:24:52:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
10:25:13:ERROR:WU01:FS02:Exception: Failed to connect to 40.114.52.201:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
10:26:08:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:26:08:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:26:08:ERROR:WU01:FS02:Exception: Could not get an assignment
10:28:46:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:28:46:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:28:46:ERROR:WU01:FS02:Exception: Could not get an assignment
10:33:00:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:33:00:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:33:00:ERROR:WU01:FS02:Exception: Could not get an assignment
10:39:51:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:39:51:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:39:51:ERROR:WU01:FS02:Exception: Could not get an assignment
10:50:57:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
10:50:57:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
10:50:57:ERROR:WU01:FS02:Exception: Could not get an assignment
11:08:54:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
11:08:54:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
11:08:54:ERROR:WU01:FS02:Exception: Could not get an assignment
11:37:56:WARNING:WU01:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
11:37:56:WARNING:WU01:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
11:37:56:ERROR:WU01:FS02:Exception: Could not get an assignment
When I'm folding on either the 590 or 580 it works perfectly fine but when they are both active together either one of them seems to fail work units and I don't know enough about this to dig further without assistance. I don't think I have a hardware related issue because I've stressed each card individually for multiple hours on Heaven Benchmark, Furmark and also AIDA64 Stability Test with no artifacting, crashes or temperature weirdness.

Any help would be greatly appreciated :)

Re: Failed WU's on AMD GPU's

Posted: Thu Apr 02, 2020 2:10 pm
by Neil-B
Can you post the log header (200 lines or so) this may help people identify what is occurring

Re: Failed WU's on AMD GPU's

Posted: Thu Apr 02, 2020 2:46 pm
by FireFox-89
Neil-B wrote:Can you post the log header (200 lines or so) this may help people identify what is occurring
Ahh no worries, hopefully this is it. Just getting to grips with the log so apologies in advance for noobness :)

Code: Select all

*********************** Log Started 2020-04-02T14:42:12Z ***********************
14:42:12:************************* Folding@home Client *************************
14:42:12:        Website: https://foldingathome.org/
14:42:12:      Copyright: (c) 2009-2018 foldingathome.org
14:42:12:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:42:12:           Args: --open-web-control
14:42:12:         Config: C:\Users\Luna\AppData\Roaming\FAHClient\config.xml
14:42:12:******************************** Build ********************************
14:42:12:        Version: 7.5.1
14:42:12:           Date: May 11 2018
14:42:12:           Time: 13:06:32
14:42:12:     Repository: Git
14:42:12:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
14:42:12:         Branch: master
14:42:12:       Compiler: Visual C++ 2008
14:42:12:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:42:12:       Platform: win32 10
14:42:12:           Bits: 32
14:42:12:           Mode: Release
14:42:12:******************************* System ********************************
14:42:12:            CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
14:42:12:         CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
14:42:12:           CPUs: 8
14:42:12:         Memory: 15.96GiB
14:42:12:    Free Memory: 12.96GiB
14:42:12:        Threads: WINDOWS_THREADS
14:42:12:     OS Version: 6.2
14:42:12:    Has Battery: false
14:42:12:     On Battery: false
14:42:12:     UTC Offset: 1
14:42:12:            PID: 7912
14:42:12:            CWD: C:\Users\Luna\AppData\Roaming\FAHClient
14:42:12:             OS: Windows 10 Enterprise
14:42:12:        OS Arch: AMD64
14:42:12:           GPUs: 2
14:42:12:          GPU 0: Bus:2 Slot:0 Func:0 AMD:5 Ellesmere XT [Radeon RX
14:42:12:                 470/480/570/580/590]
14:42:12:          GPU 1: Bus:1 Slot:0 Func:0 AMD:5 Ellesmere XT [Radeon RX
14:42:12:                 470/480/570/580/590]
14:42:12:           CUDA: Not detected: Failed to open dynamic library 'nvcuda.dll': The
14:42:12:                 specified module could not be found.
14:42:12:
14:42:12:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:3004.8
14:42:12:OpenCL Device 1: Platform:0 Device:1 Bus:2 Slot:0 Compute:1.2 Driver:3004.8
14:42:12:  Win32 Service: false
14:42:12:***********************************************************************
14:42:12:<config>
14:42:12:  <!-- Folding Core -->
14:42:12:  <checkpoint v='10'/>
14:42:12:
14:42:12:  <!-- Network -->
14:42:12:  <proxy v=':8080'/>
14:42:12:
14:42:12:  <!-- Slot Control -->
14:42:12:  <pause-on-battery v='false'/>
14:42:12:
14:42:12:  <!-- User Information -->
14:42:12:  <passkey v='********************************'/>
14:42:12:  <team v='235495'/>
14:42:12:  <user v='Luna'/>
14:42:12:
14:42:12:  <!-- Folding Slots -->
14:42:12:</config>
14:42:12:Trying to access database...
14:42:12:Successfully acquired database lock
14:42:12:Enabled folding slot 00: READY cpu:5
14:42:12:Enabled folding slot 01: READY gpu:0:Ellesmere XT [Radeon RX 470/480/570/580/590]
14:42:12:Enabled folding slot 02: READY gpu:1:Ellesmere XT [Radeon RX 470/480/570/580/590]
14:42:13:WU00:FS00:Connecting to 65.254.110.245:8080
14:42:13:WU01:FS01:Connecting to 65.254.110.245:8080
14:42:13:WU02:FS02:Connecting to 65.254.110.245:8080
14:42:13:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WU00:FS00:Connecting to 18.218.241.186:80
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WU02:FS02:Connecting to 18.218.241.186:80
14:42:13:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WU01:FS01:Connecting to 18.218.241.186:80
14:42:13:WU00:FS00:Assigned to work server 128.252.203.4
14:42:13:WU00:FS00:Requesting new work unit for slot 00: READY cpu:5 from 128.252.203.4
14:42:13:WU00:FS00:Connecting to 128.252.203.4:8080
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:13:ERROR:WU02:FS02:Exception: Could not get an assignment
14:42:14:WU02:FS02:Connecting to 65.254.110.245:8080
14:42:14:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU01:FS01:Exception: Could not get an assignment
14:42:14:WU01:FS01:Connecting to 65.254.110.245:8080
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:14:WU02:FS02:Connecting to 18.218.241.186:80
14:42:14:WU01:FS01:Assigned to work server 140.163.4.231
14:42:14:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:Ellesmere XT [Radeon RX 470/480/570/580/590] from 140.163.4.231
14:42:14:WU01:FS01:Connecting to 140.163.4.231:8080
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:42:14:WU00:FS00:Downloading 4.36MiB
14:42:18:WU00:FS00:Download complete
14:42:18:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13840 run:0 clone:2329 gen:24 core:0xa7 unit:0x0000001980fccb045e6ee1f49451cfbb
14:42:18:WU00:FS00:Starting
14:42:18:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Luna\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 705 -lifeline 7912 -checkpoint 10 -np 5
14:42:18:WU00:FS00:Started FahCore on PID 7256
14:42:18:WU00:FS00:Core PID:5012
14:42:18:WU00:FS00:FahCore 0xa7 started
14:42:18:8:127.0.0.1:New Web connection
14:42:18:WU00:FS00:0xa7:*********************** Log Started 2020-04-02T14:42:18Z ***********************
14:42:18:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
14:42:18:WU00:FS00:0xa7:       Type: 0xa7
14:42:18:WU00:FS00:0xa7:       Core: Gromacs
14:42:18:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 705 -lifeline 7256 -checkpoint 10 -np 5
14:42:18:WU00:FS00:0xa7:************************************ CBang *************************************
14:42:18:WU00:FS00:0xa7:       Date: Oct 26 2019
14:42:18:WU00:FS00:0xa7:       Time: 01:38:25
14:42:18:WU00:FS00:0xa7:   Revision: c46a1a011a24143739ac7218c5a435f66777f62f
14:42:18:WU00:FS00:0xa7:     Branch: master
14:42:18:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
14:42:18:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:42:18:WU00:FS00:0xa7:   Platform: win32 10
14:42:18:WU00:FS00:0xa7:       Bits: 64
14:42:18:WU00:FS00:0xa7:       Mode: Release
14:42:18:WU00:FS00:0xa7:************************************ System ************************************
14:42:18:WU00:FS00:0xa7:        CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
14:42:18:WU00:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
14:42:18:WU00:FS00:0xa7:       CPUs: 8
14:42:18:WU00:FS00:0xa7:     Memory: 15.96GiB
14:42:18:WU00:FS00:0xa7:Free Memory: 12.38GiB
14:42:18:WU00:FS00:0xa7:    Threads: WINDOWS_THREADS
14:42:18:WU00:FS00:0xa7: OS Version: 6.2
14:42:18:WU00:FS00:0xa7:Has Battery: false
14:42:18:WU00:FS00:0xa7: On Battery: false
14:42:18:WU00:FS00:0xa7: UTC Offset: 1
14:42:18:WU00:FS00:0xa7:        PID: 5012
14:42:18:WU00:FS00:0xa7:        CWD: C:\Users\Luna\AppData\Roaming\FAHClient\work
14:42:18:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
14:42:18:WU00:FS00:0xa7:    Version: 0.0.18
14:42:18:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:42:18:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
14:42:18:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
14:42:18:WU00:FS00:0xa7:       Date: Oct 26 2019
14:42:18:WU00:FS00:0xa7:       Time: 01:52:30
14:42:18:WU00:FS00:0xa7:   Revision: c1e3513b1bc0c16013668f2173ee969e5995b38e
14:42:18:WU00:FS00:0xa7:     Branch: master
14:42:18:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
14:42:18:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:42:18:WU00:FS00:0xa7:   Platform: win32 10
14:42:18:WU00:FS00:0xa7:       Bits: 64
14:42:18:WU00:FS00:0xa7:       Mode: Release
14:42:18:WU00:FS00:0xa7:************************************ Build *************************************
14:42:18:WU00:FS00:0xa7:       SIMD: avx_256
14:42:18:WU00:FS00:0xa7:********************************************************************************
14:42:18:WU00:FS00:0xa7:Project: 13840 (Run 0, Clone 2329, Gen 24)
14:42:18:WU00:FS00:0xa7:Unit: 0x0000001980fccb045e6ee1f49451cfbb
14:42:18:WU00:FS00:0xa7:Reading tar file core.xml
14:42:18:WU00:FS00:0xa7:Reading tar file frame24.tpr
14:42:18:WU00:FS00:0xa7:Digital signatures verified
14:42:18:WU00:FS00:0xa7:Reducing thread count from 5 to 4 to avoid domain decomposition by a prime number > 3
14:42:18:WU00:FS00:0xa7:Calling: mdrun -s frame24.tpr -o frame24.trr -x frame24.xtc -e frame24.edr -cpt 10 -nt 4
14:42:18:WU00:FS00:0xa7:Steps: first=3000000 total=125000
14:42:21:WU00:FS00:0xa7:Completed 1 out of 125000 steps (0%)
14:43:02:WU01:FS01:Downloading 7.85MiB
14:43:08:WU01:FS01:Download 53.33%
14:43:13:Removing old file 'configs/config-20200402-091851.xml'
14:43:13:Saving configuration to config.xml
14:43:13:<config>
14:43:13:  <!-- Folding Core -->
14:43:13:  <checkpoint v='10'/>
14:43:13:
14:43:13:  <!-- Network -->
14:43:13:  <proxy v=':8080'/>
14:43:13:
14:43:13:  <!-- Slot Control -->
14:43:13:  <pause-on-battery v='false'/>
14:43:13:
14:43:13:  <!-- User Information -->
14:43:13:  <passkey v='********************************'/>
14:43:13:  <team v='235495'/>
14:43:13:  <user v='Luna'/>
14:43:13:
14:43:13:  <!-- Folding Slots -->
14:43:13:  <slot id='0' type='CPU'/>
14:43:13:  <slot id='1' type='GPU'/>
14:43:13:  <slot id='2' type='GPU'/>
14:43:13:</config>
14:43:13:WU01:FS01:Download complete
14:43:13:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:11750 run:0 clone:195 gen:14 core:0x22 unit:0x0000001e8ca304e75e6a801938779c25
14:43:13:WU01:FS01:Starting
14:43:13:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Luna\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 01 -suffix 01 -version 705 -lifeline 7912 -checkpoint 10 -gpu-vendor amd -opencl-platform 0 -opencl-device 1 -gpu 1
14:43:14:WU01:FS01:Started FahCore on PID 4144
14:43:14:WU01:FS01:Core PID:4596
14:43:14:WU01:FS01:FahCore 0x22 started
14:43:14:WU02:FS02:Connecting to 65.254.110.245:8080
14:43:14:WU01:FS01:0x22:*********************** Log Started 2020-04-02T14:43:14Z ***********************
14:43:14:WU01:FS01:0x22:*************************** Core22 Folding@home Core ***************************
14:43:14:WU01:FS01:0x22:       Type: 0x22
14:43:14:WU01:FS01:0x22:       Core: Core22
14:43:14:WU01:FS01:0x22:    Website: https://foldingathome.org/
14:43:14:WU01:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
14:43:14:WU01:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
14:43:14:WU01:FS01:0x22:             <rafal.wiewiora@choderalab.org>
14:43:14:WU01:FS01:0x22:       Args: -dir 01 -suffix 01 -version 705 -lifeline 4144 -checkpoint 10
14:43:14:WU01:FS01:0x22:             -gpu-vendor amd -opencl-platform 0 -opencl-device 1 -gpu 1
14:43:14:WU01:FS01:0x22:     Config: <none>
14:43:14:WU01:FS01:0x22:************************************ Build *************************************
14:43:14:WU01:FS01:0x22:    Version: 0.0.2
14:43:14:WU01:FS01:0x22:       Date: Dec 6 2019
14:43:14:WU01:FS01:0x22:       Time: 21:30:31
14:43:14:WU01:FS01:0x22: Repository: Git
14:43:14:WU01:FS01:0x22:   Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
14:43:14:WU01:FS01:0x22:     Branch: HEAD
14:43:14:WU01:FS01:0x22:   Compiler: Visual C++ 2008
14:43:14:WU01:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:43:14:WU01:FS01:0x22:   Platform: win32 10
14:43:14:WU01:FS01:0x22:       Bits: 64
14:43:14:WU01:FS01:0x22:       Mode: Release
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:43:14:WU01:FS01:0x22:************************************ System ************************************
14:43:14:WU02:FS02:Connecting to 18.218.241.186:80
14:43:14:WU01:FS01:0x22:        CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
14:43:14:WU01:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
14:43:14:WU01:FS01:0x22:       CPUs: 8
14:43:14:WU01:FS01:0x22:     Memory: 15.96GiB
14:43:14:WU01:FS01:0x22:Free Memory: 12.09GiB
14:43:14:WU01:FS01:0x22:    Threads: WINDOWS_THREADS
14:43:14:WU01:FS01:0x22: OS Version: 6.2
14:43:14:WU01:FS01:0x22:Has Battery: false
14:43:14:WU01:FS01:0x22: On Battery: false
14:43:14:WU01:FS01:0x22: UTC Offset: 1
14:43:14:WU01:FS01:0x22:        PID: 4596
14:43:14:WU01:FS01:0x22:        CWD: C:\Users\Luna\AppData\Roaming\FAHClient\work
14:43:14:WU01:FS01:0x22:         OS: Windows 10 Pro
14:43:14:WU01:FS01:0x22:    OS Arch: AMD64
14:43:14:WU01:FS01:0x22:********************************************************************************
14:43:14:WU01:FS01:0x22:Project: 11750 (Run 0, Clone 195, Gen 14)
14:43:14:WU01:FS01:0x22:Unit: 0x0000001e8ca304e75e6a801938779c25
14:43:14:WU01:FS01:0x22:Reading tar file core.xml
14:43:14:WU01:FS01:0x22:Reading tar file integrator.xml
14:43:14:WU01:FS01:0x22:Reading tar file state.xml
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:43:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:43:16:WU01:FS01:0x22:Reading tar file system.xml
14:43:16:WU01:FS01:0x22:Digital signatures verified
14:43:16:WU01:FS01:0x22:Folding@home GPU Core22 Folding@home Core
14:43:16:WU01:FS01:0x22:Version 0.0.2

Re: Failed WU's on AMD GPU's

Posted: Thu Apr 02, 2020 4:33 pm
by toTOW
Everything look right in the log you posted ...

Re: Failed WU's on AMD GPU's

Posted: Thu Apr 02, 2020 4:35 pm
by Neil-B
I am really not the expert on this but the first log was showing OpenCL-index set to 2 which may not exist … you may want to look in the configurations for each GPU slut (using advanced control) my gut is telling me one clot should be calling OpenCL index "0" and the other "1" but I wouldn't know which … it may be better to wait for some more knowledgeable input than mine .. obviously looking (by editing but not changing) probably can't do anything bad :)

argh just seen toTOW's post - and he knows more than I so I may be miss-advising - Sorry

Re: Failed WU's on AMD GPU's

Posted: Thu Apr 02, 2020 7:17 pm
by FireFox-89
toTOW wrote:Everything look right in the log you posted ...
I restarted the program which restarted the log, I'm leaving both slots running and waiting for the errors again then will post back here. Also noticed that the OpenCL wasn't set to -1 so set them again, not sure what I was trying to do there but I was getting errors before playing with settings that I don't fully understand yet.

EDIT: Got another error here, just found it in the log.

Code: Select all

*********************** Log Started 2020-04-02T14:42:12Z ***********************
14:42:13:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:13:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:13:ERROR:WU02:FS02:Exception: Could not get an assignment
14:42:14:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU01:FS01:Exception: Could not get an assignment
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:42:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:42:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:43:14:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:43:14:ERROR:WU02:FS02:Exception: Could not get an assignment
14:44:51:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:44:52:WARNING:WU02:FS02:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
14:44:52:ERROR:WU02:FS02:Exception: Could not get an assignment
14:47:28:WARNING:WU02:FS02:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
14:47:50:WARNING:WU02:FS02:WorkServer connection failed on port 8080 trying 80
14:48:11:ERROR:WU02:FS02:Exception: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
17:43:29:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:43:30:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:43:30:ERROR:WU03:FS01:Exception: Could not get an assignment
17:43:30:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:43:30:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:43:30:ERROR:WU03:FS01:Exception: Could not get an assignment
17:44:31:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:44:31:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:44:31:ERROR:WU03:FS01:Exception: Could not get an assignment
17:46:08:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:46:08:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:46:08:ERROR:WU03:FS01:Exception: Could not get an assignment
17:48:45:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:48:45:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:48:45:ERROR:WU03:FS01:Exception: Could not get an assignment
17:52:59:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:52:59:WARNING:WU03:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
17:52:59:ERROR:WU03:FS01:Exception: Could not get an assignment
17:59:51:WARNING:WU03:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:00:12:WARNING:WU03:FS01:WorkServer connection failed on port 8080 trying 80
18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)
18:03:07:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
18:03:08:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:03:08:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:03:08:ERROR:WU01:FS01:Exception: Could not get an assignment
18:03:09:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:03:09:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:03:09:ERROR:WU01:FS01:Exception: Could not get an assignment
18:04:09:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:04:09:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:04:09:ERROR:WU01:FS01:Exception: Could not get an assignment
18:05:46:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:05:46:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:05:46:ERROR:WU01:FS01:Exception: Could not get an assignment
18:08:44:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
18:09:05:ERROR:WU01:FS01:Exception: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
18:12:37:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:12:38:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:12:38:ERROR:WU01:FS01:Exception: Could not get an assignment
18:19:29:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:19:29:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:19:29:ERROR:WU01:FS01:Exception: Could not get an assignment
18:30:34:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:30:35:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:30:35:ERROR:WU01:FS01:Exception: Could not get an assignment
18:48:31:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:49:14:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
18:49:14:ERROR:WU01:FS01:Exception: Could not get an assignment
19:17:33:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
19:17:34:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
19:17:34:ERROR:WU01:FS01:Exception: Could not get an assignment
''18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)''
''18:03:07:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)''

I was running dual NVIDIA Quadro K4000's and never ran into any issues, I love AMD but I'm starting to get a little sick of them.

Just installed 20.4.1

Re: Failed WU's on AMD GPU's

Posted: Thu Apr 02, 2020 10:55 pm
by Rel25917
If both work fine alone but start failing when used together my first thought would be to check if the power supply is having trouble under the load of both cards. Or maybe heat issues, not sure how well AMD cards handle to much heat, I'm a nvidia guy myself.

Re: Failed WU's on AMD GPU's

Posted: Fri Apr 03, 2020 12:03 am
by ipkh
I have no idea why fah keeps doing this, but GPU 0 is OpenCL 1 and vice versa. Try manually configuring the slots to Slot 1 GPU 0 OpenCL 1, Slot 2 GPU 1 OpenCL 0.
The log seems to indicate it trying GPU 1 as OpenCL 1 which isn't the same GPU.

Re: Failed WU's on AMD GPU's

Posted: Fri Apr 03, 2020 1:22 am
by NuovaApe
FireFox-89 wrote:

Code: Select all

*********************** Log Started 2020-04-02T14:42:12Z ***********************
14:42:13:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
18:03:06:WU03:FS01:0x22:ERROR:exception: Error invoking kernel sortShortList: clEnqueueNDRangeKernel (-5)
19:17:34:ERROR:WU01:FS01:Exception: Could not get an assignment
I was running dual NVIDIA Quadro K4000's and never ran into any issues, I love AMD but I'm starting to get a little sick of them.
Just installed 20.4.1
Keep loving AMD cos these issues nothing to do with them;-)

CV19 has seen a huge surge in FAH helpers. "No WUs available" means there's no work pending to be done on the FAH servers, because there are 1000's of new helpers downloading slices of work to do faster than these genius boffins can upload them.

I can rarely get any CPU or GPU work to do right now. I'm stuck on the substitute bench waiting my turn, but hey - the team is winning!

Re: Failed WU's on AMD GPU's

Posted: Fri Apr 03, 2020 2:38 am
by bruce
The Error invoking kernel "sortShortList: clEnqueueNDRangeKernel (-5)'' is a known AMD driver bug. I wish we could tell you an ETA when it will be fixed, but you probably know more about their history of driver updates than I do. In the meantime, our only option is to block assignments to devices that are likely to run into this error. :(

Yes, I know that's a pretty drastic step. Only Navi is working right now.

Re: Failed WU's on AMD GPU's

Posted: Fri Apr 03, 2020 9:47 am
by FireFox-89
bruce wrote:The Error invoking kernel "sortShortList: clEnqueueNDRangeKernel (-5)'' is a known AMD driver bug. I wish we could tell you an ETA when it will be fixed, but you probably know more about their history of driver updates than I do. In the meantime, our only option is to block assignments to devices that are likely to run into this error. :(

Yes, I know that's a pretty drastic step. Only Navi is working right now.
Both cards seemed to play nicely last night since updating to BETA 20.4.1 without a failed WU but this morning checked the log and found a few in there that I haven't seen before.

Code: Select all

07:04:40:WARNING:WU03:FS02:FahCore returned an unknown error code which probably indicates that it crashed
07:04:40:WARNING:WU03:FS02:FahCore returned: UNKNOWN_ENUM (-1073740940 = 0xc0000374)
I think since it is looking like hit and miss on whether shit works or not I think the best thing I can do is keep them running and keep checking AMD's site for driver released and examining the changelog.
Rel25917 wrote:If both work fine alone but start failing when used together my first thought would be to check if the power supply is having trouble under the load of both cards. Or maybe heat issues, not sure how well AMD cards handle to much heat, I'm a nvidia guy myself.
The top card (RX590) is mostly between 58-62 degrees and the bottom card (RX580) is usually about 4-5 degrees cooler, I also thought PSU and it is getting on a bit now but it's a SeaSonic M12II 850w 80+ Bronze but recently had a pair of MSI GeForce GTX 780 Lightnings hanging off it and powered both of them together at the same time and those things suck the power quicker than an Irishman on Guiness :)

Anyway thanks guys for the assistance <3

EDIT:

All 3 slots seem to be working together nicely now without having a full blown argument, hopefully things will be better now.

Image