opencl-index issues

Posted: Mon Apr 30, 2018 8:22 am
by foldy
The problem is that no OpenCL device matching the slot was found:
06:14:38:ERROR:WU02:FS02:Failed to start core: OpenCL device matching slot 2 not found, try setting 'opencl-index' manually
The working config on your first machine looks like this:
05:16:24: <slot id='1' type='GPU'>
05:16:24: <client-type v='advanced'/>
05:16:24: <gpu-index v='0'/>
05:16:24: </slot>
05:16:24: <slot id='2' type='GPU'>
05:16:24: <client-type v='advanced'/>
05:16:24: <gpu-index v='1'/>
05:16:24: </slot>
But the failing config on your second machine looks like this:
06:14:17: <slot id='1' type='GPU'>
06:14:17: <client-type v='advanced'/>
06:14:17: <paused v='true'/>
06:14:17: </slot>
06:14:17: <slot id='2' type='GPU'>
06:14:17: <client-type v='advanced'/>
06:14:17: <paused v='true'/>
06:14:17: </slot>
The difference is that the second, failing config has no gpu-index set.

Can you try editing the GPU slots and adding gpu-index 0 and 1?
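
To spot this kind of gap without reading the config by eye, a short script can list the GPU slots that lack a gpu-index entry. This is only a sketch: the sample config is modelled on the slots quoted above, and the function name `gpu_slots_missing_index` is made up for illustration, not part of FAHClient.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: slot 1 has no gpu-index (like the failing machine),
# slot 2 has one (like the working machine).
SAMPLE_CONFIG = """
<config>
  <slot id='1' type='GPU'>
    <client-type v='advanced'/>
    <paused v='true'/>
  </slot>
  <slot id='2' type='GPU'>
    <client-type v='advanced'/>
    <gpu-index v='1'/>
  </slot>
</config>
"""


def gpu_slots_missing_index(config_xml):
    """Return the ids of GPU slots that have no <gpu-index> child element."""
    root = ET.fromstring(config_xml)
    missing = []
    for slot in root.iter("slot"):
        if slot.get("type") == "GPU" and slot.find("gpu-index") is None:
            missing.append(slot.get("id"))
    return missing


if __name__ == "__main__":
    # Slot ids that would need a gpu-index added by hand.
    print(gpu_slots_missing_index(SAMPLE_CONFIG))
```

To check a real installation, read the contents of config.xml into a string and pass that in place of the sample.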

Re: Failing to get WU today work server refuses [171.67.108.

Posted: Tue May 01, 2018 12:19 am
by Joe_H
Looking at the log for the second machine, which shows an error that it could not find devices matching the slot:
06:14:55:ERROR:WU01:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
it stirred an old memory of a problem that was occasionally reported in early testing of the beta client. The client would work well at configuring two or more GPU slots where the cards were different, for example a GTX 980 with a GTX 970 or an AMD card. But if they were the same, sometimes the client would not set up the slots properly and the workaround was to set the OpenCL indices manually. I don't recall the details, but perhaps someone who ran into that problem will have their memory jogged and post what worked for them to fix it.

Re: Failing to get WU today work server refuses [171.67.108.

Posted: Tue May 01, 2018 3:51 pm
by toTOW
You can still use the little tool called "clinfo", which prints all the information about your OpenCL installation. If more than one platform is detected, that is the issue.

Re: Failing to get WU today work server refuses [171.67.108.

Posted: Tue May 01, 2018 7:56 pm
by Nick200
Thanks for all the advice, which helped a lot. After re-installing the GeForce driver (for a third time), I am back to full folding on the machine which was having problems with folding on two GTX 1080 Ti cards.

I think the OpenCL problem on my main machine stemmed from an earlier driver update for my iGPU, which resulted in the machine hanging. The other two machines righted themselves eventually with no intervention from me, once the WS suddenly started delivering WUs.

I will double-check with clinfo on all 4 machines and see what's needed for the iGPUs. I tend to use and forget that bit of the PC. Lesson learnt.

Cheers

Nick

Re: Failing to get WU today work server refuses [171.67.108.

Posted: Wed May 02, 2018 7:29 am
by bruce
Joe_H wrote: Looking at the log for the second machine, which shows an error that it could not find devices matching the slot:
06:14:55:ERROR:WU01:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
it stirred an old memory of a problem that was occasionally reported in early testing of the beta client. The client would work well at configuring two or more GPU slots where the cards were different, for example a GTX 980 with a GTX 970 or an AMD card. But if they were the same, sometimes the client would not set up the slots properly and the workaround was to set the OpenCL indices manually. I don't recall the details, but perhaps someone who ran into that problem will have their memory jogged and post what worked for them to fix it.
I have a similar old memory, in that I reported the problem with two identical GPUs. From what I remember, when FAHClient is (re)started, it performs device detection and chooses defaults for any setting that has not already been set. With identical GPUs, either one might be detected first and assigned the first set of default values, and the other will be assigned the second set. That suggests a potential race condition in which the second set of values is processed first, so the remaining GPU is assigned a THIRD set of default values for which no GPU is available, even though the first set was never actually used. Manually locking in workable sets of values bypasses the potential race condition.

Later versions of FAHClient may have improved or fixed this problem.

Re: Failing to get WU today work server refuses [171.67.108.

Posted: Thu May 03, 2018 5:13 am
by Nick200
Well, I think I spoke too soon as one machine has started failing across both the GPU slots:
*********************** Log Started 2018-05-02T19:59:27Z ***********************
19:59:38:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:59:53:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:11:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:32:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:10:07:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:11:14:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:12:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:27:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:13:32:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:13:58:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:14:57:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:15:18:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
******************************* Date: 2018-05-03 *******************************
with the full log as follows:
*********************** Log Started 2018-05-02T19:59:27Z ***********************
19:59:27:ERROR:Exception: Failed to register systray icon: Unspecified error
19:59:38:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:59:45:WU03:FS02:0x21:ERROR:126: Bad platformId size.
19:59:45:WU02:FS01:0x21:ERROR:126: Bad platformId size.
19:59:46:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:59:46:WARNING:WU03:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:59:48:ERROR:WU04:FS02:Exception: Server did not assign work unit
19:59:53:WU00:FS01:0x21:ERROR:126: Bad platformId size.
19:59:53:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:01:WU02:FS01:0x21:ERROR:126: Bad platformId size.
20:00:01:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:10:WU00:FS01:0x21:ERROR:126: Bad platformId size.
20:00:11:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:23:WU04:FS02:0x21:ERROR:126: Bad platformId size.
20:00:23:WARNING:WU04:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:31:WU00:FS02:0x21:ERROR:126: Bad platformId size.
20:00:32:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:39:WU02:FS01:0x21:ERROR:126: Bad platformId size.
20:00:39:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:00:56:WU03:FS02:0x21:ERROR:126: Bad platformId size.
20:00:56:WARNING:WU03:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:01:06:WU02:FS02:0x21:ERROR:126: Bad platformId size.
20:01:07:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:10:06:WU00:FS01:0x21:ERROR:126: Bad platformId size.
20:10:07:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:10:17:WU03:FS01:0x21:ERROR:126: Bad platformId size.
20:10:17:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:10:59:WU02:FS02:0x21:ERROR:126: Bad platformId size.
20:10:59:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:11:14:WU00:FS01:0x21:ERROR:126: Bad platformId size.
20:11:14:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:02:WU03:FS02:0x21:ERROR:126: Bad platformId size.
20:12:03:WARNING:WU03:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:05:WU02:FS01:0x21:ERROR:126: Bad platformId size.
20:12:06:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:12:WU00:FS02:0x21:ERROR:126: Bad platformId size.
20:12:12:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:17:WU03:FS01:0x21:ERROR:126: Bad platformId size.
20:12:17:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:21:WU02:FS02:0x21:ERROR:126: Bad platformId size.
20:12:22:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:27:WU00:FS01:0x21:ERROR:126: Bad platformId size.
20:12:27:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:31:WU03:FS02:0x21:ERROR:126: Bad platformId size.
20:12:32:WARNING:WU03:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:36:WU02:FS01:0x21:ERROR:126: Bad platformId size.
20:12:36:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:12:47:WU03:FS01:0x21:ERROR:126: Bad platformId size.
20:12:48:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:13:32:WU00:FS02:0x21:ERROR:126: Bad platformId size.
20:13:32:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:13:44:WU02:FS01:0x21:ERROR:126: Bad platformId size.
20:13:44:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:13:57:WU00:FS01:0x21:ERROR:126: Bad platformId size.
20:13:58:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:14:10:WU03:FS02:0x21:ERROR:126: Bad platformId size.
20:14:10:WARNING:WU03:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:14:57:WU00:FS02:0x21:ERROR:126: Bad platformId size.
20:14:57:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:15:06:WU02:FS02:0x21:ERROR:126: Bad platformId size.
20:15:07:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:15:17:WU00:FS02:0x21:ERROR:126: Bad platformId size.
20:15:18:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
******************************* Date: 2018-05-03 *******************************
Here's the system report, which shows that it is back to full folding after downgrading the driver:

*********************** Log Started 2018-05-03T05:08:36Z ***********************
05:08:36:************************* Folding@home Client *************************
05:08:36:        Website: http://folding.stanford.edu/
05:08:36:      Copyright: (c) 2009-2016 Stanford University
05:08:36:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
05:08:36:           Args: --open-web-control
05:08:36:         Config: C:\Users\Nick Montague\AppData\Roaming\FAHClient\config.xml
05:08:36:******************************** Build ********************************
05:08:36:        Version: 7.4.16
05:08:36:           Date: Jan 6 2017
05:08:36:           Time: 00:25:14
05:08:36:     Repository: Git
05:08:36:       Revision: a9e9e27dc2ee6ff01398c439677bc27f6cb74032
05:08:36:         Branch: master
05:08:36:       Compiler: Visual C++ 2008
05:08:36:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox -arch:SSE /MT
05:08:36:       Platform: win32 10
05:08:36:           Bits: 32
05:08:36:           Mode: Release
05:08:36:******************************* System ********************************
05:08:36:            CPU: Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz
05:08:36:         CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
05:08:36:           CPUs: 8
05:08:36:         Memory: 15.89GiB
05:08:36:    Free Memory: 10.01GiB
05:08:36:        Threads: WINDOWS_THREADS
05:08:36:     OS Version: 6.2
05:08:36:    Has Battery: false
05:08:36:     On Battery: false
05:08:36:     UTC Offset: 12
05:08:36:            PID: 7496
05:08:36:            CWD: C:\Users\Nick Montague\AppData\Roaming\FAHClient
05:08:36:             OS: Windows 10 Enterprise
05:08:36:        OS Arch: AMD64
05:08:36:           GPUs: 2
05:08:36:          GPU 0: Bus:6 Slot:0 Func:0 NVIDIA:7 GP104 [GeForce GTX 1080] 8873
05:08:36:          GPU 1: Bus:1 Slot:0 Func:0 NVIDIA:7 GP104 [GeForce GTX 1080] 8873
05:08:36:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:6.1 Driver:9.1
05:08:36:  CUDA Device 1: Platform:0 Device:1 Bus:6 Slot:0 Compute:6.1 Driver:9.1
05:08:36:OpenCL Device 0: Platform:0 Device:0 Bus:NA Slot:NA Compute:1.2 Driver:20.19
05:08:36:OpenCL Device 2: Platform:1 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:391.35
05:08:36:OpenCL Device 3: Platform:1 Device:1 Bus:6 Slot:0 Compute:1.2 Driver:391.35
05:08:36:  Win32 Service: false
05:08:36:***********************************************************************
05:08:36:<config>
05:08:36:  <!-- Network -->
05:08:36:  <proxy v=':8080'/>
05:08:36:
05:08:36:  <!-- Slot Control -->
05:08:36:  <power v='full'/>
05:08:36:
05:08:36:  <!-- User Information -->
05:08:36:  <passkey v='********************************'/>
05:08:36:  <team v='142900'/>
05:08:36:  <user v='Montague-Cripps'/>
05:08:36:
05:08:36:  <!-- Folding Slots -->
05:08:36:  <slot id='0' type='CPU'>
05:08:36:    <cpus v='6'/>
05:08:36:  </slot>
05:08:36:  <slot id='1' type='GPU'>
05:08:36:    <client-type v='advanced'/>
05:08:36:  </slot>
05:08:36:  <slot id='2' type='GPU'>
05:08:36:    <client-type v='advanced'/>
05:08:36:  </slot>
05:08:36:</config>
05:08:36:Trying to access database...
05:08:36:Successfully acquired database lock
05:08:36:Enabled folding slot 00: READY cpu:6
05:08:36:Enabled folding slot 01: READY gpu:0:GP104 [GeForce GTX 1080] 8873
05:08:36:Enabled folding slot 02: READY gpu:1:GP104 [GeForce GTX 1080] 8873
05:08:36:WU00:FS00:Starting
05:08:36:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:\Users\Nick Montague\AppData\Roaming\FAHClient\cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe" -dir 00 -suffix 01 -version 704 -lifeline 7496 -checkpoint 15 -np 6
05:08:36:WU00:FS00:Started FahCore on PID 8724
05:08:36:WU00:FS00:Core PID:6648
05:08:36:WU00:FS00:FahCore 0xa4 started
05:08:36:WU00:FS00:0xa4:
05:08:36:WU00:FS00:0xa4:*------------------------------*
05:08:36:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
05:08:36:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
05:08:36:WU00:FS00:0xa4:
05:08:36:WU00:FS00:0xa4:Preparing to commence simulation
05:08:36:WU00:FS00:0xa4:- Looking at optimizations...
05:08:36:WU00:FS00:0xa4:- Files status OK
05:08:36:WU00:FS00:0xa4:- Expanded 821033 -> 1946552 (decompressed 237.0 percent)
05:08:36:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=821033 data_size=1946552, decompressed_data_size=1946552 diff=0
05:08:36:WU00:FS00:0xa4:- Digital signature verified
05:08:36:WU00:FS00:0xa4:
05:08:36:WU00:FS00:0xa4:Project: 14049 (Run 5, Clone 247, Gen 6)
05:08:36:WU00:FS00:0xa4:
05:08:36:WU00:FS00:0xa4:Assembly optimizations on if available.
05:08:36:WU00:FS00:0xa4:Entering M.D.
05:08:37:WU01:FS01:Connecting to 171.67.108.45:80
05:08:37:WU02:FS02:Connecting to 171.67.108.45:80
05:08:37:WU02:FS02:Assigned to work server 140.163.4.231
05:08:37:WU02:FS02:Requesting new work unit for slot 02: READY gpu:1:GP104 [GeForce GTX 1080] 8873 from 140.163.4.231
05:08:37:WU01:FS01:Assigned to work server 155.247.166.219
05:08:37:WU02:FS02:Connecting to 140.163.4.231:8080
05:08:37:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP104 [GeForce GTX 1080] 8873 from 155.247.166.219
05:08:37:WU01:FS01:Connecting to 155.247.166.219:8080
05:08:38:9:127.0.0.1:New Web connection
05:08:38:WU02:FS02:Downloading 16.94MiB
05:08:39:WU01:FS01:Downloading 904.37KiB
05:08:40:28:127.0.0.1:New Web connection
05:08:42:WU00:FS00:0xa4:Using Gromacs checkpoints
05:08:42:WU00:FS00:0xa4:Mapping NT from 6 to 6 
05:08:43:WU00:FS00:0xa4:Resuming from checkpoint
05:08:43:WU00:FS00:0xa4:Verified 00/wudata_01.log
05:08:43:WU00:FS00:0xa4:Verified 00/wudata_01.trr
05:08:43:WU00:FS00:0xa4:Verified 00/wudata_01.xtc
05:08:43:WU00:FS00:0xa4:Verified 00/wudata_01.edr
05:08:43:WU00:FS00:0xa4:Completed 1675200 out of 2500000 steps  (67%)
05:08:44:WU02:FS02:Download 3.32%
05:08:45:WU01:FS01:Download 77.84%
05:08:46:WU01:FS01:Download complete
05:08:46:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:13783 run:269 clone:4 gen:198 core:0x21 unit:0x000000f40002894b5a833f91c9942895
05:08:46:WU01:FS01:Starting
05:08:46:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:\Users\Nick Montague\AppData\Roaming\FAHClient\cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe" -dir 01 -suffix 01 -version 704 -lifeline 7496 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 1 -opencl-device 1 -cuda-device 1 -gpu 1
05:08:46:WU01:FS01:Started FahCore on PID 8588
05:08:46:WU01:FS01:Core PID:1892
05:08:46:WU01:FS01:FahCore 0x21 started
05:08:46:WU01:FS01:0x21:*********************** Log Started 2018-05-03T05:08:46Z ***********************
05:08:46:WU01:FS01:0x21:Project: 13783 (Run 269, Clone 4, Gen 198)
05:08:46:WU01:FS01:0x21:Unit: 0x000000f40002894b5a833f91c9942895
05:08:46:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
05:08:46:WU01:FS01:0x21:Machine: 1
05:08:46:WU01:FS01:0x21:Reading tar file core.xml
05:08:46:WU01:FS01:0x21:Reading tar file integrator.xml
05:08:46:WU01:FS01:0x21:Reading tar file state.xml
05:08:46:WU01:FS01:0x21:Reading tar file system.xml
05:08:47:WU01:FS01:0x21:Digital signatures verified
05:08:47:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
05:08:47:WU01:FS01:0x21:Version 0.0.18
05:08:50:WU02:FS02:Download 7.75%
05:08:55:WU01:FS01:0x21:Completed 0 out of 5000000 steps (0%)
05:08:55:WU01:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
05:08:56:WU02:FS02:Download 15.12%
05:09:02:WU02:FS02:Download 21.39%
05:09:08:WU02:FS02:Download 33.57%
05:09:14:WU02:FS02:Download 45.37%
05:09:20:WU02:FS02:Download 56.81%
05:09:26:WU02:FS02:Download 72.30%
05:09:32:WU02:FS02:Download 82.63%
05:09:38:WU02:FS02:Download 94.80%
05:09:40:WU02:FS02:Download complete
05:09:40:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:11710 run:3 clone:122 gen:73 core:0x21 unit:0x000000688ca304e75814df4c3fd36169
05:09:40:WU02:FS02:Starting
05:09:40:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:\Users\Nick Montague\AppData\Roaming\FAHClient\cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe" -dir 02 -suffix 01 -version 704 -lifeline 7496 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 1 -opencl-device 0 -cuda-device 0 -gpu 0
05:09:40:WU02:FS02:Started FahCore on PID 7200
05:09:40:WU02:FS02:Core PID:8380
05:09:40:WU02:FS02:FahCore 0x21 started
05:09:41:WU02:FS02:0x21:*********************** Log Started 2018-05-03T05:09:40Z ***********************
05:09:41:WU02:FS02:0x21:Project: 11710 (Run 3, Clone 122, Gen 73)
05:09:41:WU02:FS02:0x21:Unit: 0x000000688ca304e75814df4c3fd36169
05:09:41:WU02:FS02:0x21:CPU: 0x00000000000000000000000000000000
05:09:41:WU02:FS02:0x21:Machine: 2
05:09:41:WU02:FS02:0x21:Reading tar file core.xml
05:09:41:WU02:FS02:0x21:Reading tar file integrator.xml
05:09:41:WU02:FS02:0x21:Reading tar file state.xml
05:09:41:WU02:FS02:0x21:Reading tar file system.xml
05:09:41:WU02:FS02:0x21:Digital signatures verified
05:09:41:WU02:FS02:0x21:Folding@home GPU Core21 Folding@home Core
05:09:41:WU02:FS02:0x21:Version 0.0.18
05:09:45:WU01:FS01:0x21:Completed 50000 out of 5000000 steps (1%)
05:09:51:WU02:FS02:0x21:Completed 0 out of 7500000 steps (0%)
05:09:51:WU02:FS02:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
So, if it helps others, what worked was downgrading the Nvidia driver to version 391.35. It looks as though the other posts reporting that the latest version of the driver is at fault are right.

opencl-index issues

Posted: Mon May 28, 2018 2:01 am
by bollix47
I recently had a problem much like what others are reporting here.
Here is my system info:

11:04:22:            CPU: Intel(R) Core(TM) i5-7600K CPU @ 3.80GHz
11:04:22:         CPU ID: GenuineIntel Family 6 Model 158 Stepping 9
11:04:22:           CPUs: 4
11:04:22:         Memory: 7.90GiB
11:04:22:    Free Memory: 6.04GiB
11:04:22:        Threads: WINDOWS_THREADS
11:04:22:     OS Version: 6.2
11:04:22:    Has Battery: false
11:04:22:     On Battery: false
11:04:22:     UTC Offset: -4
11:04:22:            PID: 6624
11:04:22:            CWD: C:\Users\bolli\AppData\Roaming\FAHClient
11:04:22:             OS: Windows 10 Pro
11:04:22:        OS Arch: AMD64
11:04:22:           GPUs: 2
11:04:22:          GPU 0: Bus:2 Slot:0 Func:0 NVIDIA:7 GP106 [GeForce GTX 1060 3GB] 3935
11:04:22:          GPU 1: Bus:1 Slot:0 Func:0 NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372
11:04:22:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:6.1 Driver:9.1
11:04:22:  CUDA Device 1: Platform:0 Device:1 Bus:2 Slot:0 Compute:6.1 Driver:9.1
11:04:22:OpenCL Device 0: Platform:0 Device:0 Bus:NA Slot:NA Compute:2.1 Driver:22.20
11:04:22:OpenCL Device 2: Platform:1 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:391.35
11:04:22:OpenCL Device 3: Platform:1 Device:1 Bus:2 Slot:0 Compute:1.2 Driver:391.35
11:04:22:  Win32 Service: false
11:04:22:***********************************************************************
As you can see, there are two GPUs (driver 391.35) that can be used for folding. Without any changes to the indices in the configuration, folding was a complete bust (including a message "no compute device"). I noticed that the OpenCL enumerations for my two GPUs were 2 and 3, so I tried changing the opencl-index values to 2 and 3. Although I was able to download work and process it, both WUs were being run on one GPU and there was a message about an invalid opencl-index. Having noticed that the OpenCL device numbers jumped from 0 to 2, I changed the opencl-index values to 1 and 2 in my config, and once I did, folding worked properly on both GPUs.
I don't remember disabling my IGP, but everything worked fine before I replaced a 960 with a second 1060. I was also using the latest beta, 7.5.1, whereas before the hardware change I was using 7.4.16.

Re: Failing to get WU today work server refuses [171.67.108.

Posted: Mon May 28, 2018 8:48 am
by toTOW
You have two OpenCL platforms installed: Platform 1, which is NVIDIA, and Platform 0, which is from Intel. Remove the Intel OpenCL drivers to avoid conflicts.
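
The platform count can also be read straight from the client's system report. The following is a rough sketch that groups the "OpenCL Device" lines by platform; the regular expression is written against the line format shown in the report above and is an assumption about that format, and `opencl_platforms` is an illustrative name.

```python
import re
from collections import defaultdict

# OpenCL lines copied from the system report quoted above.
REPORT = """\
11:04:22:OpenCL Device 0: Platform:0 Device:0 Bus:NA Slot:NA Compute:2.1 Driver:22.20
11:04:22:OpenCL Device 2: Platform:1 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:391.35
11:04:22:OpenCL Device 3: Platform:1 Device:1 Bus:2 Slot:0 Compute:1.2 Driver:391.35
"""

# Assumed line layout, based only on the logs in this thread.
OPENCL_LINE = re.compile(
    r"OpenCL Device (\d+): Platform:(\d+) Device:(\d+) Bus:(\S+) .*Driver:(\S+)"
)


def opencl_platforms(report):
    """Group the OpenCL devices in a system report by platform number."""
    platforms = defaultdict(list)
    for line in report.splitlines():
        match = OPENCL_LINE.search(line)
        if match:
            index, platform, device, bus, driver = match.groups()
            platforms[int(platform)].append(
                {"index": int(index), "device": int(device), "bus": bus, "driver": driver}
            )
    return dict(platforms)


if __name__ == "__main__":
    found = opencl_platforms(REPORT)
    for platform, devices in sorted(found.items()):
        print("Platform", platform, devices)
    if len(found) > 1:
        print("More than one OpenCL platform detected")
```

With the report above, Platform 0 holds the single device with Bus:NA and driver 22.20, and Platform 1 holds the two devices with driver 391.35.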

GTX Titan Z - Only one GPU showing.

Posted: Thu Jun 28, 2018 12:35 pm
by Petaflops
Hi. I've just purchased a cheap GTX Titan Z. I've installed it in the machine and GPU-Z sees both GPUs. However, FAH is only seeing one GPU. I've fiddled around with configuring the slots but I can't make it see both GPUs. Will it only see one, or am I doing something wrong? I'd like to get the most out of the card and would appreciate any assistance.
Many thanks.
Pete

Re: GTX Titan Z - Only one GPU showing.

Posted: Thu Jun 28, 2018 2:21 pm
by Joe_H
Welcome to the folding support forum.

Generally, after a hardware configuration change, the folding client detects new GPUs most reliably if you reinstall it. If any WU is processing, set it to Finish, and after it is done uninstall the client, including data. Keeping a copy of the config.xml file can be useful so you can copy and paste your passkey and username information.

Update your video drivers; sometimes this takes installing them once for each GPU in the system. Then reinstall the folding client. It should then detect and configure both GPUs.

Re: GTX Titan Z - Only one GPU showing.

Posted: Thu Jun 28, 2018 8:13 pm
by Petaflops
Yes, I've already downloaded the latest Nvidia drivers. I'll finish my current units and then try your advice. Thanks. Pete

Re: GTX Titan Z - Only one GPU showing.

Posted: Thu Jun 28, 2018 11:05 pm
by Petaflops
OK. I've made some progress. I've uninstalled the older client and installed the most up-to-date one. The CPU slot and the 2 GPU slots are now detected. However, only the CPU slot and one GPU slot are working. The other GPU slot just sits at "Ready" and is not doing anything. I've looked at the log and there seems to be something wrong. The log is long already, so I've just copied the last bit of it for you to see.

22:46:04:  <!-- Slot Control -->
22:46:04:  <power v='FULL'/>
22:46:04:
22:46:04:  <!-- User Information -->
22:46:04:  <team v='35947'/>
22:46:04:  <user v='Petaflops'/>
22:46:04:
22:46:04:  <!-- Folding Slots -->
22:46:04:  <slot id='0' type='CPU'>
22:46:04:    <paused v='true'/>
22:46:04:  </slot>
22:46:04:  <slot id='1' type='GPU'/>
22:46:04:  <slot id='2' type='GPU'>
22:46:04:    <paused v='true'/>
22:46:04:  </slot>
22:46:04:</config>
22:47:38:WU01:FS01:Starting
22:47:38:ERROR:WU01:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
22:51:52:WU01:FS01:Starting
22:51:52:ERROR:WU01:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
22:51:52:WU01:FS01:Sending unit results: id:01 state:SEND error:FAILED project:11713 run:4 clone:136 gen:190 core:0x21 unit:0x000000f88ca304e75a5cfe1c6eac1a9c
22:51:52:WU01:FS01:Connecting to 140.163.4.231:8080
22:51:53:WU01:FS01:Server responded WORK_ACK (400)
22:51:53:WU01:FS01:Cleaning up
22:51:53:WU03:FS01:Connecting to 65.254.110.245:8080
22:51:53:WU03:FS01:Assigned to work server 140.163.4.231
22:51:53:WU03:FS01:Requesting new work unit for slot 01: READY gpu:0:GK110 [GeForce GTX Titan Z] 8122 from 140.163.4.231
22:51:53:WU03:FS01:Connecting to 140.163.4.231:8080
22:51:54:WU03:FS01:Downloading 16.51MiB
22:52:00:WU03:FS01:Download 71.53%
22:52:01:WU03:FS01:Download complete
22:52:01:WU03:FS01:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:11713 run:24 clone:142 gen:197 core:0x21 unit:0x000000f48ca304e75a5cfe214c56b9fe
22:52:01:WU03:FS01:Starting
22:52:01:ERROR:WU03:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
22:52:02:WU03:FS01:Starting
22:52:02:ERROR:WU03:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
22:53:02:WU03:FS01:Starting
22:53:02:ERROR:WU03:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
22:54:39:WU03:FS01:Starting
22:54:39:ERROR:WU03:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
22:54:45:FS01:Paused
22:55:13:Saving configuration to config.xml
22:55:13:<config>
22:55:13:  <!-- Slot Control -->
22:55:13:  <power v='FULL'/>
22:55:13:
22:55:13:  <!-- User Information -->
22:55:13:  <team v='35947'/>
22:55:13:  <user v='Petaflops'/>
22:55:13:
22:55:13:  <!-- Folding Slots -->
22:55:13:  <slot id='0' type='CPU'>
22:55:13:    <paused v='true'/>
22:55:13:  </slot>
22:55:13:  <slot id='1' type='GPU'>
22:55:13:    <paused v='true'/>
22:55:13:  </slot>
22:55:13:  <slot id='2' type='GPU'>
22:55:13:    <paused v='true'/>
22:55:13:  </slot>
22:55:13:</config>
Hope that all makes sense to someone out there! Any advice on how to get both GPUs folding?

Mod edit: added Code tags to log file listing

Re: GTX Titan Z - Only one GPU showing.

Posted: Thu Jun 28, 2018 11:20 pm
by bruce
The last page of the log is useful, but the first two pages are important, too. Without them I'd have to guess why the GPU slots are not starting; I need to see what FAH detects when it starts.

Re: GTX Titan Z - Only one GPU showing.

Posted: Thu Jun 28, 2018 11:33 pm
by Petaflops
Thanks. Hopefully this is a bit more useful for you.

*********************** Log Started 2018-06-28T23:24:48Z ***********************
23:24:48:************************* Folding@home Client *************************
23:24:48:        Website: https://foldingathome.org/
23:24:48:      Copyright: (c) 2009-2018 foldingathome.org
23:24:48:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
23:24:48:           Args: 
23:24:48:         Config: C:\Users\User\AppData\Roaming\FAHClient\config.xml
23:24:48:******************************** Build ********************************
23:24:48:        Version: 7.5.1
23:24:48:           Date: May 11 2018
23:24:48:           Time: 13:06:32
23:24:48:     Repository: Git
23:24:48:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
23:24:48:         Branch: master
23:24:48:       Compiler: Visual C++ 2008
23:24:48:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
23:24:48:       Platform: win32 10
23:24:48:           Bits: 32
23:24:48:           Mode: Release
23:24:48:******************************* System ********************************
23:24:48:            CPU: Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz
23:24:48:         CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
23:24:48:           CPUs: 8
23:24:48:         Memory: 15.89GiB
23:24:48:    Free Memory: 13.22GiB
23:24:48:        Threads: WINDOWS_THREADS
23:24:48:     OS Version: 6.2
23:24:48:    Has Battery: false
23:24:48:     On Battery: false
23:24:48:     UTC Offset: 1
23:24:48:            PID: 12616
23:24:48:            CWD: C:\Users\User\AppData\Roaming\FAHClient
23:24:48:             OS: Windows 10 Enterprise
23:24:48:        OS Arch: AMD64
23:24:48:           GPUs: 2
23:24:48:          GPU 0: Bus:4 Slot:0 Func:0 NVIDIA:3 GK110 [GeForce GTX Titan Z] 8122
23:24:48:          GPU 1: Bus:3 Slot:0 Func:0 NVIDIA:3 GK110 [GeForce GTX Titan Z] 8122
23:24:48:  CUDA Device 0: Platform:0 Device:0 Bus:3 Slot:0 Compute:3.5 Driver:9.2
23:24:48:  CUDA Device 1: Platform:0 Device:1 Bus:3 Slot:0 Compute:3.5 Driver:9.2
23:24:48:OpenCL Device 0: Platform:0 Device:0 Bus:NA Slot:NA Compute:1.2 Driver:20.19
23:24:48:OpenCL Device 2: Platform:1 Device:0 Bus:3 Slot:0 Compute:1.2 Driver:398.36
23:24:48:OpenCL Device 3: Platform:1 Device:1 Bus:3 Slot:0 Compute:1.2 Driver:398.36
23:24:48:  Win32 Service: false
23:24:48:***********************************************************************
23:24:48:<config>
23:24:48:  <!-- Folding Core -->
23:24:48:  <checkpoint v='10'/>
23:24:48:
23:24:48:  <!-- Network -->
23:24:48:  <proxy v=':8080'/>
23:24:48:
23:24:48:  <!-- Slot Control -->
23:24:48:  <power v='FULL'/>
23:24:48:
23:24:48:  <!-- User Information -->
23:24:48:  <team v='35947'/>
23:24:48:  <user v='Petaflops'/>
23:24:48:
23:24:48:  <!-- Folding Slots -->
23:24:48:  <slot id='0' type='CPU'>
23:24:48:    <paused v='true'/>
23:24:48:  </slot>
23:24:48:  <slot id='1' type='GPU'>
23:24:48:    <paused v='true'/>
23:24:48:  </slot>
23:24:48:  <slot id='2' type='GPU'>
23:24:48:    <paused v='true'/>
23:24:48:  </slot>
23:24:48:</config>
23:24:48:Trying to access database...
23:24:48:Successfully acquired database lock
23:24:48:Enabled folding slot 00: PAUSED cpu:6 (by user)
23:24:48:Enabled folding slot 01: PAUSED gpu:0:GK110 [GeForce GTX Titan Z] 8122 (by user)
23:24:48:Enabled folding slot 02: PAUSED gpu:1:GK110 [GeForce GTX Titan Z] 8122 (by user)
23:24:48:ERROR:No compute devices matched GPU #0 NVIDIA:3 GK110 [GeForce GTX Titan Z] 8122.  You may need to update your graphics drivers.
23:26:46:FS00:Unpaused
23:26:46:FS01:Unpaused
23:26:46:FS02:Unpaused
23:26:46:WU02:FS02:Starting
23:26:46:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\User\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 02 -suffix 01 -version 705 -lifeline 12616 -checkpoint 10 -gpu-vendor nvidia -opencl-platform 1 -opencl-device 0 -cuda-device 0 -gpu 0
23:26:46:WU02:FS02:Started FahCore on PID 9168
23:26:47:WU02:FS02:Core PID:2752
23:26:47:WU02:FS02:FahCore 0x21 started
23:26:47:WU00:FS00:Starting
23:26:47:WARNING:WU00:FS00:AS lowered CPUs from 6 to 4
23:26:47:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\User\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/Win32/AMD64/AVX/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 705 -lifeline 12616 -checkpoint 10 -np 4
23:26:47:WU00:FS00:Started FahCore on PID 3400
23:26:47:WU00:FS00:Core PID:6172
23:26:47:WU00:FS00:FahCore 0xa7 started
23:26:47:WU03:FS01:Starting
23:26:47:ERROR:WU03:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
23:26:47:WU03:FS01:Starting
23:26:47:ERROR:WU03:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually
23:26:47:WU02:FS02:0x21:*********************** Log Started 2018-06-28T23:26:47Z ***********************
23:26:47:WU02:FS02:0x21:Project: 11713 (Run 14, Clone 307, Gen 93)
23:26:47:WU02:FS02:0x21:Unit: 0x0000007b8ca304e75adf78ac32a3777d
23:26:47:WU02:FS02:0x21:CPU: 0x00000000000000000000000000000000
23:26:47:WU02:FS02:0x21:Machine: 2
23:26:47:WU02:FS02:0x21:Digital signatures verified
23:26:47:WU02:FS02:0x21:Folding@home GPU Core21 Folding@home Core
23:26:47:WU02:FS02:0x21:Version 0.0.18
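
One detail in the report above may be worth checking: GPU 0 is listed on Bus:4, but no CUDA or OpenCL device is listed with that bus number. The sketch below flags GPUs whose bus has no OpenCL device. It assumes the client pairs GPUs with compute devices by bus number, which is only an inference from the logs in this thread, not confirmed client behaviour; the function name and regular expressions are illustrative.

```python
import re

# GPU and OpenCL lines copied from the system report quoted above.
REPORT = """\
23:24:48:          GPU 0: Bus:4 Slot:0 Func:0 NVIDIA:3 GK110 [GeForce GTX Titan Z] 8122
23:24:48:          GPU 1: Bus:3 Slot:0 Func:0 NVIDIA:3 GK110 [GeForce GTX Titan Z] 8122
23:24:48:OpenCL Device 0: Platform:0 Device:0 Bus:NA Slot:NA Compute:1.2 Driver:20.19
23:24:48:OpenCL Device 2: Platform:1 Device:0 Bus:3 Slot:0 Compute:1.2 Driver:398.36
23:24:48:OpenCL Device 3: Platform:1 Device:1 Bus:3 Slot:0 Compute:1.2 Driver:398.36
"""

# Assumed line layouts, based only on the logs in this thread.
GPU_LINE = re.compile(r"\bGPU (\d+): Bus:(\d+)")
OPENCL_LINE = re.compile(r"OpenCL Device \d+: Platform:\d+ Device:\d+ Bus:(\w+)")


def gpus_without_opencl_device(report):
    """Return indices of GPUs whose bus number matches no OpenCL device."""
    gpu_bus = {int(index): bus for index, bus in GPU_LINE.findall(report)}
    opencl_buses = set(OPENCL_LINE.findall(report))
    return [index for index, bus in sorted(gpu_bus.items()) if bus not in opencl_buses]


if __name__ == "__main__":
    # GPUs listed here have no OpenCL device reported on the same bus.
    print(gpus_without_opencl_device(REPORT))
```

For this report the function returns GPU 0, the same GPU named in the "No compute devices matched GPU #0" error.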

Re: GTX Titan Z - Only one GPU showing.

Posted: Fri Jun 29, 2018 3:16 am
by bruce
Thanks.

Unfortunately I'm stumped.

I've never seen the error message "You may need to update your graphics drivers." before.

Did Windows install the video drivers or have you downloaded/installed them from NVidia?
Are your OpenCL drivers the ones supplied by NVidia or were they obtained from elsewhere?
Were both GPUs installed when you installed those drivers or did you add one later?