Page 4 of 4

Core 22 should not be on Advanced

Posted: Thu Mar 21, 2019 10:48 pm
by HaloJones
I'm running P11733 on a Windows 1070 and hitting 859K so I can't understand why you're only getting 581K.

Re: Core 22 on Advanced

Posted: Sat Mar 23, 2019 3:06 pm
by toTOW
Two possibilities : throttling or bad states ...

Re: Core 22 on Advanced

Posted: Sat Mar 23, 2019 3:35 pm
by bruce
toTOW wrote:Two possibilities : throttling or bad states ...
... leaving you the option of updating your drivers and deciding whether to keep running them or to revert back to the ones you're currently using.

Re: Core 22 on Advanced

Posted: Tue Apr 16, 2019 6:58 am
by antropofob
rafwiewiora wrote:With the explosion of problems between beta and advanced (which is only a 2.5x increase from ~400 --> ~ 1000 GPUs), this unfortunately might be the only way to learn - I worry we might not have a diverse enough representation of systems in beta right now. Anyways, it's off advanced now, thank you all for your sacrifices. :) I get nice science logs with system configurations for all the failures now, so we'll go from there, see you with the next version!

Greetings rafwiewiora,
is CUDA version getting some love also? Maybe you could drop us a tentative ETA?
Thanks for your work.

Re: Core 22 on Advanced

Posted: Tue Apr 16, 2019 4:18 pm
by bruce
Development of a new FAHCore has always started with a version for OpenCL. THe first requirement is to get it stable and working correctly for whatever projects are distributed to it. That part of the core's beta testing has not yet been completed and it's impossible to predict when that might happen.

FAH does not follow the general trend of the software industry by making predictions when anything will happen. FAHCore_22 will be released WHENEVER IT IS READY.

Once that version is released, FAH may or may not develop a CUDA version. (It has been suggested that this may happen, but it may not.) At that time, the costs of the extra development work and the extra support costs of developing that second FAHCore are evaluated against the potential increases in FAH production and compared to the benefits of other potential development.

No, that decision has not been made yet. The only ETA is "maybe someday"

Re: Core 22 and Project 11733

Posted: Tue Apr 16, 2019 5:10 pm
by oliverjdent
I have the following GPU running in a Windows 10 machine. It has been running stable and clean for five or more years folding until the last month. It gets about 15% through project 11733, starts to fail, goes to the last good checkpoint, and finally fails. It has been doing this for several weeks. I thought it might be a NVIDIA driver issue, but none of my other folding machines are experiencing problems and I downgraded then upgraded the NVIDIA drivers on this unit only to see the same results. The NVIDIA drivers used now are the latest drivers available.

If you need more information please let me know what I should provide.

What recommendations do you have to either fix this issue or work around this issue?

Thank you.

Oliver.

Folding@Home
Version: 7.5.1

OS and GPU Information
OS: Windows 10 Enterprise [Home]
OS Arch: AMD64
GPUs: 1
GPU 0: Bus: 1 Slot: 0 Func: 0 NVIDIA: 3 GK106 [GeForce GTX660]
CUDA Device 0 Platform: 0 Device: 0 Bus: 1 Slot: 0 Compute: 3.0 Driver: 10.1
OpenCL Device 0 Platform: 0 Device: 0 Bus: 1 Slot: 0 Compute: 1.2 Driver 425.31
Win32 Service: false

Slot 1 Log Output:

Code: Select all

21:27:51:WU00:FS01:0x21:Completed 0 out of 25000000 steps (0%)
21:27:51:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
21:44:25:WU00:FS01:0x21:Completed 250000 out of 25000000 steps (1%)
21:53:43:WU00:FS01:0x21:ERROR:exception: Error downloading array energyBuffer: clEnqueueReadBuffer (-5)
21:53:43:WU00:FS01:0x21:Saving result file logfile_01.txt
21:53:43:WU00:FS01:0x21:Saving result file log.txt
21:53:43:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
21:53:43:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
21:53:43:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:14173 run:67 clone:2 gen:13 core:0x21 unit:0x0000000e0002894c5c9a6edec99ee9d0
21:53:43:WU00:FS01:Uploading 2.61KiB to 155.247.166.220
21:53:43:WU00:FS01:Connecting to 155.247.166.220:8080
21:53:43:WU00:FS01:Upload complete
21:53:44:WU00:FS01:Server responded WORK_ACK (400)
21:53:44:WU00:FS01:Cleaning up
21:53:44:WU02:FS01:Connecting to 65.254.110.245:8080
21:53:44:WU02:FS01:Assigned to work server 155.247.166.220
21:53:44:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:GK106 [GeForce GTX 660] from 155.247.166.220
21:53:44:WU02:FS01:Connecting to 155.247.166.220:8080
21:53:46:WU02:FS01:Downloading 994.76KiB
21:53:46:WU02:FS01:Download complete
21:53:47:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:14173 run:78 clone:0 gen:14 core:0x21 unit:0x0000000f0002894c5c9a6ee3466cf0ad
21:53:47:WU02:FS01:Starting
21:53:47:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "D:\My Folding\cores/cores.foldingathome.org/Win32/AMD64/NVIDIA/Fermi/beta/Core_21.fah/FahCore_21.exe" -dir 02 -suffix 01 -version 705 -lifeline 18076 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
21:53:47:WU02:FS01:Started FahCore on PID 14148
21:53:47:WU02:FS01:Core PID:13192
21:53:47:WU02:FS01:FahCore 0x21 started
21:53:47:WU02:FS01:0x21:*********************** Log Started 2019-04-15T21:53:47Z ***********************
21:53:47:WU02:FS01:0x21:Project: 14173 (Run 78, Clone 0, Gen 14)
21:53:47:WU02:FS01:0x21:Unit: 0x0000000f0002894c5c9a6ee3466cf0ad
21:53:47:WU02:FS01:0x21:CPU: 0x00000000000000000000000000000000
21:53:47:WU02:FS01:0x21:Machine: 1
21:53:47:WU02:FS01:0x21:Reading tar file core.xml
21:53:47:WU02:FS01:0x21:Reading tar file integrator.xml
21:53:47:WU02:FS01:0x21:Reading tar file state.xml
21:53:47:WU02:FS01:0x21:Reading tar file system.xml
21:53:47:WU02:FS01:0x21:Digital signatures verified
21:53:47:WU02:FS01:0x21:Folding@home GPU Core21 Folding@home Core
21:53:47:WU02:FS01:0x21:Version 0.0.20
21:53:49:WU02:FS01:0x21:Completed 0 out of 25000000 steps (0%)
21:53:49:WU02:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
21:56:03:WU02:FS01:0x21:ERROR:exception: Error downloading array energyBuffer: clEnqueueReadBuffer (-5)
21:56:03:WU02:FS01:0x21:Saving result file logfile_01.txt
21:56:03:WU02:FS01:0x21:Saving result file log.txt
21:56:03:WU02:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
21:56:04:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
21:56:04:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:14173 run:78 clone:0 gen:14 core:0x21 unit:0x0000000f0002894c5c9a6ee3466cf0ad
21:56:04:WU02:FS01:Uploading 2.56KiB to 155.247.166.220
21:56:04:WU02:FS01:Connecting to 155.247.166.220:8080
21:56:04:WU02:FS01:Upload complete
21:56:04:WU02:FS01:Server responded WORK_ACK (400)
21:56:04:WU02:FS01:Cleaning up
21:56:04:WU00:FS01:Connecting to 65.254.110.245:8080
21:56:04:WU00:FS01:Assigned to work server 155.247.166.220
21:56:04:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GK106 [GeForce GTX 660] from 155.247.166.220
21:56:04:WU00:FS01:Connecting to 155.247.166.220:8080
21:56:06:WU00:FS01:Downloading 994.74KiB
21:56:07:WU00:FS01:Download complete
21:56:07:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:14173 run:19 clone:1 gen:14 core:0x21 unit:0x0000000f0002894c5c9a6eb5ddca585e
21:56:07:WU00:FS01:Starting
21:56:07:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "D:\My Folding\cores/cores.foldingathome.org/Win32/AMD64/NVIDIA/Fermi/beta/Core_21.fah/FahCore_21.exe" -dir 00 -suffix 01 -version 705 -lifeline 18076 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
21:56:07:WU00:FS01:Started FahCore on PID 7300
21:56:07:WU00:FS01:Core PID:10388
21:56:07:WU00:FS01:FahCore 0x21 started
21:56:08:WU00:FS01:0x21:*********************** Log Started 2019-04-15T21:56:07Z ***********************
21:56:08:WU00:FS01:0x21:Project: 14173 (Run 19, Clone 1, Gen 14)
21:56:08:WU00:FS01:0x21:Unit: 0x0000000f0002894c5c9a6eb5ddca585e
21:56:08:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
21:56:08:WU00:FS01:0x21:Machine: 1
21:56:08:WU00:FS01:0x21:Reading tar file core.xml
21:56:08:WU00:FS01:0x21:Reading tar file integrator.xml
21:56:08:WU00:FS01:0x21:Reading tar file state.xml
21:56:08:WU00:FS01:0x21:Reading tar file system.xml
21:56:08:WU00:FS01:0x21:Digital signatures verified
21:56:08:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
21:56:08:WU00:FS01:0x21:Version 0.0.20
21:56:09:WU00:FS01:0x21:Completed 0 out of 25000000 steps (0%)
21:56:09:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
22:03:28:WU00:FS01:0x21:ERROR:exception: Error downloading array energyBuffer: clEnqueueReadBuffer (-5)
22:03:28:WU00:FS01:0x21:Saving result file logfile_01.txt
22:03:28:WU00:FS01:0x21:Saving result file log.txt
22:03:28:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
22:03:29:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
22:03:29:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:14173 run:19 clone:1 gen:14 core:0x21 unit:0x0000000f0002894c5c9a6eb5ddca585e
22:03:29:WU00:FS01:Uploading 2.59KiB to 155.247.166.220
22:03:29:WU00:FS01:Connecting to 155.247.166.220:8080
22:03:29:WU00:FS01:Upload complete
22:03:29:WU00:FS01:Server responded WORK_ACK (400)
22:03:29:WU00:FS01:Cleaning up
22:03:29:WU02:FS01:Connecting to 65.254.110.245:8080
22:03:30:WU02:FS01:Assigned to work server 155.247.166.220
22:03:30:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:GK106 [GeForce GTX 660] from 155.247.166.220
22:03:30:WU02:FS01:Connecting to 155.247.166.220:8080
22:03:32:WU02:FS01:Downloading 994.16KiB
22:03:33:WU02:FS01:Download complete
22:03:33:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:14173 run:68 clone:0 gen:15 core:0x21 unit:0x000000100002894c5c9a6ede6429d4e0
22:03:33:WU02:FS01:Starting
22:03:33:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "D:\My Folding\cores/cores.foldingathome.org/Win32/AMD64/NVIDIA/Fermi/beta/Core_21.fah/FahCore_21.exe" -dir 02 -suffix 01 -version 705 -lifeline 18076 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
22:03:33:WU02:FS01:Started FahCore on PID 652
22:03:33:WU02:FS01:Core PID:4328
22:03:33:WU02:FS01:FahCore 0x21 started
22:03:34:WU02:FS01:0x21:*********************** Log Started 2019-04-15T22:03:33Z ***********************
22:03:34:WU02:FS01:0x21:Project: 14173 (Run 68, Clone 0, Gen 15)
22:03:34:WU02:FS01:0x21:Unit: 0x000000100002894c5c9a6ede6429d4e0
22:03:34:WU02:FS01:0x21:CPU: 0x00000000000000000000000000000000
22:03:34:WU02:FS01:0x21:Machine: 1
22:03:34:WU02:FS01:0x21:Reading tar file core.xml
22:03:34:WU02:FS01:0x21:Reading tar file integrator.xml
22:03:34:WU02:FS01:0x21:Reading tar file state.xml
22:03:34:WU02:FS01:0x21:Reading tar file system.xml
22:03:34:WU02:FS01:0x21:Digital signatures verified
22:03:34:WU02:FS01:0x21:Folding@home GPU Core21 Folding@home Core
22:03:34:WU02:FS01:0x21:Version 0.0.20
22:03:35:WU02:FS01:0x21:Completed 0 out of 25000000 steps (0%)
22:03:35:WU02:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
22:03:51:WU02:FS01:0x21:ERROR:exception: Error downloading array energyBuffer: clEnqueueReadBuffer (-5)
22:03:51:WU02:FS01:0x21:Saving result file logfile_01.txt
22:03:51:WU02:FS01:0x21:Saving result file log.txt
22:03:51:WU02:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
22:03:51:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
22:03:51:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:14173 run:68 clone:0 gen:15 core:0x21 unit:0x000000100002894c5c9a6ede6429d4e0
22:03:51:WU02:FS01:Uploading 2.55KiB to 155.247.166.220
22:03:51:WU02:FS01:Connecting to 155.247.166.220:8080
22:03:51:WU02:FS01:Upload complete
22:03:52:WU02:FS01:Server responded WORK_ACK (400)
22:03:52:WU02:FS01:Cleaning up
22:03:52:WU00:FS01:Connecting to 65.254.110.245:8080
22:03:52:WU00:FS01:Assigned to work server 155.247.166.220
22:03:52:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GK106 [GeForce GTX 660] from 155.247.166.220
22:03:52:WU00:FS01:Connecting to 155.247.166.220:8080
22:03:54:WU00:FS01:Downloading 994.65KiB
22:03:55:WU00:FS01:Download complete
22:03:55:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:14173 run:28 clone:2 gen:13 core:0x21 unit:0x0000000d0002894c5c9a6ebb0d4cbac5
22:03:55:WU00:FS01:Starting
22:03:55:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "D:\My Folding\cores/cores.foldingathome.org/Win32/AMD64/NVIDIA/Fermi/beta/Core_21.fah/FahCore_21.exe" -dir 00 -suffix 01 -version 705 -lifeline 18076 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
22:03:55:WU00:FS01:Started FahCore on PID 2580
22:03:55:WU00:FS01:Core PID:18500
22:03:55:WU00:FS01:FahCore 0x21 started
22:03:55:WU00:FS01:0x21:*********************** Log Started 2019-04-15T22:03:55Z ***********************
22:03:55:WU00:FS01:0x21:Project: 14173 (Run 28, Clone 2, Gen 13)
22:03:55:WU00:FS01:0x21:Unit: 0x0000000d0002894c5c9a6ebb0d4cbac5
22:03:55:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
22:03:55:WU00:FS01:0x21:Machine: 1
22:03:55:WU00:FS01:0x21:Reading tar file core.xml
22:03:55:WU00:FS01:0x21:Reading tar file integrator.xml
22:03:55:WU00:FS01:0x21:Reading tar file state.xml
22:03:55:WU00:FS01:0x21:Reading tar file system.xml
22:03:55:WU00:FS01:0x21:Digital signatures verified
22:03:55:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
22:03:55:WU00:FS01:0x21:Version 0.0.20
22:03:57:WU00:FS01:0x21:Completed 0 out of 25000000 steps (0%)
22:03:57:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
22:06:23:WU00:FS01:0x21:ERROR:exception: Error downloading array energyBuffer: clEnqueueReadBuffer (-5)
22:06:23:WU00:FS01:0x21:Saving result file logfile_01.txt
22:06:23:WU00:FS01:0x21:Saving result file log.txt
22:06:23:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
22:06:24:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
22:06:24:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:14173 run:28 clone:2 gen:13 core:0x21 unit:0x0000000d0002894c5c9a6ebb0d4cbac5
22:06:24:WU00:FS01:Uploading 2.58KiB to 155.247.166.220
22:06:24:WU00:FS01:Connecting to 155.247.166.220:8080
22:06:24:WU00:FS01:Upload complete
22:06:24:WU00:FS01:Server responded WORK_ACK (400)
22:06:24:WU00:FS01:Cleaning up
22:06:24:WU02:FS01:Connecting to 65.254.110.245:8080
22:06:24:WU02:FS01:Assigned to work server 140.163.4.241
22:06:24:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:GK106 [GeForce GTX 660] from 140.163.4.241
22:06:24:WU02:FS01:Connecting to 140.163.4.241:8080
22:06:25:WU02:FS01:Downloading 11.66MiB
22:06:27:WU02:FS01:Download complete
22:06:27:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:11733 run:0 clone:770 gen:118 core:0x22 unit:0x0000007a8ca304f15c8acee1a3ed31b1
22:06:27:WU02:FS01:Starting
22:06:27:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "D:\My Folding\cores/cores.foldingathome.org/Win32/AMD64/NVIDIA/Fermi/beta/Core_22.fah/FahCore_22.exe" -dir 02 -suffix 01 -version 705 -lifeline 18076 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
22:06:27:WU02:FS01:Started FahCore on PID 13636
22:06:27:WU02:FS01:Core PID:13884
22:06:27:WU02:FS01:FahCore 0x22 started
22:06:27:WU02:FS01:0x22:*********************** Log Started 2019-04-15T22:06:27Z ***********************
22:06:27:WU02:FS01:0x22:*************************** Core22 Folding@home Core ***************************
22:06:27:WU02:FS01:0x22:       Type: 0x22
22:06:27:WU02:FS01:0x22:       Core: Core22
22:06:27:WU02:FS01:0x22:    Website: https://foldingathome.org/
22:06:27:WU02:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
22:06:27:WU02:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
22:06:27:WU02:FS01:0x22:             <rafal.wiewiora@choderalab.org>
22:06:27:WU02:FS01:0x22:       Args: -dir 02 -suffix 01 -version 705 -lifeline 13636 -checkpoint 15
22:06:27:WU02:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
22:06:27:WU02:FS01:0x22:             0 -gpu 0
22:06:27:WU02:FS01:0x22:     Config: <none>
22:06:27:WU02:FS01:0x22:************************************ Build *************************************
22:06:27:WU02:FS01:0x22:    Version: 0.0.1
22:06:27:WU02:FS01:0x22:       Date: Feb 25 2019
22:06:27:WU02:FS01:0x22:       Time: 19:14:08
22:06:27:WU02:FS01:0x22: Repository: Git
22:06:27:WU02:FS01:0x22:   Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
22:06:27:WU02:FS01:0x22:     Branch: HEAD
22:06:27:WU02:FS01:0x22:   Compiler: Visual C++ 2008
22:06:27:WU02:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
22:06:27:WU02:FS01:0x22:   Platform: win32 10
22:06:27:WU02:FS01:0x22:       Bits: 64
22:06:27:WU02:FS01:0x22:       Mode: Release
22:06:27:WU02:FS01:0x22:************************************ System ************************************
22:06:27:WU02:FS01:0x22:        CPU: Intel(R) Core(TM) i5-4670K CPU @ 3.40GHz
22:06:27:WU02:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
22:06:27:WU02:FS01:0x22:       CPUs: 4
22:06:27:WU02:FS01:0x22:     Memory: 15.95GiB
22:06:27:WU02:FS01:0x22:Free Memory: 8.03GiB
22:06:27:WU02:FS01:0x22:    Threads: WINDOWS_THREADS
22:06:27:WU02:FS01:0x22: OS Version: 6.2
22:06:27:WU02:FS01:0x22:Has Battery: false
22:06:27:WU02:FS01:0x22: On Battery: false
22:06:27:WU02:FS01:0x22: UTC Offset: -5
22:06:27:WU02:FS01:0x22:        PID: 13884
22:06:27:WU02:FS01:0x22:        CWD: D:\My Folding\work
22:06:27:WU02:FS01:0x22:         OS: Windows 10 Pro
22:06:27:WU02:FS01:0x22:    OS Arch: AMD64
22:06:27:WU02:FS01:0x22:********************************************************************************
22:06:27:WU02:FS01:0x22:Project: 11733 (Run 0, Clone 770, Gen 118)
22:06:27:WU02:FS01:0x22:Unit: 0x0000007a8ca304f15c8acee1a3ed31b1
22:06:27:WU02:FS01:0x22:Reading tar file core.xml
22:06:27:WU02:FS01:0x22:Reading tar file integrator.xml
22:06:27:WU02:FS01:0x22:Reading tar file state.xml
22:06:27:WU02:FS01:0x22:Reading tar file system.xml
22:06:27:WU02:FS01:0x22:Digital signatures verified
22:06:27:WU02:FS01:0x22:Folding@home GPU Core22 Folding@home Core
22:06:27:WU02:FS01:0x22:Version 0.0.1
22:06:29:WU02:FS01:0x22:Completed 0 out of 5000000 steps (0%)
22:06:30:WU02:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
22:12:51:WU02:FS01:0x22:Completed 50000 out of 5000000 steps (1%)
22:19:12:WU02:FS01:0x22:Completed 100000 out of 5000000 steps (2%)
22:25:32:WU02:FS01:0x22:Completed 150000 out of 5000000 steps (3%)
22:31:53:WU02:FS01:0x22:Completed 200000 out of 5000000 steps (4%)
22:38:13:WU02:FS01:0x22:Completed 250000 out of 5000000 steps (5%)
22:44:34:WU02:FS01:0x22:Completed 300000 out of 5000000 steps (6%)
22:50:54:WU02:FS01:0x22:Completed 350000 out of 5000000 steps (7%)
22:57:14:WU02:FS01:0x22:Completed 400000 out of 5000000 steps (8%)
23:03:34:WU02:FS01:0x22:Completed 450000 out of 5000000 steps (9%)
23:09:54:WU02:FS01:0x22:Completed 500000 out of 5000000 steps (10%)
23:16:15:WU02:FS01:0x22:Completed 550000 out of 5000000 steps (11%)
23:22:36:WU02:FS01:0x22:Completed 600000 out of 5000000 steps (12%)
23:28:56:WU02:FS01:0x22:Completed 650000 out of 5000000 steps (13%)
23:35:16:WU02:FS01:0x22:Completed 700000 out of 5000000 steps (14%)
23:41:35:WU02:FS01:0x22:Completed 750000 out of 5000000 steps (15%)
23:43:24:WU02:FS01:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
23:43:24:WU02:FS01:0x22:Following exception occured: Particle coordinate is nan
23:49:43:WU02:FS01:0x22:Completed 550000 out of 5000000 steps (11%)
23:54:14:WU02:FS01:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
23:54:14:WU02:FS01:0x22:Following exception occured: Particle coordinate is nan
00:06:55:WU02:FS01:0x22:Completed 600000 out of 5000000 steps (12%)
00:08:19:WU02:FS01:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
00:08:19:WU02:FS01:0x22:Following exception occured: Particle coordinate is nan
00:08:19:WU02:FS01:0x22:ERROR:114: Max Retries Reached
00:08:19:WU02:FS01:0x22:Saving result file ..\logfile_01.txt
00:08:19:WU02:FS01:0x22:Saving result file badstate-0.xml
00:08:19:WU02:FS01:0x22:Saving result file badstate-1.xml
00:08:19:WU02:FS01:0x22:Saving result file badstate-2.xml
00:08:19:WU02:FS01:0x22:Saving result file checkpointState.xml
00:08:19:WU02:FS01:0x22:Saving result file checkpt.crc
00:08:19:WU02:FS01:0x22:Saving result file positions.xtc
00:08:19:WU02:FS01:0x22:Saving result file science.log
00:08:19:WU02:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
00:08:20:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
00:08:20:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:11733 run:0 clone:770 gen:118 core:0x22 unit:0x0000007a8ca304f15c8acee1a3ed31b1
00:08:20:WU02:FS01:Uploading 13.08MiB to 140.163.4.241
00:08:20:WU02:FS01:Connecting to 140.163.4.241:8080
00:08:44:WU02:FS01:Upload 14.81%
00:08:50:WU02:FS01:Upload 24.37%
00:08:56:WU02:FS01:Upload 70.24%
00:09:01:WU02:FS01:Upload complete
00:09:01:WU02:FS01:Server responded WORK_ACK (400)
00:09:01:WU02:FS01:Cleaning up
Mod edit: replaced Quote tags with Code tags on log

Re: Core 22 on Advanced

Posted: Tue Apr 16, 2019 5:24 pm
by Joe_H
Are you receiving the Core 22 WU's with your client set to Advanced? If so I will contact the person running this project and have them check their server settings.

Otherwise, if you are using the Beta setting my recommendation is to stop. Support for that setting is only done in the Beta Team forum. You are welcome to read posts there, and if you want to post there you can apply to be a member.

P.S. The WU is not faulty, it was successfully processed by another. Core 22 does work GPU's harder, so you probably need to reduce any overclock or may need to underclock this card.

Re: Core 22 on Advanced

Posted: Tue Apr 16, 2019 5:47 pm
by rafwiewiora
Replying to above posts - yeah the ETA for anything with the core is "when I finally get to it on my grad student-style to-do list".. (which is continuously updated with always more pressing deadlines..). Science depends on it though, so it's actually in the "within this month" category. I've started floating around the idea of a full time computational lab tech to do this in reasonable time frames, but I think currently we only have grant money for server hardware.

Re: advanced - project's not there, user must be running beta.

Re: Core 22 and Project 11733

Posted: Wed Apr 17, 2019 12:40 am
by bruce
oliverjdent wrote:What recommendations do you have to either fix this issue or work around this issue?
I notice you' have not joined the beta team.
1) If you choose to use the beta flag, you should join.
2) Part of the registration process involves the understanding that when you test beta WUs or beta FAHCores, the failure rate WILL BE HIGHER.

The work-around is to avoid beta testing. If you do not wish to beta test, kindly remove the beta flag.
A Brief Overview Of F@h Beta Team Membership

Re: Core 22 on Advanced

Posted: Wed Apr 17, 2019 3:08 am
by foldy
@oliverjdent: Looks like your GPU gtx 660 cannot handle neither Core_21 beta 0.0.20 nor Core_22 beta 0.0.1 stable

You can try to downclock your gtx 660 to see if it is an overclock issue.