I run FAH on my GPU only, usually when I'm running Handbrake on my CPU. Recently it has started to fail repeatedly somewhere between 3% and 10% of the way through a work unit, and sometimes crash altogether.
I have tried reinstalling FAH from scratch and updating my graphics drivers to no avail. CoreTemp and GPU-Z tell me that I don't have any overheating issues. I have tried running FAH on my GPU with the PC otherwise idle, and get the same results.
I've searched the forums, but can't find anything that seems to apply. My search-fu is weak, so I apologise if I've missed an obvious answer.
My system:
i7-3770k, 16GB RAM, nVidia GT980Ti, running Windows 7 Ultimate 64-bit
I've attached the logfile from my latest run. I've read through it, but don't understand what it's telling me.
Can anyone offer a solution, or a suggestion for further investigation, please?
Code: Select all
*********************** Log Started 2017-09-10T18:27:40Z ***********************
18:27:40:************************* Folding@home Client *************************
18:27:40: Website: http://folding.stanford.edu/
18:27:40: Copyright: (c) 2009-2014 Stanford University
18:27:40: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
18:27:40: Args: --open-web-control
18:27:40: Config: C:/Users/Euphonium/AppData/Roaming/FAHClient/config.xml
18:27:40:******************************** Build ********************************
18:27:40: Version: 7.4.4
18:27:40: Date: Mar 4 2014
18:27:40: Time: 20:26:54
18:27:40: SVN Rev: 4130
18:27:40: Branch: fah/trunk/client
18:27:40: Compiler: Intel(R) C++ MSVC 1500 mode 1200
18:27:40: Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
18:27:40: /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
18:27:40: Platform: win32 XP
18:27:40: Bits: 32
18:27:40: Mode: Release
18:27:40:******************************* System ********************************
18:27:40: CPU: Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz
18:27:40: CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
18:27:40: CPUs: 8
18:27:40: Memory: 15.89GiB
18:27:40: Free Memory: 9.15GiB
18:27:40: Threads: WINDOWS_THREADS
18:27:40: OS Version: 6.1
18:27:40: Has Battery: false
18:27:40: On Battery: false
18:27:40: UTC Offset: 1
18:27:40: PID: 1308
18:27:40: CWD: C:/Users/Euphonium/AppData/Roaming/FAHClient
18:27:40: OS: Windows 7 Ultimate
18:27:40: OS Arch: AMD64
18:27:40: GPUs: 1
18:27:40: GPU 0: NVIDIA:7 GM200 [GeForce GTX 980 Ti] 5632
18:27:40: CUDA: 5.2
18:27:40: CUDA Driver: 9000
18:27:40:Win32 Service: false
18:27:40:***********************************************************************
18:27:40:<config>
18:27:40: <!-- Folding Core -->
18:27:40: <checkpoint v='5'/>
18:27:40:
18:27:40: <!-- Folding Slot Configuration -->
18:27:40: <cause v='CANCER'/>
18:27:40:
18:27:40: <!-- Network -->
18:27:40: <proxy v=':8080'/>
18:27:40:
18:27:40: <!-- Slot Control -->
18:27:40: <pause-on-start v='true'/>
18:27:40:
18:27:40: <!-- User Information -->
18:27:40: <passkey v='********************************'/>
18:27:40: <team v='229610'/>
18:27:40: <user v='Euphonium'/>
18:27:40:
18:27:40: <!-- Folding Slots -->
18:27:40: <slot id='0' type='CPU'/>
18:27:40: <slot id='1' type='GPU'/>
18:27:40:</config>
18:27:40:Trying to access database...
18:27:40:Successfully acquired database lock
18:27:40:Enabled folding slot 00: PAUSED cpu:6 (by user)
18:27:40:Enabled folding slot 01: PAUSED gpu:0:GM200 [GeForce GTX 980 Ti] 5632 (by user)
18:27:43:8:127.0.0.1:New Web connection
18:28:06:FS01:Unpaused
18:28:06:WU00:FS01:Connecting to 171.67.108.45:80
18:28:07:WU00:FS01:Assigned to work server 171.67.108.157
18:28:07:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM200 [GeForce GTX 980 Ti] 5632 from 171.67.108.157
18:28:07:WU00:FS01:Connecting to 171.67.108.157:8080
18:28:08:WU00:FS01:Downloading 5.17MiB
18:28:14:WU00:FS01:Download 54.39%
18:28:18:WU00:FS01:Download complete
18:28:18:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9415 run:1255 clone:1 gen:120 core:0x21 unit:0x0000008bab436c9d585e06d482a81349
18:28:18:WU00:FS01:Starting
18:28:18:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Euphonium/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 1308 -checkpoint 5 -gpu 0 -gpu-vendor nvidia
18:28:18:WU00:FS01:Started FahCore on PID 7324
18:28:18:WU00:FS01:Core PID:3520
18:28:18:WU00:FS01:FahCore 0x21 started
18:28:18:WU00:FS01:0x21:*********************** Log Started 2017-09-10T18:28:18Z ***********************
18:28:18:WU00:FS01:0x21:Project: 9415 (Run 1255, Clone 1, Gen 120)
18:28:18:WU00:FS01:0x21:Unit: 0x0000008bab436c9d585e06d482a81349
18:28:18:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
18:28:18:WU00:FS01:0x21:Machine: 1
18:28:18:WU00:FS01:0x21:Reading tar file core.xml
18:28:18:WU00:FS01:0x21:Reading tar file integrator.xml
18:28:18:WU00:FS01:0x21:Reading tar file state.xml
18:28:18:WU00:FS01:0x21:Reading tar file system.xml
18:28:19:WU00:FS01:0x21:Digital signatures verified
18:28:19:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
18:28:19:WU00:FS01:0x21:Version 0.0.18
18:28:28:WU00:FS01:0x21:Completed 0 out of 6250000 steps (0%)
18:28:28:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
18:28:37:WARNING:WU00:FS01:FahCore returned an unknown error code which probably indicates that it crashed
18:28:37:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
18:28:37:WU00:FS01:Starting
18:28:37:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Euphonium/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 1308 -checkpoint 5 -gpu 0 -gpu-vendor nvidia
18:28:37:WU00:FS01:Started FahCore on PID 9540
18:28:37:WU00:FS01:Core PID:3088
18:28:37:WU00:FS01:FahCore 0x21 started
18:28:39:WU00:FS01:0x21:*********************** Log Started 2017-09-10T18:28:38Z ***********************
18:28:39:WU00:FS01:0x21:Project: 9415 (Run 1255, Clone 1, Gen 120)
18:28:39:WU00:FS01:0x21:Unit: 0x0000008bab436c9d585e06d482a81349
18:28:39:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
18:28:39:WU00:FS01:0x21:Machine: 1
18:28:39:WU00:FS01:0x21:Digital signatures verified
18:28:39:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
18:28:39:WU00:FS01:0x21:Version 0.0.18
18:28:42:WU00:FS01:0x21:Completed 0 out of 6250000 steps (0%)
18:28:42:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
18:29:51:FS01:Finishing
18:30:29:WU00:FS01:0x21:Completed 62500 out of 6250000 steps (1%)
18:32:05:WARNING:WU00:FS01:FahCore returned an unknown error code which probably indicates that it crashed
18:32:05:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
18:32:05:WU00:FS01:Starting
18:32:05:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Euphonium/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 1308 -checkpoint 5 -gpu 0 -gpu-vendor nvidia
18:32:05:WU00:FS01:Started FahCore on PID 2348
18:32:05:WU00:FS01:Core PID:6116
18:32:05:WU00:FS01:FahCore 0x21 started
18:32:06:WU00:FS01:0x21:*********************** Log Started 2017-09-10T18:32:06Z ***********************
18:32:06:WU00:FS01:0x21:Project: 9415 (Run 1255, Clone 1, Gen 120)
18:32:06:WU00:FS01:0x21:Unit: 0x0000008bab436c9d585e06d482a81349
18:32:06:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
18:32:06:WU00:FS01:0x21:Machine: 1
18:32:06:WU00:FS01:0x21:Digital signatures verified
18:32:06:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
18:32:06:WU00:FS01:0x21:Version 0.0.18
18:32:06:WU00:FS01:0x21: Found a checkpoint file
18:32:10:WU00:FS01:0x21:Completed 100000 out of 6250000 steps (1%)
18:32:10:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
18:32:55:WU00:FS01:0x21:Completed 125000 out of 6250000 steps (2%)
18:34:45:WU00:FS01:0x21:Completed 187500 out of 6250000 steps (3%)
18:36:06:WARNING:WU00:FS01:FahCore returned an unknown error code which probably indicates that it crashed
18:36:06:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
18:36:06:WU00:FS01:Starting
18:36:06:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Euphonium/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 1308 -checkpoint 5 -gpu 0 -gpu-vendor nvidia
18:36:06:WU00:FS01:Started FahCore on PID 8816
18:36:07:WU00:FS01:Core PID:2836
18:36:07:WU00:FS01:FahCore 0x21 started
18:36:08:WU00:FS01:0x21:*********************** Log Started 2017-09-10T18:36:07Z ***********************
18:36:08:WU00:FS01:0x21:Project: 9415 (Run 1255, Clone 1, Gen 120)
18:36:08:WU00:FS01:0x21:Unit: 0x0000008bab436c9d585e06d482a81349
18:36:08:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
18:36:08:WU00:FS01:0x21:Machine: 1
18:36:08:WU00:FS01:0x21:Digital signatures verified
18:36:08:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
18:36:08:WU00:FS01:0x21:Version 0.0.18
18:36:08:WU00:FS01:0x21: Found a checkpoint file
18:36:10:WU00:FS01:0x21:Completed 200000 out of 6250000 steps (3%)
18:36:10:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
18:36:56:WARNING:WU00:FS01:FahCore returned an unknown error code which probably indicates that it crashed
18:36:56:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
18:37:06:WU00:FS01:Starting
18:37:06:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Euphonium/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 1308 -checkpoint 5 -gpu 0 -gpu-vendor nvidia
18:37:06:WU00:FS01:Started FahCore on PID 6004
18:37:07:WU00:FS01:Core PID:3460
18:37:07:WU00:FS01:FahCore 0x21 started
18:37:07:WU00:FS01:0x21:*********************** Log Started 2017-09-10T18:37:07Z ***********************
18:37:07:WU00:FS01:0x21:Project: 9415 (Run 1255, Clone 1, Gen 120)
18:37:07:WU00:FS01:0x21:Unit: 0x0000008bab436c9d585e06d482a81349
18:37:07:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
18:37:07:WU00:FS01:0x21:Machine: 1
18:37:07:WU00:FS01:0x21:Digital signatures verified
18:37:07:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
18:37:07:WU00:FS01:0x21:Version 0.0.18
18:37:08:WU00:FS01:0x21: Found a checkpoint file
18:37:11:WU00:FS01:0x21:Completed 200000 out of 6250000 steps (3%)
18:37:11:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
18:37:58:WARNING:WU00:FS01:FahCore returned an unknown error code which probably indicates that it crashed
18:37:58:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
18:37:58:WARNING:WU00:FS01:Too many errors, failing
18:37:58:WU00:FS01:Sending unit results: id:00 state:SEND error:FAILED project:9415 run:1255 clone:1 gen:120 core:0x21 unit:0x0000008bab436c9d585e06d482a81349
18:37:58:WU00:FS01:Connecting to 171.67.108.157:8080
18:37:58:WU00:FS01:Server responded WORK_ACK (400)
18:37:58:WU00:FS01:Cleaning up