13001 WU failure
Posted: Sat Oct 04, 2014 4:22 pm
This system is running Mint 17 with a GTX 750 Ti and NVIDIA driver 343.22, all at stock clocks. It has been running project 9201 WUs without problems. This is the first 13001 WU I have seen, and it failed with:
15:49:07:WU00:FS01:0x17:ERROR:exception: Force RMSE error of 447.223 with threshold of 5
What does this error mean?
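For readers unfamiliar with the term, RMSE is a root-mean-square error. The log further down saves paired `...Core.xml` and `...Ref.xml` files, which suggests the core compares its own forces against a reference calculation and aborts when the two disagree by more than the threshold. The sketch below only illustrates that kind of check. The function names, the per-atom normalization, and the sample numbers are assumptions for illustration, not the actual FahCore_17 code.

```python
import math

def force_rmse(core_forces, ref_forces):
    """RMSE between two equal-length lists of (fx, fy, fz) force vectors.

    Assumption: the squared difference is averaged per atom; the real
    core may normalize differently.
    """
    if len(core_forces) != len(ref_forces) or not core_forces:
        raise ValueError("force lists must be non-empty and the same length")
    total = 0.0
    for (cx, cy, cz), (rx, ry, rz) in zip(core_forces, ref_forces):
        total += (cx - rx) ** 2 + (cy - ry) ** 2 + (cz - rz) ** 2
    return math.sqrt(total / len(core_forces))

def check_forces(core_forces, ref_forces, threshold=5.0):
    """Raise if the two force sets disagree by more than `threshold`."""
    rmse = force_rmse(core_forces, ref_forces)
    if rmse > threshold:
        # Message shaped like the one in the log, for illustration only.
        raise RuntimeError(
            f"Force RMSE error of {rmse:g} with threshold of {threshold:g}"
        )
    return rmse

if __name__ == "__main__":
    ref = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0)]   # hypothetical reference forces
    bad = [(11.0, 0.0, 0.0), (0.0, 2.0, 0.0)]  # one atom is far off
    try:
        check_forces(bad, ref)
    except RuntimeError as err:
        print(err)
```

Read this way, a value of 447.223 against a threshold of 5 would mean the two force calculations disagreed by a very wide margin, not that they narrowly missed the limit.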
Mod edit: Please use Code tags instead of Quote tags around log files
*********************** Log Started 2014-10-04T15:45:26Z ***********************
15:45:26:************************* Folding@home Client *************************
15:45:26: Website: http://folding.stanford.edu/
15:45:26: Copyright: (c) 2009-2014 Stanford University
15:45:26: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
15:45:26: Args: --child --lifeline 2647 /etc/fahclient/config.xml --run-as
15:45:26: fahclient --pid-file=/var/run/fahclient.pid --daemon
15:45:26: Config: /etc/fahclient/config.xml
15:45:26:******************************** Build ********************************
15:45:26: Version: 7.4.4
15:45:26: Date: Mar 4 2014
15:45:26: Time: 12:02:38
15:45:26: SVN Rev: 4130
15:45:26: Branch: fah/trunk/client
15:45:26: Compiler: GNU 4.4.7
15:45:26: Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
15:45:26: -fno-unsafe-math-optimizations -msse2
15:45:26: Platform: linux2 3.2.0-1-amd64
15:45:26: Bits: 64
15:45:26: Mode: Release
15:45:26:******************************* System ********************************
15:45:26: CPU: AMD Phenom(tm) II X6 1045T Processor
15:45:26: CPU ID: AuthenticAMD Family 16 Model 10 Stepping 0
15:45:26: CPUs: 6
15:45:26: Memory: 7.80GiB
15:45:26:Free Memory: 6.92GiB
15:45:26: Threads: POSIX_THREADS
15:45:26: OS Version: 3.13
15:45:26:Has Battery: false
15:45:26: On Battery: false
15:45:26: UTC Offset: -6
15:45:26: PID: 2649
15:45:26: CWD: /var/lib/fahclient
15:45:26: OS: Linux 3.13.0-24-generic x86_64
15:45:26: OS Arch: AMD64
15:45:26: GPUs: 1
15:45:26: GPU 0: NVIDIA:4 GM107 [GeForce GTX 750 Ti]
15:45:26: CUDA: 5.0
15:45:26:CUDA Driver: 6050
15:45:26:***********************************************************************
15:45:26:<config>
15:45:26: <!-- Client Control -->
15:45:26: <fold-anon v='true'/>
15:45:26:
15:45:26: <!-- Network -->
15:45:26: <proxy v=':8080'/>
15:45:26:
15:45:26: <!-- Slot Control -->
15:45:26: <power v='full'/>
15:45:26:
15:45:26: <!-- User Information -->
15:45:26: <passkey v='********************************'/>
15:45:26: <team v='37726'/>
15:45:26: <user v='bfromcolo'/>
15:45:26:
15:45:26: <!-- Folding Slots -->
15:45:26: <slot id='1' type='GPU'/>
15:45:26:</config>
15:45:26:Switching to user fahclient
15:45:26:Trying to access database...
15:45:27:Successfully acquired database lock
15:45:27:Enabled folding slot 01: READY gpu:0:GM107 [GeForce GTX 750 Ti]
15:45:27:WU00:FS01:Connecting to 171.67.108.201:80
15:45:28:WU00:FS01:Assigned to work server 140.163.4.231
15:45:28:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM107 [GeForce GTX 750 Ti] from 140.163.4.231
15:45:28:WU00:FS01:Connecting to 140.163.4.231:8080
15:45:29:WU00:FS01:Downloading 4.84MiB
15:45:35:WU00:FS01:Download 71.05%
15:45:37:WU00:FS01:Download complete
15:45:37:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13001 run:378 clone:1 gen:68 core:0x17 unit:0x00000096538b3db75328bad892c4b6cd
15:45:38:WU00:FS01:Starting
15:45:38:WU00:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17 -dir 00 -suffix 01 -version 704 -lifeline 2649 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
15:45:38:WU00:FS01:Started FahCore on PID 2667
15:45:38:WU00:FS01:Core PID:2671
15:45:38:WU00:FS01:FahCore 0x17 started
15:45:38:WU00:FS01:0x17:*********************** Log Started 2014-10-04T15:45:38Z ***********************
15:45:38:WU00:FS01:0x17:Project: 13001 (Run 378, Clone 1, Gen 68)
15:45:38:WU00:FS01:0x17:Unit: 0x00000096538b3db75328bad892c4b6cd
15:45:38:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
15:45:38:WU00:FS01:0x17:Machine: 1
15:45:38:WU00:FS01:0x17:Reading tar file state.xml
15:45:39:WU00:FS01:0x17:Reading tar file system.xml
15:45:39:WU00:FS01:0x17:Reading tar file integrator.xml
15:45:39:WU00:FS01:0x17:Reading tar file core.xml
15:45:39:WU00:FS01:0x17:Digital signatures verified
15:49:07:WU00:FS01:0x17:ERROR:exception: Force RMSE error of 447.223 with threshold of 5
15:49:07:WU00:FS01:0x17:Saving result file logfile_01.txt
15:49:07:WU00:FS01:0x17:Saving result file badStateCheckpoint_57114166
15:49:08:WU00:FS01:0x17:Saving result file badStateForceGroup0_57114166Core.xml
15:49:11:WU00:FS01:0x17:Saving result file badStateForceGroup0_57114166Ref.xml
15:49:14:WU00:FS01:0x17:Saving result file badStateForceGroup1_57114166Core.xml
15:49:16:WU00:FS01:0x17:Saving result file badStateForceGroup1_57114166Ref.xml
15:49:19:WU00:FS01:0x17:Saving result file badStateForceGroup2_57114166Core.xml
15:49:21:WU00:FS01:0x17:Saving result file badStateForceGroup2_57114166Ref.xml
15:49:23:WU00:FS01:0x17:Saving result file log.txt
15:49:23:WU00:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
15:49:24:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:49:24:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:13001 run:378 clone:1 gen:68 core:0x17 unit:0x00000096538b3db75328bad892c4b6cd
15:49:24:WU00:FS01:Uploading 24.64MiB to 140.163.4.231
15:49:24:WU00:FS01:Connecting to 140.163.4.231:8080
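
When reporting a failure like this, it can help to pull the key facts out of the log in one go: which project/run/clone/gen failed, the RMSE value and threshold, and the code the FahCore returned. The following is a minimal sketch that matches only the line shapes visible in the log above. The function name `summarize_failure` and the dictionary keys are made up for illustration, and other client versions may format these lines differently.

```python
import re

# Patterns taken from the line formats seen in the log above.
RMSE_RE = re.compile(
    r"Force RMSE error of (?P<rmse>[0-9.]+) with threshold of (?P<threshold>[0-9.]+)"
)
RETURN_RE = re.compile(r"FahCore returned: (?P<name>\w+) \((?P<code>\d+) = 0x[0-9a-fA-F]+\)")
WU_RE = re.compile(
    r"project:(?P<project>\d+) run:(?P<run>\d+) clone:(?P<clone>\d+) gen:(?P<gen>\d+)"
)

def summarize_failure(log_text):
    """Collect WU identity, RMSE failure details, and the core return code."""
    summary = {}
    for line in log_text.splitlines():
        m = WU_RE.search(line)
        if m:
            for key in ("project", "run", "clone", "gen"):
                summary[key] = int(m[key])
        m = RMSE_RE.search(line)
        if m:
            summary["rmse"] = float(m["rmse"])
            summary["threshold"] = float(m["threshold"])
        m = RETURN_RE.search(line)
        if m:
            summary["return_name"] = m["name"]
            summary["return_code"] = int(m["code"])
    return summary

if __name__ == "__main__":
    sample = "\n".join([
        "15:49:07:WU00:FS01:0x17:ERROR:exception: Force RMSE error of 447.223 with threshold of 5",
        "15:49:24:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)",
        "15:49:24:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY "
        "project:13001 run:378 clone:1 gen:68 core:0x17 "
        "unit:0x00000096538b3db75328bad892c4b6cd",
    ])
    print(summarize_failure(sample))
```

The project, run, clone, and gen numbers are usually what others need in order to check whether the same work unit failed elsewhere.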