Sys:
i7 2600K@4300MHz, MSI N670 PE Factory-OC (but stable), Ubuntu 13.04, nVidia-Driver 313.30
Folded since several days (internal and 7810/7811) without any problem, but the last WU stopped at 72% with:
Bad State detected . . .
Log:
Code: Select all
*********************** Log Started 2013-10-05T22:38:12Z ***********************
22:38:12:************************* Folding@home Client *************************
22:38:12: Website: http://folding.stanford.edu/
22:38:12: Copyright: (c) 2009-2013 Stanford University
22:38:12: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
22:38:12: Args: --child --lifeline 1016 /etc/fahclient/config.xml --run-as
22:38:12: fahclient --pid-file=/var/run/fahclient.pid --daemon
22:38:12: Config: /etc/fahclient/config.xml
22:38:12:******************************** Build ********************************
22:38:12: Version: 7.3.6
22:38:12: Date: Feb 18 2013
22:38:12: Time: 07:24:08
22:38:12: SVN Rev: 3923
22:38:12: Branch: fah/trunk/client
22:38:12: Compiler: GNU 4.4.7
22:38:12: Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
22:38:12: -fno-unsafe-math-optimizations -msse2
22:38:12: Platform: linux2 3.2.0-1-amd64
22:38:12: Bits: 64
22:38:12: Mode: Release
22:38:12:******************************* System ********************************
22:38:12: CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
22:38:12: CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
22:38:12: CPUs: 8
22:38:12: Memory: 7.77GiB
22:38:12:Free Memory: 7.52GiB
22:38:12: Threads: POSIX_THREADS
22:38:12:Has Battery: false
22:38:12: On Battery: false
22:38:12: UTC offset: 2
22:38:12: PID: 1052
22:38:12: CWD: /var/lib/fahclient
22:38:12: OS: Linux 3.8.0-31-generic x86_64
22:38:12: OS Arch: AMD64
22:38:12: GPUs: 1
22:38:12: GPU 0: NVIDIA:3 GK104 [GeForce GTX 670]
22:38:12: CUDA: Not detected
22:38:12:***********************************************************************
. . .
22:35:14:<config>
22:35:14: <!-- Client Control -->
22:35:14: <fold-anon v='true'/>
22:35:14:
22:35:14: <!-- Folding Core -->
22:35:14: <core-priority v='low'/>
22:35:14:
22:35:14: <!-- Folding Slot Configuration -->
22:35:14: <power v='full'/>
22:35:14:
22:35:14: <!-- HTTP Server -->
22:35:14: <allow v='127.0.0.1 192.168.2.100-192.168.2.110'/>
22:35:14:
22:35:14: <!-- Logging -->
22:35:14: <log-rotate-max v='50'/>
22:35:14:
22:35:14: <!-- Network -->
22:35:14: <proxy v=':8080'/>
22:35:14:
22:35:14: <!-- Remote Command Server -->
22:35:14: <command-allow-no-pass v='127.0.0.1 192.168.2.100-192.168.2.110'/>
22:35:14:
22:35:14: <!-- User Information -->
22:35:14: <passkey v='********************************'/>
22:35:14: <team v='70335'/>
22:35:14: <user v='folding_hoomer'/>
22:35:14:
22:35:14: <!-- Folding Slots -->
22:35:14: <slot id='0' type='CPU'>
22:35:14: <client-type v='beta'/>
22:35:14: <cpus v='6'/>
22:35:14: <next-unit-percentage v='100'/>
22:35:14: <pause-on-start v='true'/>
22:35:14: </slot>
22:35:14: <slot id='1' type='GPU'>
22:35:14: <client-type v='beta'/>
22:35:14: <next-unit-percentage v='100'/>
22:35:14: <pause-on-start v='true'/>
22:35:14: </slot>
22:35:14:</config>
. . .
08:10:53:WU01:FS01:0x17:Completed 1980000 out of 2000000 steps (99%)
08:12:32:WU01:FS01:0x17:Completed 2000000 out of 2000000 steps (100%)
08:12:33:WU02:FS01:Connecting to assign-GPU.stanford.edu:80
08:12:35:WU02:FS01:News: Welcome to Folding@Home
08:12:35:WU02:FS01:Assigned to work server 171.64.65.98
08:12:35:WU02:FS01:Requesting new work unit for slot 01: RUNNING gpu:0:GK104 [GeForce GTX 670] from 171.64.65.98
08:12:35:WU02:FS01:Connecting to 171.64.65.98:8080
08:12:36:WU02:FS01:Downloading 2.09MiB
08:12:36:WU01:FS01:0x17:Saving result file logfile_01.txt
08:12:36:WU01:FS01:0x17:Saving result file checkpointState.xml
08:12:37:WU01:FS01:0x17:Saving result file checkpt.crc
08:12:37:WU01:FS01:0x17:Saving result file log.txt
08:12:37:WU01:FS01:0x17:Saving result file positions.xtc
08:12:37:WU01:FS01:0x17:Folding@home Core Shutdown: FINISHED_UNIT
08:12:38:WU01:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
08:12:38:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:7811 run:0 clone:452 gen:364 core:0x17 unit:0x000001810a3b1e8651db4a3592af4da6
08:12:38:WU01:FS01:Uploading 4.29MiB to 171.64.65.98
08:12:38:WU01:FS01:Connecting to 171.64.65.98:8080
08:12:42:WU02:FS01:Download 95.68%
08:12:42:WU02:FS01:Download complete
08:12:42:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:7810 run:0 clone:69 gen:273 core:0x17 unit:0x000001270a3b1e8651d3466aaac4a564
08:12:42:WU02:FS01:Starting
08:12:42:WU02:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/beta/Core_17.fah/FahCore_17 -dir 02 -suffix 01 -version 703 -lifeline 1052 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
08:12:42:WU02:FS01:Started FahCore on PID 6221
08:12:42:WU02:FS01:Core PID:6225
08:12:42:WU02:FS01:FahCore 0x17 started
08:12:42:WU02:FS01:0x17:*********************** Log Started 2013-10-10T08:12:42Z ***********************
08:12:42:WU02:FS01:0x17:Project: 7810 (Run 0, Clone 69, Gen 273)
08:12:42:WU02:FS01:0x17:Unit: 0x000001270a3b1e8651d3466aaac4a564
08:12:42:WU02:FS01:0x17:CPU: 0x00000000000000000000000000000000
08:12:42:WU02:FS01:0x17:Machine: 1
08:12:42:WU02:FS01:0x17:Reading tar file state.xml
08:12:42:WU02:FS01:0x17:Reading tar file system.xml
08:12:42:WU02:FS01:0x17:Reading tar file integrator.xml
08:12:42:WU02:FS01:0x17:Reading tar file core.xml
08:12:42:WU02:FS01:0x17:Digital signatures verified
08:12:44:WU01:FS01:Upload 2.91%
08:12:50:WU01:FS01:Upload 16.01%
08:12:56:WU01:FS01:Upload 27.66%
08:13:03:WU01:FS01:Upload 36.40%
08:13:07:WU02:FS01:0x17:Completed 0 out of 2000000 steps (0%)
08:13:09:WU01:FS01:Upload 48.04%
08:13:15:WU01:FS01:Upload 59.69%
08:13:22:WU01:FS01:Upload 69.88%
08:13:28:WU01:FS01:Upload 82.98%
08:13:34:WU01:FS01:Upload 91.72%
08:13:42:WU01:FS01:Upload complete
08:13:42:WU01:FS01:Server responded WORK_ACK (400)
08:13:42:WU01:FS01:Final credit estimate, 10022.00 points
08:13:42:WU01:FS01:Cleaning up
08:15:25:WU02:FS01:0x17:Completed 20000 out of 2000000 steps (1%)
08:17:38:WU02:FS01:0x17:Completed 40000 out of 2000000 steps (2%)
08:19:56:WU02:FS01:0x17:Completed 60000 out of 2000000 steps (3%)
. . .
10:48:32:WU02:FS01:0x17:Completed 1380000 out of 2000000 steps (69%)
10:50:45:WU02:FS01:0x17:Completed 1400000 out of 2000000 steps (70%)
10:53:03:WU02:FS01:0x17:Completed 1420000 out of 2000000 steps (71%)
******************************* Date: 2013-10-10 *******************************
10:55:17:WU02:FS01:0x17:Completed 1440000 out of 2000000 steps (72%)
10:56:36:WU02:FS01:0x17:Bad State detected... attempting to resume from last good checkpoint
10:58:51:WU02:FS01:0x17:Completed 1420000 out of 2000000 steps (71%)
11:01:05:WU02:FS01:0x17:Completed 1440000 out of 2000000 steps (72%)
11:02:25:WU02:FS01:0x17:Bad State detected... attempting to resume from last good checkpoint
11:04:40:WU02:FS01:0x17:Completed 1420000 out of 2000000 steps (71%)
11:06:55:WU02:FS01:0x17:Completed 1440000 out of 2000000 steps (72%)
11:08:14:WU02:FS01:0x17:Bad State detected... attempting to resume from last good checkpoint
11:08:14:WU02:FS01:0x17:Max number of retries reached. Aborting.
11:08:14:WU02:FS01:0x17:ERROR:exception: Max Retries Reached
11:08:14:WU02:FS01:0x17:Saving result file logfile_01.txt
11:08:14:WU02:FS01:0x17:Saving result file badStateCheckpoint_1538501661
11:08:15:WU02:FS01:0x17:Saving result file badStateCheckpoint_1543817796
11:08:16:WU02:FS01:0x17:Saving result file badStateCheckpoint_1829727287
11:08:17:WU02:FS01:0x17:Saving result file badStateForceGroup0_1538501661Core.xml
11:08:18:WU02:FS01:0x17:Saving result file badStateForceGroup0_1538501661Ref.xml
11:08:20:WU02:FS01:0x17:Saving result file badStateForceGroup0_1543817796Core.xml
11:08:21:WU02:FS01:0x17:Saving result file badStateForceGroup0_1543817796Ref.xml
11:08:22:WU02:FS01:0x17:Saving result file badStateForceGroup0_1829727287Core.xml
11:08:24:WU02:FS01:0x17:Saving result file badStateForceGroup0_1829727287Ref.xml
11:08:25:WU02:FS01:0x17:Saving result file badStateForceGroup1_1538501661Core.xml
11:08:26:WU02:FS01:0x17:Saving result file badStateForceGroup1_1538501661Ref.xml
11:08:28:WU02:FS01:0x17:Saving result file badStateForceGroup1_1543817796Core.xml
11:08:29:WU02:FS01:0x17:Saving result file badStateForceGroup1_1543817796Ref.xml
11:08:30:WU02:FS01:0x17:Saving result file badStateForceGroup1_1829727287Core.xml
11:08:31:WU02:FS01:0x17:Saving result file badStateForceGroup1_1829727287Ref.xml
11:08:32:WU02:FS01:0x17:Saving result file badStateForceGroup2_1538501661Core.xml
11:08:33:WU02:FS01:0x17:Saving result file badStateForceGroup2_1538501661Ref.xml
11:08:34:WU02:FS01:0x17:Saving result file badStateForceGroup2_1543817796Core.xml
11:08:36:WU02:FS01:0x17:Saving result file badStateForceGroup2_1543817796Ref.xml
11:08:37:WU02:FS01:0x17:Saving result file badStateForceGroup2_1829727287Core.xml
11:08:38:WU02:FS01:0x17:Saving result file badStateForceGroup2_1829727287Ref.xml
11:08:39:WU02:FS01:0x17:Saving result file log.txt
11:08:39:WU02:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
11:08:39:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
11:08:39:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:7810 run:0 clone:69 gen:273 core:0x17 unit:0x000001270a3b1e8651d3466aaac4a564
11:08:39:WU02:FS01:Uploading 37.23MiB to 171.64.65.98
11:08:39:WU02:FS01:Connecting to 171.64.65.98:8080
. . .
11:08:45:WU02:FS01:Upload 0.34%
11:08:51:WU02:FS01:Upload 0.67%
. . .
11:16:52:WU02:FS01:Upload 98.71%
11:16:58:WU02:FS01:Upload 99.55%
11:17:03:WU02:FS01:Upload complete
11:17:03:WU02:FS01:Server responded WORK_ACK (400)
11:17:03:WU02:FS01:Cleaning up