Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 11, 2013 4:06 am
by ChristianVirtual
One faulty 7810, also after 72%, like folding_hoomer before ...

Latest NV driver 331.20

Code:

*********************** Log Started 2013-11-08T04:30:28Z ***********************
04:30:28:************************* Folding@home Client *************************
04:30:28:    Website: http://folding.stanford.edu/
04:30:28:  Copyright: (c) 2009-2013 Stanford University
04:30:28:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
04:30:28:       Args: --child --lifeline 1113 /etc/fahclient/config.xml --run-as
04:30:28:             fahclient --pid-file=/var/run/fahclient.pid --daemon
04:30:28:     Config: /etc/fahclient/config.xml
04:30:28:******************************** Build ********************************
04:30:28:    Version: 7.3.6
04:30:28:       Date: Feb 18 2013
04:30:28:       Time: 07:24:08
04:30:28:    SVN Rev: 3923
04:30:28:     Branch: fah/trunk/client
04:30:28:   Compiler: GNU 4.4.7
04:30:28:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
04:30:28:             -fno-unsafe-math-optimizations -msse2
04:30:28:   Platform: linux2 3.2.0-1-amd64
04:30:28:       Bits: 64
04:30:28:       Mode: Release
04:30:28:******************************* System ********************************
04:30:28:        CPU: Intel(R) Core(TM) i7-2600S CPU @ 2.80GHz
04:30:28:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
04:30:28:       CPUs: 8
04:30:28:     Memory: 7.74GiB
04:30:28:Free Memory: 7.38GiB
04:30:28:    Threads: POSIX_THREADS
04:30:28:Has Battery: false
04:30:28: On Battery: false
04:30:28: UTC offset: 9
04:30:28:        PID: 1243
04:30:28:        CWD: /var/lib/fahclient
04:30:28:         OS: Linux 3.8.0-32-generic x86_64
04:30:28:    OS Arch: AMD64
04:30:28:       GPUs: 3
04:30:28:      GPU 0: NVIDIA:3 GK110 [GeForce GTX 780]
04:30:28:      GPU 1: NVIDIA:3 GK110 [GeForce GTX 780]
04:30:28:      GPU 2: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
04:30:28:       CUDA: 3.5
04:30:28:CUDA Driver: 6000
04:30:28:***********************************************************************
04:30:28:<config>
04:30:28:  <!-- Folding Slot Configuration -->
04:30:28:  <gpu v='true'/>
04:30:28:  <power v='full'/>
04:30:28:
04:30:28:  <!-- HTTP Server -->
04:30:28:
04:30:28:  <!-- Logging -->
04:30:28:  <log-rotate-max v='1024'/>
04:30:28:
04:30:28:  <!-- Network -->
04:30:28:  <proxy v=':8080'/>
04:30:28:
04:30:28:  <!-- User Information -->
04:30:28:  <team v='3446'/>
04:30:28:  <user v='ChristianFAH'/>
04:30:28:
04:30:28:  <!-- Folding Slots -->
04:30:28:  <slot id='0' type='CPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <cpus v='4'/>
04:30:28:    <next-unit-percentage v='99'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:  <slot id='1' type='GPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:  <slot id='2' type='GPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:  <slot id='3' type='GPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:</config>
04:30:28:Switching to user fahclient
04:30:28:Trying to access database...
04:30:28:Successfully acquired database lock
04:30:28:Enabled folding slot 00: PAUSED cpu:4 (paused)
04:30:28:Enabled folding slot 01: PAUSED gpu:0:GK110 [GeForce GTX 780] (paused)
04:30:28:Enabled folding slot 02: PAUSED gpu:1:GK110 [GeForce GTX 780] (paused)
04:30:28:Enabled folding slot 03: PAUSED gpu:2:GK104 [GeForce GTX 660 Ti] (paused)
04:36:01:FS01:Unpaused

...

18:44:02:WU01:FS02:0x17:Saving result file log.txt
18:44:02:WU01:FS02:0x17:Saving result file positions.xtc
18:44:03:WU01:FS02:0x17:Folding@home Core Shutdown: FINISHED_UNIT
18:44:03:WU01:FS02:FahCore returned: FINISHED_UNIT (100 = 0x64)
18:44:03:WU01:FS02:Sending unit results: id:01 state:SEND error:NO_ERROR project:7810 run:0 clone:694 gen:292 core:0x17 unit:0x0000013e0a3b1e8651d34d8137690c39
18:44:03:WU01:FS02:Uploading 5.74MiB to 171.64.65.98
18:44:03:WU01:FS02:Connecting to 171.64.65.98:8080
18:44:06:WU00:FS02:Download 29.89%
18:44:12:WU00:FS02:Download 56.78%
18:44:12:WU01:FS02:Upload complete
18:44:12:WU01:FS02:Server responded WORK_ACK (400)
18:44:12:WU01:FS02:Final credit estimate, 16751.00 points
18:44:12:WU01:FS02:Cleaning up
18:44:18:WU00:FS02:Download 92.65%
18:44:18:WU00:FS02:Download complete


18:44:18:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:7810 run:0 clone:66 gen:402 core:0x17 unit:0x000001b50a3b1e8651d34661a08b8f39
18:44:18:WU00:FS02:Starting
18:44:18:WU00:FS02:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/beta/Core_17.fah/FahCore_17 -dir 00 -suffix 01 -version 703 -lifeline 1243 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
18:44:18:WU00:FS02:Started FahCore on PID 7316
18:44:18:WU00:FS02:Core PID:7320
18:44:18:WU00:FS02:FahCore 0x17 started
18:44:19:WU00:FS02:0x17:*********************** Log Started 2013-11-10T18:44:18Z ***********************
18:44:19:WU00:FS02:0x17:Project: 7810 (Run 0, Clone 66, Gen 402)
18:44:19:WU00:FS02:0x17:Unit: 0x000001b50a3b1e8651d34661a08b8f39
18:44:19:WU00:FS02:0x17:CPU: 0x00000000000000000000000000000000
18:44:19:WU00:FS02:0x17:Machine: 2
18:44:19:WU00:FS02:0x17:Reading tar file state.xml
18:44:19:WU00:FS02:0x17:Reading tar file system.xml
18:44:19:WU00:FS02:0x17:Reading tar file integrator.xml
18:44:19:WU00:FS02:0x17:Reading tar file core.xml
18:44:19:WU00:FS02:0x17:Digital signatures verified
18:44:38:WU00:FS02:0x17:Completed 0 out of 2000000 steps (0%)
18:46:12:WU00:FS02:0x17:Completed 20000 out of 2000000 steps (1%)
18:47:43:WU00:FS02:0x17:Completed 40000 out of 2000000 steps (2%)
18:48:22:WU02:FS01:0x17:Completed 880000 out of 2000000 steps (44%)
18:49:17:WU00:FS02:0x17:Completed 60000 out of 2000000 steps (3%)
18:49:56:WU02:FS01:0x17:Completed 900000 out of 2000000 steps (45%)
18:50:48:WU00:FS02:0x17:Completed 80000 out of 2000000 steps (4%)
18:51:32:WU02:FS01:0x17:Completed 920000 out of 2000000 steps (46%)
18:52:20:WU00:FS02:0x17:Completed 100000 out of 2000000 steps (5%)
18:53:06:WU02:FS01:0x17:Completed 940000 out of 2000000 steps (47%)
18:53:54:WU00:FS02:0x17:Completed 120000 out of 2000000 steps (6%)
18:54:43:WU02:FS01:0x17:Completed 960000 out of 2000000 steps (48%)
18:55:25:WU00:FS02:0x17:Completed 140000 out of 2000000 steps (7%)
18:56:17:WU02:FS01:0x17:Completed 980000 out of 2000000 steps (49%)
18:56:59:WU00:FS02:0x17:Completed 160000 out of 2000000 steps (8%)

...

19:32:24:WU00:FS02:0x17:Completed 620000 out of 2000000 steps (31%)
19:32:38:WU02:FS01:0x17:Completed 1440000 out of 2000000 steps (72%)
19:33:56:WU00:FS02:0x17:Completed 640000 out of 2000000 steps (32%)
19:34:15:WU02:FS01:0x17:Completed 1460000 out of 2000000 steps (73%)
19:34:49:WU00:FS02:0x17:Bad State detected... attempting to resume from last good checkpoint
19:35:48:WU02:FS01:0x17:Completed 1480000 out of 2000000 steps (74%)
19:36:20:WU00:FS02:0x17:Completed 620000 out of 2000000 steps (31%)
19:37:22:WU02:FS01:0x17:Completed 1500000 out of 2000000 steps (75%)
19:37:52:WU00:FS02:0x17:Completed 640000 out of 2000000 steps (32%)
19:38:45:WU00:FS02:0x17:Bad State detected... attempting to resume from last good checkpoint
19:38:59:WU02:FS01:0x17:Completed 1520000 out of 2000000 steps (76%)
19:40:17:WU00:FS02:0x17:Completed 620000 out of 2000000 steps (31%)
19:40:32:WU02:FS01:0x17:Completed 1540000 out of 2000000 steps (77%)
19:41:48:WU00:FS02:0x17:Completed 640000 out of 2000000 steps (32%)
19:42:09:WU02:FS01:0x17:Completed 1560000 out of 2000000 steps (78%)
19:42:42:WU00:FS02:0x17:Bad State detected... attempting to resume from last good checkpoint
19:42:42:WU00:FS02:0x17:Max number of retries reached. Aborting.
19:42:42:WU00:FS02:0x17:ERROR:exception: Max Retries Reached
19:42:42:WU00:FS02:0x17:Saving result file logfile_01.txt
19:42:42:WU00:FS02:0x17:Saving result file badStateCheckpoint_416496834
19:42:42:WU00:FS02:0x17:Saving result file badStateCheckpoint_798946464
19:42:43:WU00:FS02:0x17:Saving result file badStateCheckpoint_8351929
19:42:43:WU00:FS02:0x17:Saving result file badStateForceGroup0_416496834Core.xml
19:42:44:WU00:FS02:0x17:Saving result file badStateForceGroup0_416496834Ref.xml
19:42:45:WU00:FS02:0x17:Saving result file badStateForceGroup0_798946464Core.xml
19:42:46:WU00:FS02:0x17:Saving result file badStateForceGroup0_798946464Ref.xml
19:42:47:WU00:FS02:0x17:Saving result file badStateForceGroup0_8351929Core.xml
19:42:48:WU00:FS02:0x17:Saving result file badStateForceGroup0_8351929Ref.xml
19:42:48:WU00:FS02:0x17:Saving result file badStateForceGroup1_416496834Core.xml
19:42:49:WU00:FS02:0x17:Saving result file badStateForceGroup1_416496834Ref.xml
19:42:50:WU00:FS02:0x17:Saving result file badStateForceGroup1_798946464Core.xml
19:42:51:WU00:FS02:0x17:Saving result file badStateForceGroup1_798946464Ref.xml
19:42:52:WU00:FS02:0x17:Saving result file badStateForceGroup1_8351929Core.xml
19:42:52:WU00:FS02:0x17:Saving result file badStateForceGroup1_8351929Ref.xml
19:42:53:WU00:FS02:0x17:Saving result file badStateForceGroup2_416496834Core.xml
19:42:54:WU00:FS02:0x17:Saving result file badStateForceGroup2_416496834Ref.xml
19:42:55:WU00:FS02:0x17:Saving result file badStateForceGroup2_798946464Core.xml
19:42:55:WU00:FS02:0x17:Saving result file badStateForceGroup2_798946464Ref.xml
19:42:56:WU00:FS02:0x17:Saving result file badStateForceGroup2_8351929Core.xml
19:42:57:WU00:FS02:0x17:Saving result file badStateForceGroup2_8351929Ref.xml
19:42:58:WU00:FS02:0x17:Saving result file log.txt
19:42:58:WU00:FS02:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
19:42:58:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:42:58:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:7810 run:0 clone:66 gen:402 core:0x17 unit:0x000001b50a3b1e8651d34661a08b8f39
19:42:58:WU00:FS02:Uploading 37.33MiB to 171.64.65.98
19:42:58:WU00:FS02:Connecting to 171.64.65.98:8080
19:42:58:WU01:FS02:Connecting to assign-GPU.stanford.edu:80
19:42:59:WU01:FS02:News: Welcome to Folding@Home
19:42:59:WU01:FS02:Assigned to work server 171.64.65.98
19:42:59:WU01:FS02:Requesting new work unit for slot 02: READY gpu:1:GK110 [GeForce GTX 780] from 171.64.65.98
19:42:59:WU01:FS02:Connecting to 171.64.65.98:8080
19:42:59:WU01:FS02:Downloading 2.07MiB
19:43:04:WU00:FS02:Upload 17.75%
19:43:05:WU01:FS02:Download 24.10%
19:43:10:WU00:FS02:Upload 34.99%
19:43:11:WU01:FS02:Download 60.25%
19:43:16:WU00:FS02:Upload 50.73%
19:43:17:WU01:FS02:Download 84.36%
19:43:21:WU01:FS02:Download complete
19:43:21:WU01:FS02:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:7810 run:0 clone:465 gen:423 core:0x17 unit:0x000001c70a3b1e8651d34ae9e3c92149
19:43:21:WU01:FS02:Starting
19:43:21:WU01:FS02:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/beta/Core_17.fah/FahCore_17 -dir 01 -suffix 01 -version 703 -lifeline 1243 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
19:43:21:WU01:FS02:Started FahCore on PID 10881
19:43:21:WU01:FS02:Core PID:10885
19:43:21:WU01:FS02:FahCore 0x17 started
19:43:22:WU01:FS02:0x17:*********************** Log Started 2013-11-10T19:43:21Z ***********************
19:43:22:WU01:FS02:0x17:Project: 7810 (Run 0, Clone 465, Gen 423)
19:43:22:WU01:FS02:0x17:Unit: 0x000001c70a3b1e8651d34ae9e3c92149
19:43:22:WU01:FS02:0x17:CPU: 0x00000000000000000000000000000000
19:43:22:WU01:FS02:0x17:Machine: 2
19:43:22:WU01:FS02:0x17:Reading tar file state.xml
19:43:22:WU01:FS02:0x17:Reading tar file system.xml
19:43:22:WU01:FS02:0x17:Reading tar file integrator.xml
19:43:22:WU01:FS02:0x17:Reading tar file core.xml
19:43:22:WU01:FS02:0x17:Digital signatures verified
19:43:22:WU00:FS02:Upload 61.28%
19:43:28:WU00:FS02:Upload 71.83%
19:43:34:WU00:FS02:Upload 83.55%
19:43:40:WU00:FS02:Upload 95.10%
19:43:41:WU01:FS02:0x17:Completed 0 out of 2000000 steps (0%)
19:43:42:WU02:FS01:0x17:Completed 1580000 out of 2000000 steps (79%)
19:43:43:WU00:FS02:Upload complete
19:43:43:WU00:FS02:Server responded WORK_ACK (400)
19:43:43:WU00:FS02:Cleaning up
19:45:14:WU01:FS02:0x17:Completed 20000 out of 2000000 steps (1%)
Before and after that WU the GPU was working OK; bad WU?

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 11, 2013 4:40 am
by P5-133XL
Looks bad; it has failed for seven people so far. I marked it bad, so it won't be given out anymore.

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 11, 2013 7:38 am
by ChristianVirtual
Thank you for the quick action, P5-133XL; good to know it will not go out to others as well.

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 18, 2013 11:46 am
by Hans
I'm still folding one, and 7811 is doing the same.

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 18, 2013 12:01 pm
by ChristianVirtual
Which Run/Clone/Gen?

Can you share the relevant parts of the log file, incl. the config? That makes it easier to get it checked ...

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 18, 2013 12:21 pm
by Hans
18:56:28:WU01:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
18:56:28:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
18:56:28:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:7811 run:0 clone:110 gen:536 core:0x17 unit:0x000002490a3b1e8651db472bbc4d0986

4 hours 16 mins 67658 7810 (0, 262, 516)
1 hours 00 mins 115082 7811 (0, 188, 538)

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 18, 2013 3:50 pm
by ChristianVirtual
What GPU and OS are you using? Is it an overclocked version? Can you share the first part of the log file with the configuration? Just remove your password or passkeys if visible.

As for checking the projects: this needs a helping hand from a mod to check whether they were completed by others ...
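
If you want to be sure nothing sensitive slips through before posting, something like the following could blank those values out. This is only a minimal sketch: it assumes the password and passkey show up in the config dump as `<password v='...'/>` and `<passkey v='...'/>` elements (same style as the `<user v='...'/>` line in the log above), and `redact_secrets` is a made-up helper name, not part of the client.

```python
import re

# Assumption: secrets appear as <passkey v='...'/> or <password v='...'/>,
# in the same style as the other config elements in the log dump.
SECRET_RE = re.compile(r"(<(?:passkey|password)\s+v=)(['\"]).*?\2")


def redact_secrets(text):
    """Replace passkey/password values with asterisks, leaving all else intact."""
    return SECRET_RE.sub(r"\1\2*****\2", text)


if __name__ == "__main__":
    sample = "04:30:28:  <passkey v='0123456789abcdef'/>"
    print(redact_secrets(sample))
```

Other config lines such as team and user are left untouched, so the log stays useful for whoever is checking it.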

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 18, 2013 3:56 pm
by P5-133XL
Someone else successfully completed project:7811 run:0 clone:110 gen:536

The problem is on your end rather than a faulty WU.

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 18, 2013 4:50 pm
by Hans
Sure, whatever :)
The faulty WU is info from the log file, so?
I've also finished a lot of the 7810 and 7811.

So both WUs can have similar problems??

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Mon Nov 18, 2013 10:18 pm
by bruce
Unfortunately, the BAD_WORK_UNIT characterization does not guarantee that the WU was corrupt when it was downloaded. Things can happen to them after they're downloaded. Two somewhat common reports: (1) An AV program decides something looks like a virus and "protects" your system by corrupting some of FAH's data. (2) Dust, overclocking, or fan limitations push the temperatures in a system too close to unstable (or a voltage glitch or ??) and a hardware error occurs. Though it may not crash the OS, corrupt data may be introduced into the calculations, which is detected by FAH's quality assurance software.

Failure of two WUs that did not fail on other hardware is not definitive proof of anything, but I'd take it as a strong indication that something in your system is too close to instability.
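
To see which units a machine has returned as faulty, so they can be compared against other people's results, the client log can be scanned for the "Sending unit results" lines. A rough sketch, assuming the line format quoted in this thread; `find_faulty_units` is a hypothetical helper name.

```python
import re

# Matches "Sending unit results" lines that carry error:FAULTY,
# in the format quoted earlier in this thread.
FAULTY_RE = re.compile(
    r"^\d{2}:\d{2}:\d{2}:WU\d+:(?P<slot>FS\d+):Sending unit results: "
    r".*error:FAULTY project:(?P<project>\d+) run:(?P<run>\d+) "
    r"clone:(?P<clone>\d+) gen:(?P<gen>\d+)"
)


def find_faulty_units(lines):
    """Return (slot, project, run, clone, gen) for every unit sent back as FAULTY."""
    found = []
    for line in lines:
        m = FAULTY_RE.match(line.strip())
        if m:
            found.append(
                (
                    m.group("slot"),
                    int(m.group("project")),
                    int(m.group("run")),
                    int(m.group("clone")),
                    int(m.group("gen")),
                )
            )
    return found
```

Feeding it the lines of a log file (for example `find_faulty_units(open(path))`, with `path` pointing at your own log) gives a list that can be posted here for a mod to check.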

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Tue Nov 19, 2013 10:33 am
by Hans
It's a brand new machine and working 24/7 ....

Another 7811 last night.

Code:

03:15:35:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/admin/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_17.fah/FahCore_17.exe -dir 02 -suffix 01 -version 703 -lifeline 4744 -checkpoint 15 -gpu 1 -gpu-vendor ati
03:15:35:WU02:FS00:Started FahCore on PID 4392
03:15:35:WU02:FS00:Core PID:2236
03:15:35:WU02:FS00:FahCore 0x17 started
03:15:35:WU02:FS00:0x17:*********************** Log Started 2013-11-19T03:15:35Z ***********************
03:15:35:WU02:FS00:0x17:Project: 7811 (Run 0, Clone 524, Gen 491)
03:15:35:WU02:FS00:0x17:Unit: 0x000002180a3b1e8651db4ad9849b66d4
03:15:35:WU02:FS00:0x17:CPU: 0x00000000000000000000000000000000
03:15:35:WU02:FS00:0x17:Machine: 0
03:15:35:WU02:FS00:0x17:Reading tar file state.xml
03:15:35:WU02:FS00:0x17:Reading tar file system.xml
03:15:36:WU02:FS00:0x17:Reading tar file integrator.xml
03:15:36:WU02:FS00:0x17:Reading tar file core.xml
03:15:36:WU02:FS00:0x17:Digital signatures verified
03:15:41:WU01:FS00:Upload 7.59%
03:15:47:WU01:FS00:Upload 15.18%
03:15:53:WU01:FS00:Upload 22.77%
03:15:59:WU01:FS00:Upload 29.28%
03:16:01:WU02:FS00:0x17:Completed 0 out of 2000000 steps (0%)
03:16:05:WU01:FS00:Upload 36.87%
03:16:11:WU01:FS00:Upload 44.46%
03:16:17:WU01:FS00:Upload 52.05%
03:16:23:WU01:FS00:Upload 59.64%
03:16:29:WU01:FS00:Upload 67.24%
03:16:35:WU01:FS00:Upload 74.83%
03:16:41:WU01:FS00:Upload 82.42%
03:16:42:WU00:FS01:0x17:Completed 840000 out of 2000000 steps (42%)
03:16:47:WU01:FS00:Upload 88.92%
03:16:53:WU01:FS00:Upload 96.52%
03:17:00:WU01:FS00:Upload complete
03:17:00:WU01:FS00:Server responded WORK_ACK (400)
03:17:00:WU01:FS00:Final credit estimate, 13907.00 points
03:17:00:WU01:FS00:Cleaning up
03:17:46:WU02:FS00:0x17:Completed 20000 out of 2000000 steps (1%)
03:19:00:WU00:FS01:0x17:Completed 860000 out of 2000000 steps (43%)
03:19:25:WU02:FS00:0x17:Completed 40000 out of 2000000 steps (2%)
03:21:10:WU00:FS01:0x17:Completed 880000 out of 2000000 steps (44%)
03:21:11:WU02:FS00:0x17:Completed 60000 out of 2000000 steps (3%)
03:22:50:WU02:FS00:0x17:Completed 80000 out of 2000000 steps (4%)
03:23:19:WU00:FS01:0x17:Completed 900000 out of 2000000 steps (45%)
03:24:29:WU02:FS00:0x17:Completed 100000 out of 2000000 steps (5%)
03:25:38:WU00:FS01:0x17:Completed 920000 out of 2000000 steps (46%)
03:26:14:WU02:FS00:0x17:Completed 120000 out of 2000000 steps (6%)
03:27:47:WU00:FS01:0x17:Completed 940000 out of 2000000 steps (47%)
03:27:53:WU02:FS00:0x17:Completed 140000 out of 2000000 steps (7%)
03:29:39:WU02:FS00:0x17:Completed 160000 out of 2000000 steps (8%)
03:30:06:WU00:FS01:0x17:Completed 960000 out of 2000000 steps (48%)
03:31:18:WU02:FS00:0x17:Completed 180000 out of 2000000 steps (9%)
03:32:15:WU00:FS01:0x17:Completed 980000 out of 2000000 steps (49%)
03:32:57:WU02:FS00:0x17:Completed 200000 out of 2000000 steps (10%)
03:33:11:WU02:FS00:0x17:Bad State detected... attempting to resume from last good checkpoint
03:34:01:WU02:FS00:0x17:Completed 160000 out of 2000000 steps (8%)
03:34:25:WU00:FS01:0x17:Completed 1000000 out of 2000000 steps (50%)
03:35:39:WU02:FS00:0x17:Completed 180000 out of 2000000 steps (9%)
03:36:43:WU00:FS01:0x17:Completed 1020000 out of 2000000 steps (51%)
03:37:18:WU02:FS00:0x17:Completed 200000 out of 2000000 steps (10%)
03:37:33:WU02:FS00:0x17:Bad State detected... attempting to resume from last good checkpoint
03:38:22:WU02:FS00:0x17:Completed 160000 out of 2000000 steps (8%)
03:38:53:WU00:FS01:0x17:Completed 1040000 out of 2000000 steps (52%)
03:40:01:WU02:FS00:0x17:Completed 180000 out of 2000000 steps (9%)
03:41:12:WU00:FS01:0x17:Completed 1060000 out of 2000000 steps (53%)
03:41:40:WU02:FS00:0x17:Completed 200000 out of 2000000 steps (10%)
03:41:54:WU02:FS00:0x17:Bad State detected... attempting to resume from last good checkpoint
03:41:54:WU02:FS00:0x17:Max number of retries reached. Aborting.
03:41:54:WU02:FS00:0x17:ERROR:exception: Max Retries Reached
03:41:54:WU02:FS00:0x17:Saving result file logfile_01.txt
03:41:54:WU02:FS00:0x17:Saving result file badStateCheckpoint_18467
03:41:54:WU02:FS00:0x17:Saving result file badStateCheckpoint_41
03:41:54:WU02:FS00:0x17:Saving result file badStateCheckpoint_6334
03:41:55:WU02:FS00:0x17:Saving result file badStateForceGroup0_18467Core.xml
03:41:55:WU02:FS00:0x17:Saving result file badStateForceGroup0_18467Ref.xml
03:41:56:WU02:FS00:0x17:Saving result file badStateForceGroup0_41Core.xml
03:41:57:WU02:FS00:0x17:Saving result file badStateForceGroup0_41Ref.xml
03:41:58:WU02:FS00:0x17:Saving result file badStateForceGroup0_6334Core.xml
03:41:59:WU02:FS00:0x17:Saving result file badStateForceGroup0_6334Ref.xml
03:41:59:WU02:FS00:0x17:Saving result file badStateForceGroup1_18467Core.xml
03:42:00:WU02:FS00:0x17:Saving result file badStateForceGroup1_18467Ref.xml
03:42:01:WU02:FS00:0x17:Saving result file badStateForceGroup1_41Core.xml
03:42:01:WU02:FS00:0x17:Saving result file badStateForceGroup1_41Ref.xml
03:42:02:WU02:FS00:0x17:Saving result file badStateForceGroup1_6334Core.xml
03:42:03:WU02:FS00:0x17:Saving result file badStateForceGroup1_6334Ref.xml
03:42:03:WU02:FS00:0x17:Saving result file badStateForceGroup2_18467Core.xml
03:42:04:WU02:FS00:0x17:Saving result file badStateForceGroup2_18467Ref.xml
03:42:04:WU02:FS00:0x17:Saving result file badStateForceGroup2_41Core.xml
03:42:05:WU02:FS00:0x17:Saving result file badStateForceGroup2_41Ref.xml
03:42:05:WU02:FS00:0x17:Saving result file badStateForceGroup2_6334Core.xml
03:42:06:WU02:FS00:0x17:Saving result file badStateForceGroup2_6334Ref.xml
03:42:07:WU02:FS00:0x17:Saving result file log.txt
03:42:07:WU02:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
03:42:07:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
03:42:07:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:7811 run:0 clone:524 gen:491 core:0x17 unit:0x000002180a3b1e8651db4ad9849b66d4
03:42:07:WU02:FS00:Uploading 28.24MiB to 171.64.65.98
03:42:07:WU02:FS00:Connecting to 171.64.65.98:8080
03:42:07:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
03:42:08:WU01:FS00:News: Welcome to Folding@Home
03:42:08:WU01:FS00:Assigned to work server 171.64.65.98
03:42:08:WU01:FS00:Requesting new work unit for slot 00: READY gpu:1:Tahiti [Radeon HD 7900 Series] from 171.64.65.98
03:42:08:WU01:FS00:Connecting to 171.64.65.98:8080
03:42:09:WU01:FS00:Downloading 2.09MiB
03:42:13:WU01:FS00:Download complete
03:42:13:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:7810 run:0 clone:153 gen:437 core:0x17 unit:0x000001d00a3b1e8651d3475d01d47255
and hell I don't know :)

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Tue Nov 19, 2013 11:07 am
by bollix47
It's not unusual for a core 17 work unit to fail if the GPU is overclocked. If that's the case, try returning the memory clock to stock to see if that helps. The memory clock speed has very little effect on folding performance, and reducing it can result in fewer failures.

Currently your latest work unit has failed for a number of folders but that's not necessarily an indication of a bad work unit for core 17. I have seen this happen numerous times, especially with Run 0, but eventually someone does complete the work successfully.

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Tue Nov 19, 2013 2:36 pm
by Hans
Thank you all.

I think it's safe to say that the 7810 WU is still available and therefore not faulty.
My new computer is doing mainly 7810 at the moment and some 7811.
I'll monitor the log file more closely for a while because I think there were no problems with the 9800 WUs on my side over the past months.
I did not overclock on purpose (folding is the only game the PC plays and I know overclocking to be of no use), but I am a little in doubt about the Catalyst Control Center??
Should it continue, I'll try using one GPU to see.

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Posted: Wed Nov 20, 2013 1:51 am
by PantherX
Hans wrote:...I did not overclock on purpose (folding is the only game the PC plays and I know overclocking to be of no use), but I am a little in doubt about the Catalyst Control Center??...
Generally speaking, overclocking your GPU to a folding-stable setting would increase your PPD. However, the increase would vary depending on the amount of OC and the project. For F@H, overclocking the shaders is more useful than overclocking the memory. Please note that overclocking would require you to monitor your GPU for instabilities with future projects, since they may be more computationally intensive and thus could push your once-stable OC into the unstable zone. For a (mostly) hassle-free option, use the vendor stock settings.
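
One possible way to do that monitoring is to count the "Bad State detected" messages per folding slot in the client log; a slot that keeps collecting them is a candidate for backing off the clocks. This is just a sketch based on the log lines posted in this thread, and `bad_states_per_slot` is an illustrative name.

```python
import re
from collections import Counter

# Matches core log lines such as:
# 19:34:49:WU00:FS02:0x17:Bad State detected... attempting to resume ...
BAD_STATE_RE = re.compile(
    r"^\d{2}:\d{2}:\d{2}:WU\d+:(FS\d+):0x[0-9a-fA-F]+:Bad State detected"
)


def bad_states_per_slot(lines):
    """Count 'Bad State detected' messages for each folding slot (FS00, FS01, ...)."""
    counts = Counter()
    for line in lines:
        m = BAD_STATE_RE.match(line.strip())
        if m:
            counts[m.group(1)] += 1
    return counts
```

In the first log in this thread, for instance, all three bad states come from FS02 while FS01 keeps progressing normally.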