Page 1 of 1

Project: 7810 (Run 0, Clone 126, Gen 55) ERROR:Guru Meditati

Posted: Thu Aug 08, 2013 11:16 am
by rhavern
This WU failed immediately when restarting after a power failure, so it may be related to that.

Code: Select all

*********************** Log Started 2013-08-08T02:13:52Z ***********************
02:13:52:************************* Folding@home Client *************************
02:13:52:      Website: http://folding.stanford.edu/
02:13:52:    Copyright: (c) 2009-2013 Stanford University
02:13:52:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:13:52:         Args: 
02:13:52:       Config: C:/Users/Rick/AppData/Roaming/FAHClient/config.xml
02:13:52:******************************** Build ********************************
02:13:52:      Version: 7.3.6
02:13:52:         Date: Feb 18 2013
02:13:52:         Time: 15:25:17
02:13:52:      SVN Rev: 3923
02:13:52:       Branch: fah/trunk/client
02:13:52:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
02:13:52:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
02:13:52:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
02:13:52:     Platform: win32 XP
02:13:52:         Bits: 32
02:13:52:         Mode: Release
02:13:52:******************************* System ********************************
02:13:52:          CPU: AMD Athlon(tm) 64 X2 Dual Core Processor 4000+
02:13:52:       CPU ID: AuthenticAMD Family 15 Model 107 Stepping 1
02:13:52:         CPUs: 2
02:13:52:       Memory: 2.00GiB
02:13:52:  Free Memory: 1.18GiB
02:13:52:      Threads: WINDOWS_THREADS
02:13:52:  Has Battery: false
02:13:52:   On Battery: false
02:13:52:   UTC offset: 1
02:13:52:          PID: 3672
02:13:52:          CWD: C:/Users/Rick/AppData/Roaming/FAHClient
02:13:52:           OS: Windows 7 Professional Service Pack 1
02:13:52:      OS Arch: X86
02:13:52:         GPUs: 2
02:13:52:        GPU 0: ATI:5 Tahiti XT [Radeon HD 7970]
02:13:52:        GPU 1: ATI:5 Tahiti XT [Radeon HD 7970]
02:13:52:         CUDA: Not detected
02:13:52:Win32 Service: false
02:13:52:***********************************************************************
02:13:52:<config>
02:13:52:  <!-- Folding Core -->
02:13:52:  <core-priority v='low'/>
02:13:52:
02:13:52:  <!-- Folding Slot Configuration -->
02:13:52:  <client-type v='beta'/>
02:13:52:  <power v='full'/>
02:13:52:  <smp v='false'/>
02:13:52:
02:13:52:  <!-- HTTP Server -->
02:13:52:  <allow v='127.0.0.1, 192.168.1.0/24'/>
02:13:52:
02:13:52:  <!-- Network -->
02:13:52:  <proxy v=':8080'/>
02:13:52:
02:13:52:  <!-- Remote Command Server -->
02:13:52:  <command-allow-no-pass v='127.0.0.1, 192.168.1.0/24'/>
02:13:52:
02:13:52:  <!-- User Information -->
02:13:52:  <passkey v='********************************'/>
02:13:52:  <team v='33'/>
02:13:52:  <user v='rhavern'/>
02:13:52:
02:13:52:  <!-- Work Unit Control -->
02:13:52:  <next-unit-percentage v='100'/>
02:13:52:
02:13:52:  <!-- Folding Slots -->
02:13:52:  <slot id='1' type='GPU'>
02:13:52:    <cuda-index v='0'/>
02:13:52:    <gpu-index v='0'/>
02:13:52:    <opencl-index v='0'/>
02:13:52:  </slot>
02:13:52:  <slot id='0' type='GPU'/>
02:13:52:</config>
02:13:52:Trying to access database...
02:13:55:Successfully acquired database lock
02:13:55:Enabled folding slot 01: READY gpu:0:Tahiti XT [Radeon HD 7970]
02:13:55:Enabled folding slot 00: READY gpu:1:Tahiti XT [Radeon HD 7970]
02:13:56:WU01:FS01:Starting
02:13:56:WU01:FS01:Running FahCore: \"C:\\Program Files\\FAHClient/FAHCoreWrapper.exe\" C:/Users/Rick/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 3672 -checkpoint 15 -gpu 0 -gpu-vendor ati
02:14:02:WU01:FS01:Started FahCore on PID 4076
02:14:05:WU01:FS01:Core PID:1688
02:14:05:WU01:FS01:FahCore 0x17 started
02:14:07:WU00:FS00:Starting
02:14:07:WU00:FS00:Running FahCore: \"C:\\Program Files\\FAHClient/FAHCoreWrapper.exe\" C:/Users/Rick/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 703 -lifeline 3672 -checkpoint 15 -gpu 1 -gpu-vendor ati
02:14:07:WU00:FS00:Started FahCore on PID 3708
02:14:07:WU00:FS00:Core PID:3816
02:14:07:WU00:FS00:FahCore 0x17 started
02:14:13:WU00:FS00:0x17:*********************** Log Started 2013-08-08T02:14:12Z ***********************
02:14:13:WU01:FS01:0x17:*********************** Log Started 2013-08-08T02:14:12Z ***********************
02:14:13:WU00:FS00:0x17:Project: 7810 (Run 0, Clone 126, Gen 55)
02:14:13:WU00:FS00:0x17:Unit: 0x0000003c0a3b1e8651d3470e1e3a4944
02:14:13:WU00:FS00:0x17:CPU: 0x00000000000000000000000000000000
02:14:13:WU00:FS00:0x17:Machine: 0
02:14:13:WU00:FS00:0x17:Digital signatures verified
02:14:13:WU01:FS01:0x17:Project: 8900 (Run 13, Clone 0, Gen 76)
02:14:13:WU01:FS01:0x17:Unit: 0x00000067028c1266519a61e55dac201c
02:14:13:WU01:FS01:0x17:CPU: 0x00000000000000000000000000000000
02:14:13:WU01:FS01:0x17:Machine: 1
02:14:13:WU01:FS01:0x17:Digital signatures verified
02:14:17:WU01:FS01:0x17:  Found a checkpoint file
02:14:17:WU00:FS00:0x17:  Found a checkpoint file
02:17:12:WU00:FS00:0x17:ERROR:Guru Meditation #0.3122d3c2d129 (6.6) '00/01/stepsDone'
02:17:12:WU00:FS00:0x17:WARNING:Unexpected exit() call
02:17:12:WU00:FS00:0x17:WARNING:Unexpected exit from science code
02:17:12:WU00:FS00:0x17:Saving result file logfile_01.txt
02:17:12:WU00:FS00:0x17:Saving result file checkpt.crc
02:17:12:WU00:FS00:0x17:Saving result file log.txt
02:17:12:WU00:FS00:0x17:WARNING:While cleaning up: Failed to remove directory '01': boost::filesystem::remove: The process cannot access the file because it is being used by another process: \"01\\stepsDone\"
02:17:12:WU00:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
02:17:13:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:17:13:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:7810 run:0 clone:126 gen:55 core:0x17 unit:0x0000003c0a3b1e8651d3470e1e3a4944
02:17:13:WU00:FS00:Uploading 4.68KiB to 171.64.65.98
02:17:13:WU00:FS00:Connecting to 171.64.65.98:8080
02:17:13:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
02:17:14:WU02:FS00:News: Welcome to Folding@Home
02:17:14:WU02:FS00:Assigned to work server 171.64.65.69
02:17:14:WU02:FS00:Requesting new work unit for slot 00: READY gpu:1:Tahiti XT [Radeon HD 7970] from 171.64.65.69
02:17:14:WU02:FS00:Connecting to 171.64.65.69:8080
02:17:15:WU02:FS00:Downloading 4.17MiB
02:17:18:WU00:FS00:Upload complete
02:17:18:WU00:FS00:Server responded WORK_ACK (400)
02:17:18:WU00:FS00:Cleaning up
02:17:21:WU02:FS00:Download 40.42%
02:17:24:WU02:FS00:Download complete
02:17:24:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8900 run:1 clone:2 gen:116 core:0x17 unit:0x0000008d028c1266519a5f4702dcd883
02:17:24:WU02:FS00:Starting
02:17:24:WU02:FS00:Running FahCore: \"C:\\Program Files\\FAHClient/FAHCoreWrapper.exe\" C:/Users/Rick/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 02 -suffix 01 -version 703 -lifeline 3672 -checkpoint 15 -gpu 1 -gpu-vendor ati
02:17:24:WU02:FS00:Started FahCore on PID 2596
02:17:24:WU02:FS00:Core PID:3204
02:17:24:WU02:FS00:FahCore 0x17 started
02:17:25:WU02:FS00:0x17:*********************** Log Started 2013-08-08T02:17:25Z ***********************
02:17:25:WU02:FS00:0x17:Project: 8900 (Run 1, Clone 2, Gen 116)
02:17:25:WU02:FS00:0x17:Unit: 0x0000008d028c1266519a5f4702dcd883
02:17:25:WU02:FS00:0x17:CPU: 0x00000000000000000000000000000000
02:17:25:WU02:FS00:0x17:Machine: 0

Re: Project: 7810 (Run 0, Clone 126, Gen 55) ERROR:Guru Medi

Posted: Thu Aug 08, 2013 11:31 am
by bollix47
Another folder was able to complete this work unit successfully:

Hi *****(team *****),
Your WU (P7810 R0 C126 G55) was added to the stats database on 2013-08-07 23:01:46 for 16220.7 points of credit.

Re: Project: 7810 (Run 0, Clone 126, Gen 55) ERROR:Guru Medi

Posted: Thu Aug 08, 2013 12:10 pm
by rhavern
That being said the WU shouldn't be in a state where a power failure breaks things :wink:

Thanks for checking bollix47.

Re: Project: 7810 (Run 0, Clone 126, Gen 55) ERROR:Guru Medi

Posted: Fri Aug 09, 2013 1:07 am
by bruce
Sorry, but I'm sure there's a corollary to Murphy's Law that says that Power failures will occur at those times when the checkpoint that's being written is invalid. The file system is only valid if the operating system is able to complete a normal shutdown. The only guaranteed way to finish writing a valid checkpoint is to have a UPS which performs a systematic shutdown that syncs the cached sectors to the HD.