Page 1 of 1

Did I lose a work unit?

Posted: Tue Nov 06, 2012 9:44 am
by RMouse
I am not sure what happened here. I paused my computer with one of the cores at 77% and when I restarted it, it was back to 0%. Did I lose a work unit? I cant make sense of this log.

Code: Select all

00:50:27:WU00:FS00:0x11:Completed 51%
01:20:49:WU00:FS00:0x11:Completed 52%
01:34:00:WU00:FS00:0x11:Completed 53%
01:46:36:WU00:FS00:0x11:Completed 54%
01:49:31:WU02:FS01:0xa4:Completed 1110000 out of 1500000 steps  (74%)
02:04:36:WU00:FS00:0x11:Completed 55%
02:17:23:WU00:FS00:0x11:Completed 56%
02:29:35:WU00:FS00:0x11:Completed 57%
02:42:21:WU00:FS00:0x11:Completed 58%
02:55:20:WU00:FS00:0x11:Completed 59%
03:07:58:WU00:FS00:0x11:Completed 60%
03:20:34:WU00:FS00:0x11:Completed 61%
03:27:50:FS00:Paused
03:27:50:FS01:Paused
03:27:50:FS00:Shutting core down
03:27:50:FS01:Shutting core down
03:27:59:WU02:FS01:0xa4:Client no longer detected. Shutting down core 
03:27:59:WU02:FS01:0xa4:
03:27:59:WU02:FS01:0xa4:Folding@home Core Shutdown: CLIENT_DIED
03:27:59:WU00:FS00:0x11:Client no longer detected. Shutting down core 
03:27:59:WU00:FS00:0x11:
03:27:59:WU00:FS00:0x11:Folding@home Core Shutdown: CLIENT_DIED
03:27:59:WU02:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
03:27:59:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
04:14:11:FS00:Unpaused
04:14:11:FS01:Unpaused
04:14:11:WU02:FS01:Starting
04:14:11:WU02:FS01:Running FahCore: E:\FAHClient/FAHCoreWrapper.exe "C:/Documents and Settings/name/Application Data/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/Core_a4.fah/FahCore_a4.exe" -dir 02 -suffix 01 -version 701 -lifeline 1712 -checkpoint 15 -np 2
04:14:12:WU02:FS01:Started FahCore on PID 2488
04:14:16:WU02:FS01:Core PID:2744
04:14:16:WU02:FS01:FahCore 0xa4 started
04:14:16:WU00:FS00:Starting
04:14:16:WU00:FS00:Running FahCore: E:\FAHClient/FAHCoreWrapper.exe "C:/Documents and Settings/name/Application Data/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/G80/Core_11.fah/FahCore_11.exe" -dir 00 -suffix 01 -version 701 -lifeline 1712 -checkpoint 15 -gpu 0
04:14:16:WU00:FS00:Started FahCore on PID 5768
04:14:16:WU00:FS00:Core PID:2592
04:14:16:WU00:FS00:FahCore 0x11 started
04:14:16:WU02:FS01:0xa4:
04:14:16:WU02:FS01:0xa4:*------------------------------*
04:14:16:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
04:14:16:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
04:14:16:WU02:FS01:0xa4:
04:14:16:WU02:FS01:0xa4:Preparing to commence simulation
04:14:16:WU02:FS01:0xa4:- Looking at optimizations...
04:14:16:WU02:FS01:0xa4:- Files status OK
04:14:17:WU02:FS01:0xa4:- Expanded 2079550 -> 5386224 (decompressed 259.0 percent)
04:14:17:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=2079550 data_size=5386224, decompressed_data_size=5386224 diff=0
04:14:17:WU02:FS01:0xa4:- Digital signature verified
04:14:17:WU02:FS01:0xa4:
04:14:17:WU02:FS01:0xa4:Project: 7809 (Run 9, Clone 117, Gen 103)
04:14:17:WU02:FS01:0xa4:
04:14:17:WU02:FS01:0xa4:Assembly optimizations on if available.
04:14:17:WU02:FS01:0xa4:Entering M.D.
04:14:17:WU00:FS00:0x11:
04:14:17:WU00:FS00:0x11:*------------------------------*
04:14:17:WU00:FS00:0x11:Folding@Home GPU Core
04:14:17:WU00:FS00:0x11:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
04:14:17:WU00:FS00:0x11:
04:14:17:WU00:FS00:0x11:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
04:14:17:WU00:FS00:0x11:Build host: amoeba
04:14:17:WU00:FS00:0x11:Board Type: Nvidia
04:14:17:WU00:FS00:0x11:Core      : 
04:14:17:WU00:FS00:0x11:Preparing to commence simulation
04:14:17:WU00:FS00:0x11:- Looking at optimizations...
04:14:17:WU00:FS00:0x11:- Files status OK
04:14:17:WU00:FS00:0x11:- Expanded 45429 -> 251112 (decompressed 552.7 percent)
04:14:17:WU00:FS00:0x11:Called DecompressByteArray: compressed_data_size=45429 data_size=251112, decompressed_data_size=251112 diff=0
04:14:17:WU00:FS00:0x11:- Digital signature verified
04:14:17:WU00:FS00:0x11:
04:14:17:WU00:FS00:0x11:Project: 5771 (Run 1, Clone 194, Gen 2585)
04:14:17:WU00:FS00:0x11:
04:14:17:WU00:FS00:0x11:Assembly optimizations on if available.
04:14:17:WU00:FS00:0x11:Entering M.D.
04:14:23:WU00:FS00:0x11:Will resume from checkpoint file
04:14:23:WU00:FS00:0x11:Tpr hash 00/wudata_01.tpr:  179053866 2528564702 746454696 3710068585 1170749563
04:14:23:WU00:FS00:0x11:
04:14:23:WU00:FS00:0x11:Calling fah_main args: 14 usage=100
04:14:23:WU00:FS00:0x11:
04:14:23:WU02:FS01:0xa4:Using Gromacs checkpoints
04:14:23:WU02:FS01:0xa4:Mapping NT from 2 to 2 
04:14:23:WU00:FS00:0x11:Working on Protein
04:14:24:WU02:FS01:0xa4:Resuming from checkpoint
04:14:24:WU02:FS01:0xa4:Verified 02/wudata_01.log
04:14:25:WU02:FS01:0xa4:Verified 02/wudata_01.trr
04:14:25:WU02:FS01:0xa4:Verified 02/wudata_01.xtc
04:14:25:WU02:FS01:0xa4:Verified 02/wudata_01.edr
04:14:25:WU00:FS00:0x11:Client config unavailable.
04:14:25:WU00:FS00:0x11:Starting GUI Server
04:14:25:WU02:FS01:0xa4:Completed 1115470 out of 1500000 steps  (74%)
04:14:25:WU00:FS00:0x11:Resuming from checkpoint
04:14:25:WU00:FS00:0x11:fcCheckPointResume: retreived and current tpr file hash:
04:14:25:WU00:FS00:0x11:   0    179053866    179053866
04:14:25:WU00:FS00:0x11:   1   2528564702   2528564702
04:14:25:WU00:FS00:0x11:   2    746454696    746454696
04:14:25:WU00:FS00:0x11:   3   3710068585   3710068585
04:14:25:WU00:FS00:0x11:   4   1170749563   1170749563
04:14:25:WU00:FS00:0x11:fcCheckPointResume: file hashes same.
04:14:25:WU00:FS00:0x11:fcCheckPointResume: state restored.
04:14:25:WU00:FS00:0x11:Verified 00/wudata_01.log
04:14:25:WU00:FS00:0x11:Verified 00/wudata_01.edr
04:14:25:WU00:FS00:0x11:Verified 00/wudata_01.xtc
04:14:25:WU00:FS00:0x11:Completed 61%
04:29:50:WU00:FS00:0x11:Completed 62%
04:43:58:WU00:FS00:0x11:Completed 63%
04:57:14:WU00:FS00:0x11:Completed 64%
05:10:53:WU00:FS00:0x11:Completed 65%
05:32:05:WU00:FS00:0x11:Completed 66%
05:46:55:WU00:FS00:0x11:Completed 67%
06:05:14:WU00:FS00:0x11:Completed 68%
06:21:05:WU00:FS00:0x11:Completed 69%
******************************** Date: 06/11/12 ********************************
06:35:31:WU00:FS00:0x11:Completed 70%
06:48:29:WU00:FS00:0x11:Completed 71%
07:00:53:WU00:FS00:0x11:Completed 72%
07:14:51:WU00:FS00:0x11:Completed 73%
07:27:34:WU00:FS00:0x11:Completed 74%
07:58:13:WU00:FS00:0x11:Completed 75%
08:10:20:WU00:FS00:0x11:Completed 76%
08:29:56:WU00:FS00:0x11:Completed 77%
08:37:02:WU02:FS01:0xa4:Completed 1125000 out of 1500000 steps  (75%)
08:42:13:FS00:Paused
08:42:13:FS01:Paused
08:42:13:FS00:Shutting core down
08:42:13:FS01:Shutting core down
08:42:17:WU02:FS01:0xa4:Client no longer detected. Shutting down core 
08:42:17:WU02:FS01:0xa4:
08:42:17:WU02:FS01:0xa4:Folding@home Core Shutdown: CLIENT_DIED
08:42:17:WU02:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
08:42:21:WU00:FS00:0x11:Client no longer detected. Shutting down core 
08:42:21:WU00:FS00:0x11:
08:42:21:WU00:FS00:0x11:Folding@home Core Shutdown: CLIENT_DIED
08:42:24:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
09:38:14:Server connection id=10 ended
09:38:16:Server connection id=11 on 0.0.0.0:36330 from 127.0.0.1
09:38:32:FS00:Unpaused
09:38:32:FS01:Unpaused
09:38:32:WU02:FS01:Starting
09:38:32:WU02:FS01:Running FahCore: E:\FAHClient/FAHCoreWrapper.exe "C:/Documents and Settings/name/Application Data/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/Core_a4.fah/FahCore_a4.exe" -dir 02 -suffix 01 -version 701 -lifeline 1712 -checkpoint 15 -np 2
09:38:32:WU02:FS01:Started FahCore on PID 3428
09:38:38:WU02:FS01:Core PID:1120
09:38:38:WU02:FS01:FahCore 0xa4 started
09:38:39:WU00:FS00:Starting
09:38:39:WU00:FS00:Running FahCore: E:\FAHClient/FAHCoreWrapper.exe "C:/Documents and Settings/name/Application Data/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/G80/Core_11.fah/FahCore_11.exe" -dir 00 -suffix 01 -version 701 -lifeline 1712 -checkpoint 15 -gpu 0
09:38:39:WU00:FS00:Started FahCore on PID 4180
09:38:39:WU00:FS00:Core PID:4208
09:38:39:WU00:FS00:FahCore 0x11 started
09:38:39:WU02:FS01:0xa4:
09:38:39:WU02:FS01:0xa4:*------------------------------*
09:38:39:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
09:38:39:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
09:38:40:WU02:FS01:0xa4:
09:38:40:WU02:FS01:0xa4:Preparing to commence simulation
09:38:40:WU02:FS01:0xa4:- Looking at optimizations...
09:38:40:WU02:FS01:0xa4:- Files status OK
09:38:40:WU00:FS00:0x11:
09:38:40:WU00:FS00:0x11:*------------------------------*
09:38:40:WU00:FS00:0x11:Folding@Home GPU Core
09:38:40:WU00:FS00:0x11:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
09:38:40:WU00:FS00:0x11:
09:38:40:WU00:FS00:0x11:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
09:38:40:WU00:FS00:0x11:Build host: amoeba
09:38:40:WU00:FS00:0x11:Board Type: Nvidia
09:38:41:WU00:FS00:0x11:Core      : 
09:38:41:WU00:FS00:0x11:Preparing to commence simulation
09:38:41:WU00:FS00:0x11:- Looking at optimizations...
09:38:41:WU00:FS00:0x11:- Files status OK
09:38:41:WU00:FS00:0x11:- Expanded 45429 -> 251112 (decompressed 552.7 percent)
09:38:41:WU00:FS00:0x11:Called DecompressByteArray: compressed_data_size=45429 data_size=251112, decompressed_data_size=251112 diff=0
09:38:41:WU00:FS00:0x11:- Digital signature verified
09:38:41:WU00:FS00:0x11:
09:38:41:WU00:FS00:0x11:Project: 5771 (Run 1, Clone 194, Gen 2585)
09:38:41:WU00:FS00:0x11:
09:38:41:WU00:FS00:0x11:Assembly optimizations on if available.
09:38:41:WU00:FS00:0x11:Entering M.D.
09:38:41:WU02:FS01:0xa4:- Expanded 2079550 -> 5386224 (decompressed 259.0 percent)
09:38:41:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=2079550 data_size=5386224, decompressed_data_size=5386224 diff=0
09:38:41:WU02:FS01:0xa4:- Digital signature verified
09:38:41:WU02:FS01:0xa4:
09:38:42:WU02:FS01:0xa4:Project: 7809 (Run 9, Clone 117, Gen 103)
09:38:42:WU02:FS01:0xa4:
09:38:42:WU02:FS01:0xa4:Assembly optimizations on if available.
09:38:42:WU02:FS01:0xa4:Entering M.D.
09:38:46:WU00:FS00:0x11:Will resume from checkpoint file
09:38:46:WU00:FS00:0x11:Tpr hash 00/wudata_01.tpr:  179053866 2528564702 746454696 3710068585 1170749563
09:38:46:WU00:FS00:0x11:
09:38:46:WU00:FS00:0x11:Calling fah_main args: 14 usage=100
09:38:46:WU00:FS00:0x11:
09:38:47:WU00:FS00:0x11:Working on Protein
09:38:47:WU02:FS01:0xa4:Using Gromacs checkpoints
09:38:48:WU02:FS01:0xa4:Mapping NT from 2 to 2 
09:38:49:WU00:FS00:0x11:Client config unavailable.
09:38:49:WU02:FS01:0xa4:Resuming from checkpoint
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.log
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.trr
09:38:49:WU00:FS00:0x11:Resuming from checkpoint
09:38:49:WU00:FS00:0x11:fcCheckPointResume: retreived and current tpr file hash:
09:38:49:WU00:FS00:0x11:   0    179053866    179053866
09:38:49:WU00:FS00:0x11:   1   2528564702   2528564702
09:38:49:WU00:FS00:0x11:   2    746454696    746454696
09:38:49:WU00:FS00:0x11:   3   3710068585   3710068585
09:38:49:WU00:FS00:0x11:   4   1170749563   1170749563
09:38:49:WU00:FS00:0x11:fcCheckPointResume: file hashes same.
09:38:49:WU00:FS00:0x11:fcCheckPointResume: state restored.
09:38:49:WU00:FS00:0x11:Verified 00/wudata_01.log
09:38:49:WU00:FS00:0x11:Starting GUI Server
09:38:49:WU00:FS00:0x11:Verified 00/wudata_01.edr
09:38:49:WU00:FS00:0x11:Verified 00/wudata_01.xtc
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.xtc
09:38:49:WU00:FS00:0x11:Completed 77%
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.edr
09:38:50:WU02:FS01:0xa4:Completed 1124790 out of 1500000 steps  (74%)
09:39:57:WU02:FS01:0xa4:Completed 1125000 out of 1500000 steps  (75%)

Re: Did I lose a work unit?

Posted: Tue Nov 06, 2012 9:50 am
by rpmouton

Code: Select all

09:38:49:WU00:FS00:0x11:Client config unavailable.
09:38:49:WU02:FS01:0xa4:Resuming from checkpoint
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.log
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.trr
09:38:49:WU00:FS00:0x11:Resuming from checkpoint
09:38:49:WU00:FS00:0x11:fcCheckPointResume: retreived and current tpr file hash:
09:38:49:WU00:FS00:0x11:   0    179053866    179053866
09:38:49:WU00:FS00:0x11:   1   2528564702   2528564702
09:38:49:WU00:FS00:0x11:   2    746454696    746454696
09:38:49:WU00:FS00:0x11:   3   3710068585   3710068585
09:38:49:WU00:FS00:0x11:   4   1170749563   1170749563
09:38:49:WU00:FS00:0x11:fcCheckPointResume: file hashes same.
09:38:49:WU00:FS00:0x11:fcCheckPointResume: state restored.
09:38:49:WU00:FS00:0x11:Verified 00/wudata_01.log
09:38:49:WU00:FS00:0x11:Starting GUI Server
09:38:49:WU00:FS00:0x11:Verified 00/wudata_01.edr
09:38:49:WU00:FS00:0x11:Verified 00/wudata_01.xtc
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.xtc
09:38:49:WU00:FS00:0x11:Completed 77%
09:38:49:WU02:FS01:0xa4:Verified 02/wudata_01.edr
09:38:50:WU02:FS01:0xa4:Completed 1124790 out of 1500000 steps  (74%)
09:39:57:WU02:FS01:0xa4:Completed 1125000 out of 1500000 steps  (75%)
See above, check point found and WU resumed. What is reporting 0 percent? HFM? it does that for me as well on resume until it completes a couple of percentage points

Re: Did I lose a work unit?

Posted: Tue Nov 06, 2012 10:16 am
by RMouse
Ok, just had another look. Everything looks normal now. The client sometimes seems wacky after a restart with regards to estimated time left, but not usually with the % completed. Now everything looks fine.

Sorry to take up your time. Thanks!

Re: Did I lose a work unit?

Posted: Tue Nov 06, 2012 12:12 pm
by rpmouton

Sorry to take up your time. Thanks!
Was up anyway, cheers