
Project: 5745 (Run 2, Clone 13, Gen 153)

Posted: Sat Jul 03, 2010 9:44 pm
by Tynat
This WU failed with UNSTABLE_MACHINE four times before the run logged below, and once more after I restarted the GPU client.

Code:

[11:13:04] + Received work.
[11:13:04] Trying to send all finished work units
[11:13:04] + No unsent completed units remaining.
[11:13:04] + Closed connections
[11:13:09] 
[11:13:09] + Processing work unit
[11:13:09] Core required: FahCore_11.exe
[11:13:09] Core found.
[11:13:09] Working on queue slot 07 [July 3 11:13:09 UTC]
[11:13:09] + Working ...
[11:13:09] - Calling '.\FahCore_11.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 1216 -version 623'

[11:13:09] 
[11:13:09] *------------------------------*
[11:13:09] Folding@Home GPU Core - Beta
[11:13:09] Version 1.24 (Mon Feb 9 11:00:12 PST 2009)
[11:13:09] 
[11:13:09] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[11:13:09] Build host: amoeba
[11:13:09] Board Type: AMD
[11:13:09] Core      : 
[11:13:09] Preparing to commence simulation
[11:13:09] - Looking at optimizations...
[11:13:09] - Created dyn
[11:13:09] - Files status OK
[11:13:09] - Expanded 59632 -> 357580 (decompressed 599.6 percent)
[11:13:09] Called DecompressByteArray: compressed_data_size=59632 data_size=357580, decompressed_data_size=357580 diff=0
[11:13:09] - Digital signature verified
[11:13:09] 
[11:13:09] Project: 5745 (Run 2, Clone 13, Gen 153)
[11:13:09] 
[11:13:10] Assembly optimizations on if available.
[11:13:10] Entering M.D.
[11:13:16] Tpr hash work/wudata_07.tpr:  3280278006 1681550134 2677261408 2459636388 1016502238
[11:13:16] Working on Protein
[11:13:17] Client config found, loading data.
[11:13:17] Starting GUI Server
[11:13:26] mdrun_gpu returned 
[11:13:26] Nonzero force sum on GPU
[11:13:26] 
[11:13:26] Folding@home Core Shutdown: UNSTABLE_MACHINE
[11:13:29] CoreStatus = 7A (122)
[11:13:29] Sending work to server
[11:13:29] Project: 5745 (Run 2, Clone 13, Gen 153)
[11:13:29] - Read packet limit of 540015616... Set to 524286976.
[11:13:29] - Error: Could not get length of results file work/wuresults_07.dat
[11:13:29] - Error: Could not read unit 07 file. Removing from queue.
[11:13:29] EUE limit exceeded. Pausing 24 hours.
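
For anyone who wants to tally failures like these across a longer client log, here is a minimal sketch in Python. It assumes the log lines have the same shape as the ones above (a bracketed timestamp, then "Folding@home Core Shutdown: ..." or "CoreStatus = ..."). The function name summarize_log and the two regular expressions are my own illustration, not part of the client, and other client or core versions may format these lines differently.

```python
import re

# Patterns modeled on the line shapes in the log above (assumption:
# other client/core versions may word these lines differently).
SHUTDOWN_RE = re.compile(r"^\[(\d{2}:\d{2}:\d{2})\] Folding@home Core Shutdown: (\w+)")
STATUS_RE = re.compile(r"^\[\d{2}:\d{2}:\d{2}\] CoreStatus = ([0-9A-Fa-f]+) \((\d+)\)")


def summarize_log(text):
    """Collect core shutdown reasons and CoreStatus codes from log text.

    Hypothetical helper for illustration only.
    Returns (shutdowns, statuses), where shutdowns is a list of
    (timestamp, reason) and statuses is a list of (hex_code, decimal_code).
    """
    shutdowns = []
    statuses = []
    for line in text.splitlines():
        m = SHUTDOWN_RE.match(line)
        if m:
            shutdowns.append((m.group(1), m.group(2)))
            continue
        m = STATUS_RE.match(line)
        if m:
            statuses.append((m.group(1), int(m.group(2))))
    return shutdowns, statuses


# A few lines copied from the log above.
sample = """[11:13:26] Nonzero force sum on GPU
[11:13:26] 
[11:13:26] Folding@home Core Shutdown: UNSTABLE_MACHINE
[11:13:29] CoreStatus = 7A (122)
"""

shutdowns, statuses = summarize_log(sample)
print(shutdowns)
print(statuses)
```

The CoreStatus line carries the same code twice, in hex and in decimal (7A in hex is 122), so the sketch keeps both values and you can search on whichever one you prefer.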

Re: Project: 5745 (Run 2, Clone 13, Gen 153)

Posted: Sat Jul 03, 2010 9:50 pm
by bruce
Project: 5745 (Run 2, Clone 13, Gen 153) has been reported as a bad WU.

Re: Project: 5745 (Run 2, Clone 13, Gen 153)

Posted: Sat Jul 03, 2010 10:32 pm
by Tynat
Thanks Bruce.