Page 1 of 1

Project: 5747 (Run 3, Clone 23, Gen 197)

Posted: Sat Jul 03, 2010 7:40 am
by Tynat
There were four prior UNSTABLE_MACHINE failures with this WU previous to this one and one additional after restarting the GPU client.

Code: Select all

[05:17:36] + Received work.
[05:17:36] Trying to send all finished work units
[05:17:36] + No unsent completed units remaining.
[05:17:36] + Closed connections
[05:17:41]
[05:17:41] + Processing work unit
[05:17:41] Core required: FahCore_11.exe
[05:17:41] Core found.
[05:17:41] Working on queue slot 09 [July 3 05:17:41 UTC]
[05:17:41] + Working ...
[05:17:41] - Calling '.\FahCore_11.exe -dir work/ -suffix 09 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 5228 -version 623'

[05:17:41]
[05:17:41] *------------------------------*
[05:17:41] Folding@Home GPU Core - Beta
[05:17:41] Version 1.24 (Mon Feb 9 11:00:12 PST 2009)
[05:17:41]
[05:17:41] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[05:17:41] Build host: amoeba
[05:17:41] Board Type: AMD
[05:17:41] Core      :
[05:17:41] Preparing to commence simulation
[05:17:41] - Looking at optimizations...
[05:17:41] - Created dyn
[05:17:41] - Files status OK
[05:17:41] - Expanded 43780 -> 357580 (decompressed 816.7 percent)
[05:17:41] Called DecompressByteArray: compressed_data_size=43780 data_size=357580, decompressed_data_size=357580 diff=0
[05:17:42] - Digital signature verified
[05:17:42]
[05:17:42] Project: 5747 (Run 3, Clone 23, Gen 197)
[05:17:42]
[05:17:42] Assembly optimizations on if available.
[05:17:42] Entering M.D.
[05:17:48] Tpr hash work/wudata_09.tpr:  186647762 839062624 162360122 3888834850 1870310599
[05:17:48] Working on Protein
[05:17:49] Client config found, loading data.
[05:17:49] Starting GUI Server
[05:17:55] mdrun_gpu returned
[05:17:55] NANs detected on GPU
[05:17:55]
[05:17:55] Folding@home Core Shutdown: UNSTABLE_MACHINE
[05:17:58] CoreStatus = 7A (122)
[05:17:58] Sending work to server
[05:17:58] Project: 5747 (Run 3, Clone 23, Gen 197)
[05:17:58] - Read packet limit of 540015616... Set to 524286976.
[05:17:58] - Error: Could not get length of results file work/wuresults_09.dat
[05:17:58] - Error: Could not read unit 09 file. Removing from queue.
[05:17:58] EUE limit exceeded. Pausing 24 hours.

Re: Project: 5747 (Run 3, Clone 23, Gen 197)

Posted: Sat Jul 03, 2010 11:45 am
by toTOW
Reported as a bad WU.

Re: Project: 5747 (Run 3, Clone 23, Gen 197)

Posted: Sat Jul 03, 2010 8:49 pm
by Tynat
Thanks toTOW.