Project: 5755 (Run 10, Clone 195, Gen 12)
Posted: Wed Jan 14, 2009 2:20 am
Got a unit which EUEd non-stop until it stop. Since it cnanot upload the partial results, the servers is never notified :
Code: Select all
[00:55:46] Board Type: Nvidia
[00:55:46] Core :
[00:55:46] Preparing to commence simulation
[00:55:46] - Looking at optimizations...
[00:55:46] - Created dyn
[00:55:46] - Files status OK
[00:55:46] - Expanded 96525 -> 489240 (decompressed 506.8 percent)
[00:55:46] Called DecompressByteArray: compressed_data_size=96525 data_size=489240, decompressed_data_size=489240 diff=0
[00:55:46] - Digital signature verified
[00:55:46]
[00:55:46] Project: 5755 (Run 10, Clone 195, Gen 12)
[00:55:46]
[00:55:46] Assembly optimizations on if available.
[00:55:46] Entering M.D.
[00:55:52] Working on Protein
[00:55:56] Client config found, loading data.
[00:55:56] Starting GUI Server
[00:57:46] Completed 1%
[00:57:46] mdrun_gpu returned
[00:57:46] NANs detected on GPU
[00:57:46]
[00:57:46] Folding@home Core Shutdown: UNSTABLE_MACHINE
[00:57:50] CoreStatus = 7A (122)
[00:57:50] Sending work to server
[00:57:50] Project: 5755 (Run 10, Clone 195, Gen 12)
[00:57:50] - Read packet limit of 540015616... Set to 524286976.
[00:57:50] - Error: Could not get length of results file work/wuresults_07.dat
[00:57:50] - Error: Could not read unit 07 file. Removing from queue.
[00:57:50] Trying to send all finished work units
[00:57:50] + No unsent completed units remaining.
[00:57:50] - Preparing to get new work unit...
[00:57:50] + Attempting to get work packet
[00:57:50] - Will indicate memory of 2046 MB
[00:57:50] - Connecting to assignment server
[00:57:50] Connecting to http://assign-GPU.stanford.edu:8080/
[00:57:50] Posted data.
[00:57:50] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[00:57:50] + News From Folding@Home: GPU folding beta
[00:57:51] Loaded queue successfully.
[00:57:51] Connecting to http://171.67.108.11:8080/
[00:57:51] Posted data.
[00:57:51] Initial: 0000; - Receiving payload (expected size: 97037)
[00:57:52] - Downloaded at ~94 kB/s
[00:57:52] - Averaged speed for that direction ~89 kB/s
[00:57:52] + Received work.
[00:57:52] Trying to send all finished work units
[00:57:52] + No unsent completed units remaining.
[00:57:52] + Closed connections
[00:57:57]
[00:57:57] + Processing work unit
[00:57:57] Core required: FahCore_11.exe
[00:57:57] Core found.
[00:57:57] Working on queue slot 08 [January 14 00:57:57 UTC]
[00:57:57] + Working ...
[00:57:57] - Calling '.\FahCore_11.exe -dir work/ -suffix 08 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 828 -version 623'
[00:57:57]
[00:57:57] *------------------------------*
[00:57:57] Folding@Home GPU Core - Beta
[00:57:57] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[00:57:57]
[00:57:57] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[00:57:57] Build host: amoeba
[00:57:57] Board Type: Nvidia
[00:57:57] Core :
[00:57:57] Preparing to commence simulation
[00:57:57] - Looking at optimizations...
[00:57:57] - Created dyn
[00:57:57] - Files status OK
[00:57:57] - Expanded 96525 -> 489240 (decompressed 506.8 percent)
[00:57:57] Called DecompressByteArray: compressed_data_size=96525 data_size=489240, decompressed_data_size=489240 diff=0
[00:57:57] - Digital signature verified
[00:57:57]
[00:57:57] Project: 5755 (Run 10, Clone 195, Gen 12)
[00:57:57]
[00:57:57] Assembly optimizations on if available.
[00:57:57] Entering M.D.
[00:58:03] Working on Protein
[00:58:07] Client config found, loading data.
[00:58:07] Starting GUI Server
[00:59:57] Completed 1%
[00:59:57] mdrun_gpu returned
[00:59:57] NANs detected on GPU
[00:59:57]
[00:59:57] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:00:01] CoreStatus = 7A (122)
[01:00:01] Sending work to server
[01:00:01] Project: 5755 (Run 10, Clone 195, Gen 12)
[01:00:01] - Read packet limit of 540015616... Set to 524286976.
[01:00:01] - Error: Could not get length of results file work/wuresults_08.dat
[01:00:01] - Error: Could not read unit 08 file. Removing from queue.
[01:00:01] EUE limit exceeded. Pausing 24 hours.