Page 1 of 1

Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Mon Oct 05, 2009 1:08 am
by JJ111
I just started getting this recently. Earlier today I would just get 1, then it would find another project to work on. Now it keeps giving me this :cry: .

Well it finally found another project, it's "Project: 5771 (Run 5, Clone 8, Gen 1331)" and it's working.

Btw im using 190.62 and my core temp is NEVER above 50 deg celcius because i have an accelero S1 on it.

What could be causing it? It can happen at random times too, either when im not using my computer or if im on the internet.

Code: Select all

[23:08:57] *------------------------------*
[23:08:57] Folding@Home GPU Core - Beta
[23:08:57] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[23:08:57] 
[23:08:57] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[23:08:57] Build host: amoeba
[23:08:57] Board Type: Nvidia
[23:08:57] Core      : 
[23:08:57] Preparing to commence simulation
[23:08:57] - Looking at optimizations...
[23:08:57] - Created dyn
[23:08:57] - Files status OK
[23:08:57] - Expanded 45408 -> 251112 (decompressed 553.0 percent)
[23:08:57] Called DecompressByteArray: compressed_data_size=45408 data_size=251112, decompressed_data_size=251112 diff=0
[23:08:57] - Digital signature verified
[23:08:57] 
[23:08:57] Project: 5772 (Run 12, Clone 144, Gen 256)
[23:08:57] 
[23:08:57] Assembly optimizations on if available.
[23:08:57] Entering M.D.
[23:09:04] Working on Protein
[23:09:05] Client config found, loading data.
[23:09:05] mdrun_gpu returned 
[23:09:05] SHAKE violations on GPU
[23:09:05] 
[23:09:05] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:09:08] CoreStatus = 7A (122)
[23:09:08] Sending work to server
[23:09:08] Project: 5772 (Run 12, Clone 144, Gen 256)
[23:09:08] - Read packet limit of 540015616... Set to 524286976.
[23:09:08] - Error: Could not get length of results file work/wuresults_02.dat
[23:09:08] - Error: Could not read unit 02 file. Removing from queue.
[23:09:08] - Preparing to get new work unit...
[23:09:08] + Attempting to get work packet
[23:09:08] - Connecting to assignment server
[23:09:08] - Successful: assigned to (171.67.108.11).
[23:09:08] + News From Folding@Home: Welcome to Folding@Home
[23:09:08] Loaded queue successfully.
[23:09:09] + Closed connections
[23:09:14] 
[23:09:14] + Processing work unit
[23:09:14] Core required: FahCore_11.exe
[23:09:14] Core found.
[23:09:14] Working on queue slot 03 [October 4 23:09:14 UTC]
[23:09:14] + Working ...
[23:09:14] 
[23:09:14] *------------------------------*
[23:09:14] Folding@Home GPU Core - Beta
[23:09:14] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[23:09:14] 
[23:09:14] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[23:09:14] Build host: amoeba
[23:09:14] Board Type: Nvidia
[23:09:14] Core      : 
[23:09:14] Preparing to commence simulation
[23:09:14] - Looking at optimizations...
[23:09:14] - Created dyn
[23:09:14] - Files status OK
[23:09:14] - Expanded 45408 -> 251112 (decompressed 553.0 percent)
[23:09:14] Called DecompressByteArray: compressed_data_size=45408 data_size=251112, decompressed_data_size=251112 diff=0
[23:09:14] - Digital signature verified
[23:09:14] 
[23:09:14] Project: 5772 (Run 12, Clone 144, Gen 256)
[23:09:14] 
[23:09:15] Assembly optimizations on if available.
[23:09:15] Entering M.D.
[23:09:21] Working on Protein
[23:09:23] Client config found, loading data.
[23:09:23] Starting GUI Server
[23:09:23] mdrun_gpu returned 
[23:09:23] SHAKE violations on GPU
[23:09:23] 
[23:09:23] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:09:27] CoreStatus = 7A (122)
[23:09:27] Sending work to server
[23:09:27] Project: 5772 (Run 12, Clone 144, Gen 256)
[23:09:27] - Read packet limit of 540015616... Set to 524286976.
[23:09:27] - Error: Could not get length of results file work/wuresults_03.dat
[23:09:27] - Error: Could not read unit 03 file. Removing from queue.
[23:09:27] - Preparing to get new work unit...
[23:09:27] + Attempting to get work packet
[23:09:27] - Connecting to assignment server
[23:09:27] - Successful: assigned to (171.67.108.11).
[23:09:27] + News From Folding@Home: Welcome to Folding@Home
[23:09:28] Loaded queue successfully.
[23:10:17] + Closed connections
[23:10:22] 
[23:10:22] + Processing work unit
[23:10:22] Core required: FahCore_11.exe
[23:10:22] Core found.
[23:10:22] Working on queue slot 04 [October 4 23:10:22 UTC]
[23:10:22] + Working ...
Damn it another one...

Code: Select all

[03:29:59] + Processing work unit
[03:29:59] Core required: FahCore_11.exe
[03:29:59] Core found.
[03:29:59] Working on queue slot 07 [October 5 03:29:59 UTC]
[03:29:59] + Working ...
[03:29:59] 
[03:29:59] *------------------------------*
[03:29:59] Folding@Home GPU Core - Beta
[03:29:59] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[03:29:59] 
[03:29:59] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[03:29:59] Build host: amoeba
[03:29:59] Board Type: Nvidia
[03:29:59] Core      : 
[03:29:59] Preparing to commence simulation
[03:29:59] - Looking at optimizations...
[03:29:59] - Files status OK
[03:29:59] - Expanded 45402 -> 251112 (decompressed 553.0 percent)
[03:29:59] Called DecompressByteArray: compressed_data_size=45402 data_size=251112, decompressed_data_size=251112 diff=0
[03:29:59] - Digital signature verified
[03:29:59] 
[03:29:59] Project: 5771 (Run 5, Clone 8, Gen 1331)
[03:29:59] 
[03:29:59] Assembly optimizations on if available.
[03:29:59] Entering M.D.
[03:30:05] Will resume from checkpoint file
[03:30:07] Working on Protein
[03:30:07] Client config found, loading data.
[03:30:07] Resuming from checkpoint
[03:30:07] Verified work/wudata_07.log
[03:30:07] Verified work/wudata_07.edr
[03:30:07] Verified work/wudata_07.xtc
[03:30:07] Completed 28%
[03:30:07] Starting GUI Server
[03:31:11] Completed 29%
[03:32:14] Completed 30%
[03:33:17] Completed 31%
[03:34:21] Completed 32%
[03:35:24] Completed 33%
[03:36:27] Completed 34%
[03:37:32] Completed 35%
[03:38:33] Completed 36%
[03:39:34] Completed 37%
[03:40:41] Completed 38%
[03:41:41] Completed 39%
[03:42:41] Completed 40%
[03:43:43] Completed 41%
[03:43:51] mdrun_gpu returned 
[03:43:51] Going to send back what have done -- stepsTotalG=15000000
[03:43:51] Work fraction=0.4100 steps=15000000.
[03:43:55] logfile size=15994 infoLength=15994 edr=0 trr=25
[03:43:55] - Writing 16532 bytes of core data to disk...
[03:43:55] Done: 16020 -> 4723 (compressed to 29.4 percent)
[03:43:55]   ... Done.
[03:43:55] 
[03:43:55] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:43:58] CoreStatus = 7A (122)
[03:43:58] Sending work to server
[03:43:58] Project: 5771 (Run 5, Clone 8, Gen 1331)
[03:43:58] - Read packet limit of 540015616... Set to 524286976.

Re: Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Fri Oct 16, 2009 6:06 pm
by ei57
Project: 5772 (Run 12, Clone 144, Gen 256) caused immediate EUE - log identical to the one above (not the damned one!). Got it twice on this PC, before I cleaned the work directory and it might have put another GPU to sleep. I'll check the log later. This happened yesterday. If this is a bad WU, it has been around for at least 10 days.

Re: Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Tue Oct 20, 2009 10:24 am
by selfwilly
And again...

[00:38:33] Project: 5772 (Run 12, Clone 144, Gen 256)
[00:38:33]
[00:38:33] Assembly optimizations on if available.
[00:38:33] Entering M.D.
[00:38:40] Working on Protein
[00:38:41] Client config found, loading data.
[00:38:41] Starting GUI Server
[00:38:41] mdrun_gpu returned
[00:38:41] SHAKE violations on GPU
[00:38:41]
[00:38:41] Folding@home Core Shutdown: UNSTABLE_MACHINE
[00:38:44] CoreStatus = 7A (122)
[00:38:44] Sending work to server
[00:38:44] Project: 5772 (Run 12, Clone 144, Gen 256)
[00:38:44] - Error: Could not get length of results file work/wuresults_09.dat
[00:38:44] - Error: Could not read unit 09 file. Removing from queue.
[00:38:44] - Preparing to get new work unit...
[00:38:44] + Attempting to get work packet
[00:38:44] - Connecting to assignment server
[00:38:44] - Successful: assigned to (171.67.108.11).
[00:38:44] + News From Folding@Home: Welcome to Folding@Home
[00:38:44] Loaded queue successfully.
[00:38:45] + Closed connections
[00:38:50]

Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Sat Dec 19, 2009 11:27 am
by theo343
Repeats until the 24 hour pause.
[07:20:40] Project: 5772 (Run 12, Clone 144, Gen 256)
[07:20:40]
[07:20:40] Assembly optimizations on if available.
[07:20:40] Entering M.D.
[07:20:47] Working on Protein
[07:20:47] Client config found, loading data.
[07:20:47] mdrun_gpu returned
[07:20:47] SHAKE violations on GPU
[07:20:47]
[07:20:47] Folding@home Core Shutdown: UNSTABLE_MACHINE
[07:20:50] CoreStatus = 7A (122)
[07:20:50] Sending work to server
[07:22:02] EUE limit exceeded. Pausing 24 hours.

Re: Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Sat Dec 19, 2009 3:19 pm
by theo343
This WU has caused me three "pausing 24 hours" today. It keeps coming back inbetween other WUs that works fine.

Re: Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Sun Dec 20, 2009 2:02 am
by codysluder
see also viewtopic.php?f=52&t=10370&start=0 and viewtopic.php?f=52&t=7953&start=180#p114076 for other reports of the same WU failing.

Can somebody shut this bad WU down? (and perhaps merge the topics)???

Re: Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Sun Dec 20, 2009 5:35 am
by bruce
This WU has been put on the "shut it down" list.

Re: Project: 5772 (Run 12, Clone 144, Gen 256)

Posted: Sun Dec 20, 2009 11:43 am
by theo343
Thanks :)