Page 1 of 1

Project: 3064 (Run 5, Clone 209, Gen 2)

Posted: Sun Apr 27, 2008 9:38 am
by Jeannie
EUE at 22% -
[06:16:47] Completed 1100000 out of 5000000 steps (22 percent)
[06:19:01] Warning: long 1-4 interactions
[09:16:48] At least 3 hours since checkpoint written...
[09:16:48]
[09:16:48] Folding@home Core Shutdown: EARLY_UNIT_END

This has happened twice on my stock Q66 running Windows Vista, SMP 5.91

Re: Project: 3064 (Run 5, Clone 209, Gen 2)

Posted: Sun Apr 27, 2008 10:31 am
by toTOW
You might want to submit partial results to tell the system this WU will not complete on your computer : viewtopic.php?f=8&t=191 ;)

Re: Project: 3064 (Run 5, Clone 209, Gen 2)

Posted: Sun Apr 27, 2008 11:36 am
by zorzyk
I encountred the same "1-4 long interaction" error 2 days ago:

Code: Select all

[23:55:23] 17:33:25] Project: 3062 (Run 3, Clone 77, Gen 26)
[17:33:25] 
[17:33:25] Assembly optimizations on if available.
[17:33:25] Entering M.D.
[17:33:31] Rejecting checkpoint
[17:33:32] ProteinWriting local files
[17:33:32] Extra SSE boost OK.
[17:33:32] 
[17:33:32] Extra SSE boost OK.
[17:33:32] Writing local files
[17:33:32] Completed 0 out of 5000000 steps  (0 percent)
[17:42:51] Writing local files
...
[23:57:43] Completed 2050000 out of 5000000 steps  (41 percent)
[23:57:43] Warning:  long 1-4 interactions
[23:57:43] Gromacs cannot continue further.
[23:57:43] Going to send back what have done.
[23:57:43] logfile size: 59743
[23:57:43] - Writing 60279 bytes of core data to disk...
[23:57:43]   ... Done.
[23:57:43] - Failed to delete work/wudata_01.arc
[23:57:43] - Failed to delete work/wudata_01.sas
[23:57:43] - Failed to delete work/wudata_01.goe
[23:57:43] Warning:  check for stray files
[23:59:43] 
[23:59:43] Folding@home Core Shutdown: EARLY_UNIT_END
[23:59:43] 
[23:59:43] Folding@home Core Shutdown: EARLY_UNIT_END
[23:59:46] CoreStatus = 7B (123)
[23:59:46] Client-core communications error: ERROR 0x7b
[23:59:46] Deleting current work unit & continuing...
Fortunately internet was switched off by night, and client could not connect to assignment sever :). I rolled back the copy made at 23:30 and tried to continue. After next same EUE (at 46%) I decided to boost CPU VCore voltage one step further - that cured problems and I've finished the WU without any problem.

I see projecsts 30xx are more demanding for the hardware than 2653's were. Maybe it would be necessary to inspect OC settings that worked well until now.

Re: Project: 3064 (Run 5, Clone 209, Gen 2)

Posted: Wed May 28, 2008 3:38 pm
by rbrandman
Thanks for your post. I've alerted the researcher in charge of this project about the errors you are seeing.

Relly

Re: Project: 3064 (Run 5, Clone 209, Gen 2)

Posted: Wed May 28, 2008 6:23 pm
by DanEnsign
Please see viewtopic.php?f=19&t=2704&p=24859#p24773.

Re: Project: 3064 (Run 5, Clone 209, Gen 2)

Posted: Thu May 29, 2008 8:37 am
by joce
Same issue here :

Code: Select all

[16:29:28] *------------------------------*
[16:29:28] Folding@Home Gromacs SMP Core
[16:29:28] Version 1.74 (March 10, 2007)
[16:29:28]
[16:29:28] Preparing to commence simulation
[16:29:28] - Ensuring status. Please wait.
[16:29:45] - Assembly optimizations manually forced on.
[16:29:45] - Not checking prior termination.
[16:29:45] - Expanded 609872 -> 3263429 (decompressed 535.1 percent)
[16:29:45] - Starting from initial work packet
[16:29:45]
[16:29:45] Project: 3064 (Run 2, Clone 51, Gen 49)
[16:29:45]
[16:29:46] Assembly optimizations on if available.
[16:29:46] Entering M.D.
[16:29:52] Protein: p3064_lambdaProteinWriting local files
[16:29:52] Extra SSE boost OK.
[16:29:52]
[16:29:52] Extra SSE boost OK.
[16:29:52] Writing local files
[16:29:52] Completed 0 out of 5000000 steps  (0 percent)
[16:44:52] Timered checkpoint triggered.
[16:50:45] Writing local files
[16:50:45] Completed 50000 out of 5000000 steps  (1 percent)
[17:05:46] Timered checkpoint triggered.
[17:12:00] Writing local files
[17:12:00] Completed 100000 out of 5000000 steps  (2 percent)
[17:27:01] Timered checkpoint triggered.
[17:32:22] Writing local files
[17:32:22] Completed 150000 out of 5000000 steps  (3 percent)
[17:47:23] Timered checkpoint triggered.
[17:52:45] Writing local files
[17:52:45] Completed 200000 out of 5000000 steps  (4 percent)
[18:07:45] Timered checkpoint triggered.
[18:08:18] - Autosending finished units...
[18:08:18] Trying to send all finished work units
[18:08:18] + No unsent completed units remaining.
[18:08:18] - Autosend completed
[18:13:09] Writing local files
[18:13:09] Completed 250000 out of 5000000 steps  (5 percent)
[18:28:09] Timered checkpoint triggered.
[18:33:32] Writing local files
[18:33:33] Completed 300000 out of 5000000 steps  (6 percent)
[18:48:33] Timered checkpoint triggered.
[18:53:55] Writing local files
[18:53:55] Completed 350000 out of 5000000 steps  (7 percent)
[19:08:55] Timered checkpoint triggered.
[19:14:19] Writing local files
[19:14:20] Completed 400000 out of 5000000 steps  (8 percent)
[19:29:20] Timered checkpoint triggered.
[19:34:43] Writing local files
[19:34:43] Completed 450000 out of 5000000 steps  (9 percent)
[19:49:43] Timered checkpoint triggered.
[19:55:07] Writing local files
[19:55:07] Completed 500000 out of 5000000 steps  (10 percent)
[20:10:07] Timered checkpoint triggered.
[20:15:35] Writing local files
[20:15:35] Completed 550000 out of 5000000 steps  (11 percent)
[20:30:35] Timered checkpoint triggered.
[20:35:57] Writing local files
[20:35:57] Completed 600000 out of 5000000 steps  (12 percent)
[20:49:48] Gromacs cannot continue further.
[20:49:48] Going to send back what have done.
[20:49:48] logfile size: 24290
[20:49:48] - Writing 24826 bytes of core data to disk...
[20:49:48]   ... Done.
[20:49:48] - Failed to delete work/wudata_04.arc
[20:49:48] - Failed to delete work/wudata_04.chk
[20:49:48] - Failed to delete work/wudata_04.sas
[20:49:48] - Failed to delete work/wudata_04.goe
[20:49:48] Warning:  check for stray files
[20:51:48]
[20:51:48] Folding@home Core Shutdown: EARLY_UNIT_END
[20:51:48]
[20:51:48] Folding@home Core Shutdown: EARLY_UNIT_END
[00:08:18] - Autosending finished units...
[00:08:18] Trying to send all finished work units
[00:08:18] + No unsent completed units remaining.
[00:08:18] - Autosend completed
[06:08:18] - Autosending finished units...
[06:08:18] Trying to send all finished work units
[06:08:18] + No unsent completed units remaining.
[06:08:18] - Autosend completed