
Project: 2665 EUEs

Posted: Tue Apr 07, 2009 3:42 am
by anko1
Catching up on Big Red's logs, I found three EUEs (EARLY_UNIT_END) on Project 2665.

Project: 2665 (Run 3, Clone 513, Gen 68)

Code:

[17:42:06] + Closed connections
[17:42:06] 
[17:42:06] + Processing work unit
[17:42:06] Work type a1 not eligible for variable processors
[17:42:06] Core required: FahCore_a1.exe
[17:42:06] Core found.
[17:42:06] Using generic mpiexec calls
[17:42:06] Working on queue slot 04 [March 31 17:42:06 UTC]
[17:42:06] + Working ...
[17:42:06] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 3704 -version 623'

[17:42:06] 
[17:42:06] *------------------------------*
[17:42:06] Folding@Home Gromacs SMP Core
[17:42:06] Version 1.74 (March 10, 2007)
[17:42:06] 
[17:42:06] Preparing to commence simulation
[17:42:06] - Looking at optimizations...
[17:42:06] - Created dyn
[17:42:06] - Files status OK
[17:42:11] - Expanded 4756551 -> 24426905 (decompressed 513.5 percent)
[17:42:11] - Starting from initial work packet
[17:42:11] 
[17:42:11] Project: 2665 (Run 3, Clone 513, Gen 68)
[17:42:11] 
[17:42:12] Assembly optimizations on if available.
[17:42:12] Entering M.D.
[17:42:33]  percent)
[17:42:33] - Starting from initial work packet
[17:42:33] 
[17:42:33] Project: 2665 (Run 3, Clone 513, Gen 68)
[17:42:33] 
[17:42:34] Entering M.D.
[17:42:40] Rejecting checkpoint
[17:42:42] Protein: HGG in water
[17:42:42] Writing local files
[17:42:50] Extra SSE boost OK.
[17:42:50] Writing local files
[17:42:50] Completed 0 out of 250000 steps  (0 percent)
[17:50:03] - Autosending finished units... [March 31 17:50:03 UTC]
[17:50:03] Trying to send all finished work units
[17:50:03] + No unsent completed units remaining.
[17:50:03] - Autosend completed
[17:57:51] Timered checkpoint triggered.
[17:58:33] Writing local files
[17:58:33] Completed 2500 out of 250000 steps  (1 percent)
                   {snip}
[22:02:18] Writing local files
[22:02:18] Completed 42500 out of 250000 steps  (17 percent)
[22:16:25] Warning:  long 1-4 interactions
[22:16:25] Gromacs cannot continue further.
[22:16:25] Going to send back what have done.
[22:16:25] logfile size: 41233
[22:16:25] - Writing 41769 bytes of core data to disk...
[22:16:25]   ... Done.
[22:16:25] - Failed to delete work/wudata_04.sas
[22:16:25] - Failed to delete work/wudata_04.goe
[22:16:25] Warning:  check for stray files
[22:18:25] 
[22:18:25] Folding@home Core Shutdown: EARLY_UNIT_END
[22:18:25] 
[22:18:25] Folding@home Core Shutdown: EARLY_UNIT_END
[22:18:28] CoreStatus = 7B (123)
[22:18:28] Sending work to server
[22:18:28] Project: 2665 (Run 3, Clone 513, Gen 68)


[22:18:28] + Attempting to send results [March 31 22:18:28 UTC]
[22:18:28] - Reading file work/wuresults_04.dat from core
[22:18:28]   (Read 41769 bytes from disk)
[22:18:28] Connecting to http://171.64.65.64:8080/
[22:18:28] Posted data.
[22:18:28] Initial: 0000; - Uploaded at ~41 kB/s
[22:18:29] - Averaged speed for that direction ~391 kB/s
[22:18:29] + Results successfully sent
[22:18:29] Thank you for your contribution to Folding@Home.

Project: 2665 (Run 2, Clone 165, Gen 71)

Code:

[22:18:43] + Closed connections
[22:18:48] 
[22:18:48] + Processing work unit
[22:18:48] Work type a1 not eligible for variable processors
[22:18:48] Core required: FahCore_a1.exe
[22:18:48] Core found.
[22:18:48] Using generic mpiexec calls
[22:18:48] Working on queue slot 05 [March 31 22:18:48 UTC]
[22:18:48] + Working ...
[22:18:48] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 05 -checkpoint 15 -verbose -lifeline 3704 -version 623'

[22:18:48] 
[22:18:48] *------------------------------*
[22:18:48] Folding@Home Gromacs SMP Core
[22:18:48] Version 1.74 (March 10, 2007)
[22:18:48] 
[22:18:48] Preparing to commence simulation
[22:18:48] - Ensuring status. Please wait.
[22:18:53] - Starting from initial work packet
[22:18:53] 
[22:18:53] Project: 2665 (Run 2, Clone 165, Gen 71)
[22:18:53] 
[22:18:54] Assembly optimizations on if available.
[22:18:54] Entering M.D.
[22:19:16]  percent)
[22:19:16] - Starting from initial work packet
[22:19:16] 
[22:19:16] Project: 2665 (Run 2, Clone 165, Gen 71)
[22:19:16] 
[22:19:16] Entering M.D.
[22:19:22] Rejecting checkpoint
[22:19:24] Protein: HGG with glycosylations
[22:19:24] Writing local files
[22:19:33] Extra SSE boost OK.
[22:19:33] Writing local files
[22:19:33] Completed 0 out of 250000 steps  (0 percent)
[22:34:26] Gromacs cannot continue further- Failed to delete work/wudata_05.sas
[22:34:26] - Failed to delete work/wudata_05.goe
[22:34:26] Warning:  check for stray files
[22:34:26] e wor- Failed to delete work/wudata_05.sas
[22:34:26] - Failed to delete work/wudata_05.goe
[22:34:26] Warning:  check for stray files
[22:34:26] te work/wudata_05.goe
[22:34:26] Warning:  check for stray files
[22:36:26] ome Core Shutdown: EARLY_UNIT_END
[22:36:26] _UNIT_END
[22:36:26] Finalizing output
[22:36:30] CoreStatus = 7B (123)
[22:36:30] Sending work to server
[22:36:30] Project: 2665 (Run 2, Clone 165, Gen 71)


[22:36:30] + Attempting to send results [March 31 22:36:30 UTC]
[22:36:30] - Reading file work/wuresults_05.dat from core
[22:36:30]   (Read 9957 bytes from disk)
[22:36:30] Connecting to http://171.64.65.64:8080/
[22:36:30] Posted data.
[22:36:30] Initial: 0000; Conversation time very short, giving reduced weight in bandwidth avg
[22:36:30] - Uploaded at ~21 kB/s
[22:36:30] - Averaged speed for that direction ~350 kB/s
[22:36:30] + Results successfully sent
[22:36:30] Thank you for your contribution to Folding@Home.
The next unit completed successfully.

Several WUs later:

Project: 2665 (Run 0, Clone 576, Gen 70)

Code:

[09:56:48] + Closed connections
[09:56:48] 
[09:56:48] + Processing work unit
[09:56:48] Work type a1 not eligible for variable processors
[09:56:48] Core required: FahCore_a1.exe
[09:56:48] Core found.
[09:56:48] Using generic mpiexec calls
[09:56:48] Working on queue slot 00 [April 4 09:56:48 UTC]
[09:56:48] + Working ...
[09:56:48] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 00 -checkpoint 15 -verbose -lifeline 3704 -version 623'

[09:56:48] 
[09:56:48] *------------------------------*
[09:56:48] Folding@Home Gromacs SMP Core
[09:56:48] Version 1.74 (March 10, 2007)
[09:56:48] 
[09:56:48] Preparing to commence simulation
[09:56:48] - Ensuring status. Please wait.
[09:56:53] - Starting from initial work packet
[09:56:53] 
[09:56:53] Project: 2665 (Run 0, Clone 576, Gen 70)
[09:56:53] 
[09:56:54] Assembly optimizations on if available.
[09:56:54] Entering M.D.
[09:57:15]  percent)
[09:57:15] - Starting from initial work packet
[09:57:15] 
[09:57:15] Project: 2665 (Run 0, Clone 576, Gen 70)
[09:57:15] 
[09:57:16] Entering M.D.
[09:57:22] Rejecting checkpoint
[09:57:24] Protein: HGG in water
[09:57:24] Writing local files
[09:57:32] Extra SSE boost OK.
[09:57:32] Writing local files
[09:57:33] Completed 0 out of 250000 steps  (0 percent)
[10:12:34] Timered checkpoint triggered.
[10:12:59] Writing local files
[10:13:00] Completed 2500 out of 250000 steps  (1 percent)
           {snip}
[21:46:14] Writing local files
[21:46:14] Completed 115000 out of 250000 steps  (46 percent)
[21:56:38] Warning:  long 1-4 interactions
[21:56:38] Gromacs cannot continue further.
[21:56:38] Going to send back what have done.
[21:56:38] logfile size: 95075
[21:56:38] - Writing 95611 bytes of core data to disk...
[21:56:38]   ... Done.
[21:56:38] - Failed to delete work/wudata_00.arc
[21:56:38] No C.P. to delete.
[21:56:38] Warning:  check for stray files
[21:58:38] 
[21:58:38] Folding@home Core Shutdown: EARLY_UNIT_END
[21:58:38] 
[21:58:38] Folding@home Core Shutdown: EARLY_UNIT_END
[21:58:42] CoreStatus = 7B (123)
[21:58:42] Sending work to server
[21:58:42] Project: 2665 (Run 0, Clone 576, Gen 70)


[21:58:42] + Attempting to send results [April 4 21:58:42 UTC]
[21:58:42] - Reading file work/wuresults_00.dat from core
[21:58:42]   (Read 95611 bytes from disk)
[21:58:42] Connecting to http://171.64.65.64:8080/
[21:58:43] Posted data.
[21:58:43] Initial: 0000; - Uploaded at ~47 kB/s
[21:58:44] - Averaged speed for that direction ~211 kB/s
[21:58:44] + Results successfully sent
[21:58:44] Thank you for your contribution to Folding@Home.
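
For anyone else catching up on a long FAHlog, the failed units can be pulled out with a short script. This is only a minimal sketch: it assumes that a `CoreStatus = 7B (123)` line marks an EUE, as it does in the three logs above, and that the most recent `Project: ... (Run, Clone, Gen)` line identifies the unit. The names `find_eues` and `short_name` are made up for this example and are not part of the client.

```python
import re

# Patterns modelled on the log lines quoted in this thread (an assumption,
# not an official description of the log format).
PRCG_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((\d+)\)")


def find_eues(log_text):
    """Return (project, run, clone, gen) for each unit that ended with CoreStatus 7B."""
    current = None  # most recent work unit seen in the log
    failed = []
    for line in log_text.splitlines():
        unit = PRCG_RE.search(line)
        if unit:
            current = tuple(int(g) for g in unit.groups())
            continue
        status = STATUS_RE.search(line)
        if status and status.group(1).upper() == "7B" and current is not None:
            failed.append(current)
    return failed


def short_name(prcg):
    """Format a unit in the P/R/C/G shorthand used when reporting bad WUs."""
    return "P{} R{} C{} G{}".format(*prcg)


sample = """\
[17:42:11] Project: 2665 (Run 3, Clone 513, Gen 68)
[22:18:25] Folding@home Core Shutdown: EARLY_UNIT_END
[22:18:28] CoreStatus = 7B (123)
"""
print([short_name(w) for w in find_eues(sample)])  # ['P2665 R3 C513 G68']
```

Keying on the CoreStatus line rather than the shutdown message avoids double counting, since the shutdown message can appear twice per unit or arrive truncated when the core processes interleave their output.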

Re: Project: 2665 EUEs

Posted: Tue Apr 07, 2009 6:52 am
by susato
I've reported (P2665 R2 C165 G71) as a bad WU; ten other people showed EUEs for it.

Also (P2665 R3 C513 G68) was a bad WU.

Project 2665, Run 0, Clone 576, Gen 70 looked bad too.

Thanks for letting us know.

Re: Project: 2665 EUEs

Posted: Tue Apr 07, 2009 5:07 pm
by anko1
Thanks for checking, susato. Nice to know it's not always me creating the errors. :-)