Project: 6892 (Run 999, Clone 12, Gen 56) - EUE

Posted: Mon Jan 30, 2012 6:45 pm
by Fahrenheit451
I had trouble with this WU ([22:36:03] Project: 6892 (Run 999, Clone 12, Gen 56)). The client stopped at 30% with EARLY_UNIT_END.

The client runs on Windows Vista Ultimate 32-bit.

--- Opening Log file [January 22 18:34:53 UTC] 


# Windows CPU Systray Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\USERNAME\AppData\Roaming\Folding@home-x86
Arguments: -advmethods -verbosity 9 

[18:34:53] - Ask before connecting: No
[18:34:53] - User name: superduper4711 (Team 0)
[18:34:53] - User ID: ECCE00A5AF7AA44
[18:34:53] - Machine ID: 1

<SNIP>...different successful finished WU's between January 22 and January 29...</SNIP>


[22:35:31] 
[22:35:31] Folding@home Core Shutdown: FINISHED_UNIT
[22:35:34] CoreStatus = 64 (100)
[22:35:34] Unit 3 finished with 90 percent of time to deadline remaining.
[22:35:34] Updated performance fraction: 0.900523
[22:35:34] Sending work to server
[22:35:34] Project: 8001 (Run 38, Clone 23, Gen 33)
[22:35:34] - Read packet limit of 540015616... Set to 524286976.


[22:35:34] + Attempting to send results [January 29 22:35:34 UTC]
[22:35:34] - Reading file work/wuresults_03.dat from core
[22:35:34]   (Read 1196689 bytes from disk)
[22:35:34] Connecting to http://171.67.108.58:8080/
[22:35:53] Posted data.
[22:35:53] Initial: 0000; - Uploaded at ~61 kB/s
[22:35:53] - Averaged speed for that direction ~61 kB/s
[22:35:53] + Results successfully sent
[22:35:53] Thank you for your contribution to Folding@Home.
[22:35:53] + Number of Units Completed: 499

[22:35:57] Trying to send all finished work units
[22:35:57] + No unsent completed units remaining.
[22:35:57] - Preparing to get new work unit...
[22:35:57] + Attempting to get work packet
[22:35:57] - Will indicate memory of 2045 MB
[22:35:57] - Connecting to assignment server
[22:35:57] Connecting to http://assign.stanford.edu:8080/
[22:35:58] Posted data.
[22:35:58] Initial: 43AB; - Successful: assigned to (171.67.108.53).
[22:35:58] + News From Folding@Home: Welcome to Folding@Home
[22:35:58] Loaded queue successfully.
[22:35:58] Connecting to http://171.67.108.53:8080/
[22:36:00] Posted data.
[22:36:00] Initial: 0000; - Receiving payload (expected size: 663060)
[22:36:03] - Downloaded at ~215 kB/s
[22:36:03] - Averaged speed for that direction ~193 kB/s
[22:36:03] + Received work.
[22:36:03] Trying to send all finished work units
[22:36:03] + No unsent completed units remaining.
[22:36:03] + Closed connections
[22:36:03] 
[22:36:03] + Processing work unit
[22:36:03] Core required: FahCore_78.exe
[22:36:03] Core found.
[22:36:03] Working on queue slot 04 [January 29 22:36:03 UTC]
[22:36:03] + Working ...
[22:36:03] - Calling '.\FahCore_78.exe -dir work/ -suffix 04 -checkpoint 6 -verbose -lifeline 5868 -version 623'

[22:36:03] 
[22:36:03] *------------------------------*
[22:36:03] Folding@Home Gromacs Core
[22:36:03] Version 1.90 (March 8, 2006)
[22:36:03] 
[22:36:03] Preparing to commence simulation
[22:36:03] - Looking at optimizations...
[22:36:03] - Created dyn
[22:36:03] - Files status OK
[22:36:03] - Expanded 662548 -> 3332352 (decompressed 502.9 percent)
[22:36:03] - Starting from initial work packet
[22:36:03] 
[22:36:03] Project: 6892 (Run 999, Clone 12, Gen 56)
[22:36:03] 
[22:36:03] Assembly optimizations on if available.
[22:36:03] Entering M.D.
[22:36:10] Protein: ALZHEIMER DISEASE AMYLOID
[22:36:10] 
[22:36:10] Writing local files
[22:39:18] Extra SSE boost OK.
[22:39:18] Writing local files
[22:39:18] Completed 0 out of 250000 steps  (0%)
[22:45:18] Timered checkpoint triggered.
[22:48:34] Writing local files
[22:48:34] Completed 2500 out of 250000 steps  (1%)
[22:54:35] Timered checkpoint triggered.
[22:58:11] Writing local files
[22:58:11] Completed 5000 out of 250000 steps  (2%)
[23:04:12] Timered checkpoint triggered.
[23:07:16] Writing local files
[23:07:16] Completed 7500 out of 250000 steps  (3%)
[23:13:16] Timered checkpoint triggered.
[23:15:42] Writing local files
[23:15:42] Completed 10000 out of 250000 steps  (4%)
[23:21:42] Timered checkpoint triggered.
[23:24:05] Writing local files
[23:24:05] Completed 12500 out of 250000 steps  (5%)
[23:30:06] Timered checkpoint triggered.
[23:33:35] Writing local files
[23:33:35] Completed 15000 out of 250000 steps  (6%)
[23:39:35] Timered checkpoint triggered.
[23:41:02] Writing local files
[23:41:02] Completed 17500 out of 250000 steps  (7%)
[23:47:02] Timered checkpoint triggered.
[23:49:07] Writing local files
[23:49:07] Completed 20000 out of 250000 steps  (8%)
[23:55:07] Timered checkpoint triggered.
[23:56:38] Writing local files
[23:56:38] Completed 22500 out of 250000 steps  (9%)
[00:02:38] Timered checkpoint triggered.
[00:04:28] Writing local files
[00:04:28] Completed 25000 out of 250000 steps  (10%)
[00:10:27] Timered checkpoint triggered.
[00:12:33] Writing local files
[00:12:33] Completed 27500 out of 250000 steps  (11%)
[00:18:33] Timered checkpoint triggered.
[00:21:18] Writing local files
[00:21:18] Completed 30000 out of 250000 steps  (12%)
[00:27:19] Timered checkpoint triggered.
[00:29:40] Writing local files
[00:29:40] Completed 32500 out of 250000 steps  (13%)
[00:34:18] - Autosending finished units... [January 30 00:34:18 UTC]
[00:34:18] Trying to send all finished work units
[00:34:18] + No unsent completed units remaining.
[00:34:18] - Autosend completed
[00:34:18] + Working...
[00:35:40] Timered checkpoint triggered.
[00:38:06] Writing local files
[00:38:06] Completed 35000 out of 250000 steps  (14%)
[00:44:05] Timered checkpoint triggered.
[00:46:37] Writing local files
[00:46:37] Completed 37500 out of 250000 steps  (15%)
[00:52:36] Timered checkpoint triggered.
[00:55:16] Writing local files
[00:55:17] Completed 40000 out of 250000 steps  (16%)
[01:01:16] Timered checkpoint triggered.
[01:03:36] Writing local files
[01:03:36] Completed 42500 out of 250000 steps  (17%)
[01:09:36] Timered checkpoint triggered.
[01:12:29] Writing local files
[01:12:29] Completed 45000 out of 250000 steps  (18%)
[01:18:29] Timered checkpoint triggered.
[01:21:16] Writing local files
[01:21:16] Completed 47500 out of 250000 steps  (19%)
[01:27:16] Timered checkpoint triggered.
[01:30:40] Writing local files
[01:30:40] Completed 50000 out of 250000 steps  (20%)
[01:36:40] Timered checkpoint triggered.
[01:38:46] Writing local files
[01:38:46] Completed 52500 out of 250000 steps  (21%)
[01:44:46] Timered checkpoint triggered.
[01:47:05] Writing local files
[01:47:05] Completed 55000 out of 250000 steps  (22%)
[01:53:05] Timered checkpoint triggered.
[01:55:29] Writing local files
[01:55:29] Completed 57500 out of 250000 steps  (23%)
[02:01:30] Timered checkpoint triggered.
[02:03:41] Writing local files
[02:03:41] Completed 60000 out of 250000 steps  (24%)
[02:09:42] Timered checkpoint triggered.
[02:11:37] Writing local files
[02:11:37] Completed 62500 out of 250000 steps  (25%)
[02:17:38] Timered checkpoint triggered.
[02:19:36] Writing local files
[02:19:36] Completed 65000 out of 250000 steps  (26%)
[02:25:37] Timered checkpoint triggered.
[02:27:38] Writing local files
[02:27:38] Completed 67500 out of 250000 steps  (27%)
[02:33:39] Timered checkpoint triggered.
[02:35:13] Writing local files
[02:35:13] Completed 70000 out of 250000 steps  (28%)
[02:41:13] Timered checkpoint triggered.
[02:43:16] Writing local files
[02:43:16] Completed 72500 out of 250000 steps  (29%)
[02:49:16] Timered checkpoint triggered.
[02:51:24] Writing local files
[02:51:24] Completed 75000 out of 250000 steps  (30%)
[02:54:44] Gromacs cannot continue further.
[02:54:44] Going to send back what have done.
[02:54:44] logfile size: 8669
[02:54:44] - Writing 9205 bytes of core data to disk...
[02:54:44] Done: 8693 -> 3375 (compressed to 38.8 percent)
[02:54:44]   ... Done.
[02:54:44] 
[02:54:44] Folding@home Core Shutdown: EARLY_UNIT_END
[02:54:47] CoreStatus = 72 (114)
[02:54:47] Sending work to server
[02:54:47] Project: 6892 (Run 999, Clone 12, Gen 56)
[02:54:47] - Read packet limit of 540015616... Set to 524286976.


[02:54:47] + Attempting to send results [January 30 02:54:47 UTC]
[02:54:47] - Reading file work/wuresults_04.dat from core
[02:54:47]   (Read 3887 bytes from disk)
[02:54:47] Connecting to http://171.67.108.53:8080/
[02:54:48] Posted data.
[02:54:48] Initial: 0000; - Uploaded at ~4 kB/s
[02:54:48] - Averaged speed for that direction ~50 kB/s
[02:54:48] + Results successfully sent
[02:54:48] Thank you for your contribution to Folding@Home.
[02:54:52] Trying to send all finished work units
[02:54:52] + No unsent completed units remaining.
[02:54:52] - Preparing to get new work unit...
[02:54:52] + Attempting to get work packet
[02:54:52] - Will indicate memory of 2045 MB
[02:54:52] - Connecting to assignment server
[02:54:52] Connecting to http://assign.stanford.edu:8080/
[02:54:53] Posted data.
[02:54:53] Initial: 43AB; - Successful: assigned to (171.67.108.59).
[02:54:53] + News From Folding@Home: Welcome to Folding@Home
[02:54:53] Loaded queue successfully.
[02:54:53] Connecting to http://171.67.108.59:8080/
[02:54:54] Posted data.
[02:54:54] Initial: 0000; - Receiving payload (expected size: 545190)
[02:54:58] - Downloaded at ~133 kB/s
[02:54:58] - Averaged speed for that direction ~181 kB/s
[02:54:58] + Received work.
[02:54:58] Trying to send all finished work units
[02:54:58] + No unsent completed units remaining.
[02:54:58] + Closed connections
[02:55:03] 
[02:55:03] + Processing work unit
[02:55:03] Core required: FahCore_a4.exe
[02:55:03] Core found.
[02:55:03] Working on queue slot 05 [January 30 02:55:03 UTC]
[02:55:03] + Working ...
[02:55:03] - Calling '.\FahCore_a4.exe -dir work/ -suffix 05 -checkpoint 6 -verbose -lifeline 5868 -version 623'

[02:55:04] 
[02:55:04] *------------------------------*
[02:55:04] Folding@Home Gromacs GB Core
[02:55:04] Version 2.27 (Dec. 15, 2010)
[02:55:04] 
[02:55:04] Preparing to commence simulation
[02:55:04] - Looking at optimizations...
[02:55:04] - Created dyn
[02:55:04] - Files status OK
[02:55:04] - Expanded 544678 -> 1305312 (decompressed 239.6 percent)
[02:55:04] Called DecompressByteArray: compressed_data_size=544678 data_size=1305312, decompressed_data_size=1305312 diff=0
[02:55:04] - Digital signature verified
[02:55:04] 
[02:55:04] Project: 8004 (Run 64, Clone 23, Gen 57)
[02:55:04] 

I never saw this message in the log: "Timered checkpoint triggered." What does it mean?

Re: Project: 6892 (Run 999, Clone 12, Gen 56) - EUE

Posted: Mon Jan 30, 2012 7:52 pm
by PantherX
The WU has 3 failures so I have marked it as a bad WU:
The WU (P6892,R999,C12,G56) has been reported as a bad WU.
Thanks for your report.

Re: Project: 6892 (Run 999, Clone 12, Gen 56) - EUE

Posted: Mon Jan 30, 2012 8:36 pm
by bruce
Fahrenheit451 wrote: I never saw this message in the log: "Timered checkpoint triggered." What does it mean?
[22:39:18] Completed 0 out of 250000 steps (0%)
[22:45:18] Timered checkpoint triggered.
[22:48:34] Writing local files
[22:48:34] Completed 2500 out of 250000 steps (1%)

Your TPF (time per frame) is (22:48:34 - 22:39:18) = 9:16
Your checkpoint setting is (22:45:18 - 22:39:18) = 6:00

Since each frame takes longer than the checkpoint interval, the 6-minute timer interrupts processing to write an extra checkpoint partway through the frame.
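
If you want to check those two numbers yourself, here is a minimal Python sketch of the same subtraction, run on the four log lines quoted above. The helper names (parse, elapsed, fmt) are made up for this example and are not part of any FAH tool. It assumes every line starts with an [HH:MM:SS] stamp, and because the stamps carry no date it wraps the difference at midnight.

```python
import re

# Matches a leading "[HH:MM:SS]" stamp followed by the message text.
STAMP = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\]\s*(.*)$")

def parse(line):
    """Return (seconds since midnight, message), or None if the line has no stamp."""
    m = STAMP.match(line)
    if m is None:
        return None
    h, mi, s, msg = m.groups()
    return int(h) * 3600 + int(mi) * 60 + int(s), msg

def elapsed(start, end):
    """Seconds from start to end; the log has no dates, so wrap at midnight."""
    return (end - start) % 86400

def fmt(seconds):
    """Format a duration as M:SS."""
    return f"{seconds // 60}:{seconds % 60:02d}"

log = """\
[22:39:18] Completed 0 out of 250000 steps (0%)
[22:45:18] Timered checkpoint triggered.
[22:48:34] Writing local files
[22:48:34] Completed 2500 out of 250000 steps (1%)
"""

events = [p for p in map(parse, log.splitlines()) if p]
frames = [t for t, msg in events if msg.startswith("Completed")]
checkpoints = [t for t, msg in events if msg.startswith("Timered checkpoint")]

tpf = elapsed(frames[0], frames[1])
checkpoint_interval = elapsed(frames[0], checkpoints[0])

print("TPF:", fmt(tpf))
print("Checkpoint interval:", fmt(checkpoint_interval))
```

The midnight wrap matters for a log like this one, where the frames run from 23:55 into 00:02; a plain subtraction would go negative there.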

When you set -verbosity 9, you'll get non-essential messages like that. Remove that flag and you won't see the messages.

Re: Project: 6892 (Run 999, Clone 12, Gen 56) - EUE

Posted: Mon Jan 30, 2012 10:19 pm
by Fahrenheit451
Thank you both. I always learn new things about FAH :-)