
Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Thu Dec 04, 2008 7:24 pm
by bollix47
Running 6.23 on i7 940 @ stock using Ubuntu 8.10 with -smp 8 option and HT on.


[17:13:30] Entering M.D.
[17:18:46] Completed 2500 out of 250000 steps  (1%)
[17:23:52] Completed 5000 out of 250000 steps  (2%)
[17:29:00] Completed 7500 out of 250000 steps  (3%)
[17:34:07] Completed 10000 out of 250000 steps  (4%)
[17:39:15] Completed 12500 out of 250000 steps  (5%)
[17:44:21] Completed 15000 out of 250000 steps  (6%)
[17:49:27] Completed 17500 out of 250000 steps  (7%)
[17:54:33] Completed 20000 out of 250000 steps  (8%)
[17:59:41] Completed 22500 out of 250000 steps  (9%)
[18:04:51] Completed 25000 out of 250000 steps  (10%)
[18:09:58] Completed 27500 out of 250000 steps  (11%)
[18:15:07] Completed 30000 out of 250000 steps  (12%)
[18:20:16] Completed 32500 out of 250000 steps  (13%)
[18:25:22] Completed 35000 out of 250000 steps  (14%)
[18:30:30] Completed 37500 out of 250000 steps  (15%)
[18:35:36] Completed 40000 out of 250000 steps  (16%)
[18:40:45] Completed 42500 out of 250000 steps  (17%)
[18:45:52] Completed 45000 out of 250000 steps  (18%)
[18:51:00] Completed 47500 out of 250000 steps  (19%)
[18:56:06] Completed 50000 out of 250000 steps  (20%)
[19:01:14] Completed 52500 out of 250000 steps  (21%)
[19:06:22] Completed 55000 out of 250000 steps  (22%)
[19:11:27] Completed 57500 out of 250000 steps  (23%)
[19:12:39] CoreStatus = FF (255)
[19:12:39] Sending work to server
[19:12:39] Project: 2671 (Run 37, Clone 26, Gen 28)
[19:12:39] - Error: Could not get length of results file work/wuresults_04.dat
[19:12:39] - Error: Could not read unit 04 file. Removing from queue.
[19:12:39] Trying to send all finished work units
[19:12:39] + No unsent completed units remaining.
[19:12:39] - Preparing to get new work unit...
[19:12:39] + Attempting to get work packet
[19:12:39] - Will indicate memory of 3006 MB
[19:12:39] - Connecting to assignment server
[19:12:39] Connecting to http://assign.stanford.edu:8080/
[19:12:39] Posted data.
[19:12:39] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[19:12:39] + News From Folding@Home: Welcome to Folding@Home
[19:12:39] Loaded queue successfully.
[19:12:39] Connecting to http://171.64.65.56:8080/
[19:12:46] Posted data.
[19:12:46] Initial: 0000; - Receiving payload (expected size: 4832397)
[19:12:56] - Downloaded at ~471 kB/s
[19:12:56] - Averaged speed for that direction ~458 kB/s
[19:12:56] + Received work.
[19:12:56] Trying to send all finished work units
[19:12:56] + No unsent completed units remaining.
[19:12:56] + Closed connections
[19:13:01] 
[19:13:01] + Processing work unit
[19:13:01] Core required: FahCore_a2.exe
[19:13:01] Core found.
[19:13:01] Working on queue slot 05 [December 4 19:13:01 UTC]
[19:13:01] + Working ...
[19:13:01] - Calling './mpiexec -np 8 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 05 -checkpoint 30 -verbose -lifeline 11025 -version 623'

[19:13:01] 
[19:13:01] *------------------------------*
[19:13:01] Folding@Home Gromacs SMP Core
[19:13:01] Version 2.01 (Wed Aug 13 13:11:25 PDT 2008)
[19:13:01] 
[19:13:01] Preparing to commence simulation
[19:13:01] - Ensuring status. Please wait.
[19:13:11] - Looking at optimizations...
[19:13:11] - Working with standard loops on this execution.
[19:13:11] - Files status OK
[19:13:12] - Expanded 4831885 -> 23976217 (decompressed 496.2 percent)
[19:13:12] Called DecompressByteArray: compressed_data_size=4831885 data_size=23976217, decompressed_data_size=23976217 diff=0
[19:13:12] - Digital signature verified
[19:13:12] 
[19:13:12] Project: 2669 (Run 8, Clone 69, Gen 33)
As can be seen in the log, the client immediately downloaded a different project, so it didn't go through the usual 'try it 3 times' before moving on.

Re: Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Thu Dec 04, 2008 8:27 pm
by kasson
Yep--that's the version 6.23 core working for you and automatically reporting the WU. :)

Re: Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Thu Dec 04, 2008 8:34 pm
by bollix47
kasson wrote:Yep--that's the version 6.23 core working for you and automatically reporting the WU. :)

Very nice! :wink:

Does that mean when WUs using 6.23 abort like this we no longer have to report them here :?:

Re: Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Thu Dec 04, 2008 8:42 pm
by toTOW
Wasn't Error FF supposed to be fixed in the 2.01 A2 core?

Re: Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Thu Dec 04, 2008 8:42 pm
by kasson
Yes--if 6.23 correctly catches the EUE, i.e., you see the
[19:12:39] Sending work to server
message, the client is correctly reporting the EUE and you don't need to report them here.

Re: Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Thu Dec 04, 2008 8:43 pm
by toTOW
No, there's nothing to send :( :
[19:12:39] Sending work to server
[19:12:39] Project: 2671 (Run 37, Clone 26, Gen 28)
[19:12:39] - Error: Could not get length of results file work/wuresults_04.dat
[19:12:39] - Error: Could not read unit 04 file. Removing from queue.

Re: Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Fri Dec 05, 2008 3:53 pm
by kasson
Apologies--that is correct. This is being fixed in a new version of the client, currently under preparation. Right now, the client is trying to do the right thing, but if the files are not there it doesn't succeed. The new version should tell the server that it tried anyway. :)
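
For anyone who wants to check their own logs for this case, here is a rough Python sketch of the distinction discussed above. It assumes the log lines look like the excerpts in this thread ("[HH:MM:SS] message"). The function name and the four labels are made up for illustration and are not part of the client. The "reported" label only means no read errors followed the send attempt, since this thread does not show what a successful EUE report looks like in the log.

```python
import re

# Assumed line shape, taken from the excerpts in this thread: "[HH:MM:SS] message"
LINE_RE = re.compile(r"^\[(\d{2}:\d{2}:\d{2})\]\s?(.*)$")


def classify_eue(log_text):
    """Hypothetical helper: returns 'no_eue', 'not_reported',
    'report_failed', or 'reported' for the first CoreStatus = FF found."""
    messages = []
    for line in log_text.splitlines():
        m = LINE_RE.match(line.strip())
        if m:
            messages.append(m.group(2).strip())

    # Find the first early unit end (EUE) reported by the core.
    start = None
    for i, msg in enumerate(messages):
        if msg.startswith("CoreStatus = FF"):
            start = i
            break
    if start is None:
        return "no_eue"

    # Only look at what happens before the client fetches new work.
    window = []
    for msg in messages[start + 1:]:
        if "Preparing to get new work unit" in msg:
            break
        window.append(msg)

    if "Sending work to server" not in window:
        return "not_reported"
    # The errors seen in this thread: results file missing, nothing was sent.
    if any(msg.startswith("- Error: Could not") for msg in window):
        return "report_failed"
    return "reported"


sample = """\
[19:11:27] Completed 57500 out of 250000 steps  (23%)
[19:12:39] CoreStatus = FF (255)
[19:12:39] Sending work to server
[19:12:39] Project: 2671 (Run 37, Clone 26, Gen 28)
[19:12:39] - Error: Could not get length of results file work/wuresults_04.dat
[19:12:39] - Error: Could not read unit 04 file. Removing from queue.
[19:12:39] Trying to send all finished work units
"""
print(classify_eue(sample))  # prints: report_failed
```

On the log from the first post this gives "report_failed", which matches toTOW's reading: the client tried to send, but there was no results file to read.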

Re: Project: 2671 (Run 37, Clone 26, Gen 28)

Posted: Fri Dec 05, 2008 4:12 pm
by Xilikon
kasson wrote:Apologies--that is correct. This is being fixed in a new version of the client, currently under preparation. Right now, the client is trying to do the right thing, but if the files are not there it doesn't succeed. The new version should tell the server that it tried anyway. :)
Yes, this is what we want. What matters is that there is a record somewhere that the client tried to report an EUE. That is as important as the report itself, because a failed report means the files are messed up in some way.