general questions about WUs
Posted: Tue Feb 24, 2009 10:17 pm
I have a few rather quick general questions about WUs:
1) how often or how many times does a WU gets resent out? (or how does the PandeGroup determine if and when they need to send it back out again?)
2) Where would I be able to find a table with what all of the different core stati meanings and definitions?
3) Why is it that sometimes the client would automatically download the same WU a number of times before it is able to properly start the run? (see Fahlog below).
4) When we post problems with the WUs here and some of the site admins/moderators report back with the number of times (if any) that particular WU shows up in the database, what it is supposed to mean? Is there any way to tell from within that database whether the returned WU was valid, or does it only stipulate that the WU has be returned for (where applicable) appropriate credit value?
1) how often or how many times does a WU gets resent out? (or how does the PandeGroup determine if and when they need to send it back out again?)
2) Where would I be able to find a table with what all of the different core stati meanings and definitions?
3) Why is it that sometimes the client would automatically download the same WU a number of times before it is able to properly start the run? (see Fahlog below).
4) When we post problems with the WUs here and some of the site admins/moderators report back with the number of times (if any) that particular WU shows up in the database, what it is supposed to mean? Is there any way to tell from within that database whether the returned WU was valid, or does it only stipulate that the WU has be returned for (where applicable) appropriate credit value?
Code: Select all
[16:28:25] - Warning: Could not delete all work unit files (5): Core file absent
[16:28:25] Trying to send all finished work units
[16:28:25] + No unsent completed units remaining.
[16:28:25] - Preparing to get new work unit...
[16:28:25] + Attempting to get work packet
[16:28:25] - Will indicate memory of 16003 MB
[16:28:25] - Connecting to assignment server
[16:28:25] Connecting to http://assign.stanford.edu:8080/
[16:28:25] Posted data.
[16:28:25] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[16:28:25] + News From Folding@Home: Welcome to Folding@Home
[16:28:25] Loaded queue successfully.
[16:28:25] Connecting to http://171.67.108.24:8080/
[16:28:32] Posted data.
[16:28:32] Initial: 0000; - Receiving payload (expected size: 4862689)
[16:28:45] - Downloaded at ~365 kB/s
[16:28:45] - Averaged speed for that direction ~339 kB/s
[16:28:45] + Received work.
[16:28:45] Trying to send all finished work units
[16:28:45] + No unsent completed units remaining.
[16:28:45] + Closed connections
[16:28:45]
[16:28:45] + Processing work unit
[16:28:45] Core required: FahCore_a2.exe
[16:28:45] Core found.
[16:28:45] Working on queue slot 06 [February 24 16:28:45 UTC]
[16:28:45] + Working ...
[16:28:45] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 06 -checkpoint 15 -verbose -lifeline 18269 -version 624'
[16:28:45]
[16:28:45] *------------------------------*
[16:28:45] Folding@Home Gromacs SMP Core
[16:28:45] Version 2.04 (Thu Jan 29 16:43:57 PST 2009)
[16:28:45]
[16:28:45] Preparing to commence simulation
[16:28:45] - Ensuring status. Please wait.
[16:28:46] Called DecompressByteArray: compressed_data_size=4862177 data_size=24067137, decompressed_data_size=24067137 diff=0
[16:28:46] - Digital signature verified
[16:28:46]
[16:28:46] Project: 2676 (Run 1, Clone 121, Gen 12)
[16:28:46]
[16:28:47] Assembly optimizations on if available.
[16:28:47] Entering M.D.
[16:28:53] Will resume from checkpoint file
[16:28:56] ng M.D.
[16:29:02] Will resume from checkpoint file
[16:29:05] fcCheckPointResume: file hashes different -- aborting.
[16:29:09] CoreStatus = FF (255)
[16:29:09] Sending work to server
[16:29:09] Project: 2676 (Run 1, Clone 121, Gen 12)
[16:29:09] - Error: Could not get length of results file work/wuresults_06.dat
[16:29:09] - Error: Could not read unit 06 file. Removing from queue.
[16:29:09] Trying to send all finished work units
[16:29:09] + No unsent completed units remaining.
[16:29:09] - Preparing to get new work unit...
[16:29:09] + Attempting to get work packet
[16:29:09] - Will indicate memory of 16003 MB
[16:29:09] - Connecting to assignment server
[16:29:09] Connecting to http://assign.stanford.edu:8080/
[16:29:10] Posted data.
[16:29:10] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[16:29:10] + News From Folding@Home: Welcome to Folding@Home
[16:29:10] Loaded queue successfully.
[16:29:10] Connecting to http://171.67.108.24:8080/
[16:29:16] Posted data.
[16:29:16] Initial: 0000; - Receiving payload (expected size: 4862689)
[16:29:26] - Downloaded at ~474 kB/s
[16:29:26] - Averaged speed for that direction ~366 kB/s
[16:29:26] + Received work.
[16:29:26] Trying to send all finished work units
[16:29:26] + No unsent completed units remaining.
[16:29:26] + Closed connections
[16:29:31]
[16:29:31] + Processing work unit
[16:29:31] Core required: FahCore_a2.exe
[16:29:31] Core found.
[16:29:31] Working on queue slot 07 [February 24 16:29:31 UTC]
[16:29:31] + Working ...
[16:29:31] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 18269 -version 624'
[16:29:32]
[16:29:32] *------------------------------*
[16:29:32] Folding@Home Gromacs SMP Core
[16:29:32] Version 2.04 (Thu Jan 29 16:43:57 PST 2009)
[16:29:32]
[16:29:32] Preparing to commence simulation
[16:29:32] - Ensuring status. Please wait.
[16:29:33] Called DecompressByteArray: compressed_data_size=4862177 data_size=24067137, decompressed_data_size=24067137 diff=0
[16:29:33] - Digital signature verified
[16:29:33]
[16:29:33] Project: 2676 (Run 1, Clone 121, Gen 12)
[16:29:33]
[16:29:33] Assembly optimizations on if available.
[16:29:33] Entering M.D.
[16:29:39] Will resume from checkpoint file
[16:29:43] ng M.D.
[16:29:49] Will resume from checkpoint file
[16:29:52] fcCheckPointResume: file hashes different -- aborting.
[16:29:56] CoreStatus = FF (255)
[16:29:56] Sending work to server
[16:29:56] Project: 2676 (Run 1, Clone 121, Gen 12)
[16:29:56] - Error: Could not get length of results file work/wuresults_07.dat
[16:29:56] - Error: Could not read unit 07 file. Removing from queue.
[16:29:56] Trying to send all finished work units
[16:29:56] + No unsent completed units remaining.
[16:29:56] - Preparing to get new work unit...
[16:29:56] + Attempting to get work packet
[16:29:56] - Will indicate memory of 16003 MB
[16:29:56] - Connecting to assignment server
[16:29:56] Connecting to http://assign.stanford.edu:8080/
[16:29:56] Posted data.
[16:29:56] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[16:29:56] + News From Folding@Home: Welcome to Folding@Home
[16:29:56] Loaded queue successfully.
[16:29:56] Connecting to http://171.67.108.24:8080/
[16:30:02] Posted data.
[16:30:02] Initial: 0000; - Receiving payload (expected size: 4862689)
[16:30:14] - Downloaded at ~395 kB/s
[16:30:14] - Averaged speed for that direction ~372 kB/s
[16:30:14] + Received work.
[16:30:14] Trying to send all finished work units
[16:30:14] + No unsent completed units remaining.
[16:30:14] + Closed connections
[16:30:19]
[16:30:19] + Processing work unit
[16:30:19] Core required: FahCore_a2.exe
[16:30:19] Core found.
[16:30:19] Working on queue slot 08 [February 24 16:30:19 UTC]
[16:30:19] + Working ...
[16:30:19] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 08 -checkpoint 15 -verbose -lifeline 18269 -version 624'
[16:30:19]
[16:30:19] *------------------------------*
[16:30:19] Folding@Home Gromacs SMP Core
[16:30:19] Version 2.04 (Thu Jan 29 16:43:57 PST 2009)
[16:30:19]
[16:30:19] Preparing to commence simulation
[16:30:19] - Ensuring status. Please wait.
[16:30:28] - Looking at optimizations...
[16:30:28] - Working with standard loops on this execution.
[16:30:28] - Files status OK
[16:30:29] - Expanded 4862177 -> 24067137 (decompressed 494.9 percent)
[16:30:29] Called DecompressByteArray: compressed_data_size=4862177 data_size=24067137, decompressed_data_size=24067137 diff=0
[16:30:29] - Digital signature verified
[16:30:29]
[16:30:29] Project: 2676 (Run 1, Clone 121, Gen 12)
[16:30:29]
[16:30:30] Entering M.D.
[16:39:16] Completed 2500 out of 250000 steps (1%)
[16:47:54] Completed 5000 out of 250000 steps (2%)
[16:56:36] Completed 7500 out of 250000 steps (3%)
[17:02:25] - Autosending finished units... [February 24 17:02:25 UTC]
[17:02:25] Trying to send all finished work units
[17:02:25] + No unsent completed units remaining.
[17:02:25] - Autosend completed
[17:05:18] Completed 10000 out of 250000 steps (4%)
[17:13:59] Completed 12500 out of 250000 steps (5%)
[17:22:38] Completed 15000 out of 250000 steps (6%)
[17:31:19] Completed 17500 out of 250000 steps (7%)
[17:40:02] Completed 20000 out of 250000 steps (8%)
[17:48:46] Completed 22500 out of 250000 steps (9%)
[17:57:31] Completed 25000 out of 250000 steps (10%)
[18:06:18] Completed 27500 out of 250000 steps (11%)
[18:15:06] Completed 30000 out of 250000 steps (12%)
[18:23:54] Completed 32500 out of 250000 steps (13%)
[18:32:42] Completed 35000 out of 250000 steps (14%)
[18:41:30] Completed 37500 out of 250000 steps (15%)
[18:50:18] Completed 40000 out of 250000 steps (16%)
[18:59:08] Completed 42500 out of 250000 steps (17%)
[19:07:58] Completed 45000 out of 250000 steps (18%)
[19:16:47] Completed 47500 out of 250000 steps (19%)
[19:25:35] Completed 50000 out of 250000 steps (20%)
[19:34:23] Completed 52500 out of 250000 steps (21%)
[19:43:08] Completed 55000 out of 250000 steps (22%)
[19:51:52] Completed 57500 out of 250000 steps (23%)
[20:00:36] Completed 60000 out of 250000 steps (24%)
[20:09:20] Completed 62500 out of 250000 steps (25%)
[20:18:03] Completed 65000 out of 250000 steps (26%)
[20:26:47] Completed 67500 out of 250000 steps (27%)
[20:35:32] Completed 70000 out of 250000 steps (28%)
[20:44:16] Completed 72500 out of 250000 steps (29%)
[20:53:01] Completed 75000 out of 250000 steps (30%)
[21:01:44] Completed 77500 out of 250000 steps (31%)
[21:10:23] Completed 80000 out of 250000 steps (32%)
[21:19:02] Completed 82500 out of 250000 steps (33%)
[21:27:42] Completed 85000 out of 250000 steps (34%)
[21:36:23] Completed 87500 out of 250000 steps (35%)
[21:45:05] Completed 90000 out of 250000 steps (36%)
[21:53:48] Completed 92500 out of 250000 steps (37%)
[22:02:31] Completed 95000 out of 250000 steps (38%)
[22:11:14] Completed 97500 out of 250000 steps (39%)
[22:19:57] Completed 100000 out of 250000 steps (40%)