
WU completed but not sent, work started on another WU?

Posted: Wed Aug 26, 2009 7:16 am
by Abraham54
WU 6301 (Run 377, Clone 3, Gen 26) is finished, but there are connection problems, so this WU is being kept in the queue. Work has now started on WU 4461 (Run 205, Clone 1, Gen 31), which was apparently kept in reserve?

Code:

[01:56:14] Completed 1500000 out of 1500000 steps  (100%)
[01:56:14] Writing final coordinates.
[01:56:15] Past main M.D. loop
[01:57:15] 
[01:57:15] Finished Work Unit:
[01:57:15] - Reading up to 188448 from "work/wudata_09.arc": Read 188448
[01:57:15] - Reading up to 17520 from "work/wudata_09.xtc": Read 17520
[01:57:15] goefile size: 0
[01:57:15] logfile size: 89116
[01:57:15] Leaving Run
[01:57:16] - Writing 302088 bytes of core data to disk...
[01:57:16] Done: 301576 -> 209892 (compressed to 69.5 percent)
[01:57:16]   ... Done.
[01:57:16] - Shutting down core
[01:57:16] 
[01:57:16] Folding@home Core Shutdown: FINISHED_UNIT
[01:57:18] CoreStatus = 64 (100)
[01:57:18] Sending work to server
[01:57:18] Project: 4461 (Run 205, Clone 1, Gen 31)


[01:57:18] + Attempting to send results [August 26 01:57:18 UTC]
[01:57:20] - Couldn't send HTTP request to server
[01:57:20] + Could not connect to Work Server (results)
[01:57:20]     (171.67.108.13:8080)
[01:57:20] + Retrying using alternative port
[01:57:21] - Couldn't send HTTP request to server
[01:57:21] + Could not connect to Work Server (results)
[01:57:21]     (171.67.108.13:80)
[01:57:21] - Error: Could not transmit unit 09 (completed August 26) to work server.
[01:57:21]   Keeping unit 09 in queue.
[01:57:21] Project: 4461 (Run 205, Clone 1, Gen 31)


[01:57:21] + Attempting to send results [August 26 01:57:21 UTC]
[01:57:23] - Couldn't send HTTP request to server
[01:57:23] + Could not connect to Work Server (results)
[01:57:23]     (171.67.108.13:8080)
[01:57:23] + Retrying using alternative port
[01:57:24] - Couldn't send HTTP request to server
[01:57:24] + Could not connect to Work Server (results)
[01:57:24]     (171.67.108.13:80)
[01:57:24] - Error: Could not transmit unit 09 (completed August 26) to work server.


[01:57:24] + Attempting to send results [August 26 01:57:24 UTC]
[01:57:25] - Couldn't send HTTP request to server
[01:57:25]   (Got status 503)
[01:57:25] + Could not connect to Work Server (results)
[01:57:25]     (171.67.108.17:8080)
[01:57:25] + Retrying using alternative port
[01:57:25] - Couldn't send HTTP request to server
[01:57:25]   (Got status 503)
[01:57:25] + Could not connect to Work Server (results)
[01:57:25]     (171.67.108.17:80)
[01:57:25]   Could not transmit unit 09 to Collection server; keeping in queue.
[01:57:25] - Preparing to get new work unit...
[01:57:25] + Attempting to get work packet
[01:57:25] - Connecting to assignment server
[01:57:27] - Successful: assigned to (171.64.65.111).
[01:57:27] + News From Folding@Home: Welcome to Folding@Home
[01:57:27] Loaded queue successfully.
[01:57:31] Project: 4461 (Run 205, Clone 1, Gen 31)


[01:57:31] + Attempting to send results [August 26 01:57:31 UTC]
[01:57:32] - Couldn't send HTTP request to server
[01:57:32] + Could not connect to Work Server (results)
[01:57:32]     (171.67.108.13:8080)
[01:57:32] + Retrying using alternative port
[01:57:34] - Couldn't send HTTP request to server
[01:57:34] + Could not connect to Work Server (results)
[01:57:34]     (171.67.108.13:80)
[01:57:34] - Error: Could not transmit unit 09 (completed August 26) to work server.


[01:57:34] + Attempting to send results [August 26 01:57:34 UTC]
[01:57:34] - Couldn't send HTTP request to server
[01:57:34]   (Got status 503)
[01:57:34] + Could not connect to Work Server (results)
[01:57:34]     (171.67.108.17:8080)
[01:57:34] + Retrying using alternative port
[01:57:35] - Couldn't send HTTP request to server
[01:57:35]   (Got status 503)
[01:57:35] + Could not connect to Work Server (results)
[01:57:35]     (171.67.108.17:80)
[01:57:35]   Could not transmit unit 09 to Collection server; keeping in queue.
[01:57:35] + Closed connections
[01:57:35] 
[01:57:35] + Processing work unit
[01:57:35] Core required: FahCore_78.exe
[01:57:35] Core found.
[01:57:35] Working on queue slot 00 [August 26 01:57:35 UTC]
[01:57:35] + Working ...
[01:57:35] 
[01:57:35] *------------------------------*
[01:57:35] Folding@Home Gromacs Core
[01:57:35] Version 1.90 (March 8, 2006)
[01:57:35] 
[01:57:35] Preparing to commence simulation
[01:57:35] - Looking at optimizations...
[01:57:35] - Created dyn
[01:57:35] - Files status OK
[01:57:35] - Expanded 457269 -> 2243997 (decompressed 490.7 percent)
[01:57:35] - Starting from initial work packet
[01:57:35] 
[01:57:35] Project: 6301 (Run 377, Clone 3, Gen 26)
[01:57:35] 
[01:57:36] Assembly optimizations on if available.
[01:57:36] Entering M.D.
[01:57:42] Protein: p6301_sh3_with_ALA_frags
[01:57:42] 
[01:57:42] Writing local files
[01:57:42] Extra SSE boost OK.
[01:57:42] Writing local files
[01:57:42] Completed 0 out of 500000 steps  (0%)
[02:08:22] Project: 4461 (Run 205, Clone 1, Gen 31)


[02:08:22] + Attempting to send results [August 26 02:08:22 UTC]
[02:08:23] - Couldn't send HTTP request to server
[02:08:23] + Could not connect to Work Server (results)
[02:08:23]     (171.67.108.13:8080)
[02:08:23] + Retrying using alternative port
[02:08:25] - Couldn't send HTTP request to server
[02:08:25] + Could not connect to Work Server (results)
[02:08:25]     (171.67.108.13:80)
[02:08:25] - Error: Could not transmit unit 09 (completed August 26) to work server.


[02:08:25] + Attempting to send results [August 26 02:08:25 UTC]
[02:08:25] - Couldn't send HTTP request to server
[02:08:25]   (Got status 503)
[02:08:25] + Could not connect to Work Server (results)
[02:08:25]     (171.67.108.17:8080)
[02:08:25] + Retrying using alternative port
[02:08:25] - Couldn't send HTTP request to server
[02:08:25]   (Got status 503)
[02:08:25] + Could not connect to Work Server (results)
[02:08:25]     (171.67.108.17:80)
[02:08:25]   Could not transmit unit 09 to Collection server; keeping in queue.
[02:08:25] + Working...
[02:17:07] Writing local files
[02:17:07] Completed 5000 out of 500000 steps  (1%)
[02:36:34] Writing local files
[02:36:35] Completed 10000 out of 500000 steps  (2%)
[02:56:00] Writing local files
[02:56:00] Completed 15000 out of 500000 steps  (3%)
[03:15:21] Writing local files
[03:15:21] Completed 20000 out of 500000 steps  (4%)
[03:34:56] Writing local files
[03:34:56] Completed 25000 out of 500000 steps  (5%)
[03:54:17] Writing local files
[03:54:17] Completed 30000 out of 500000 steps  (6%)
[04:13:38] Writing local files
[04:13:38] Completed 35000 out of 500000 steps  (7%)
[04:32:57] Writing local files
[04:32:57] Completed 40000 out of 500000 steps  (8%)
[04:52:16] Writing local files
[04:52:16] Completed 45000 out of 500000 steps  (9%)
[05:11:39] Writing local files
[05:11:39] Completed 50000 out of 500000 steps  (10%)
[05:31:16] Writing local files
[05:31:16] Completed 55000 out of 500000 steps  (11%)
[05:50:37] Writing local files
[05:50:37] Completed 60000 out of 500000 steps  (12%)
[06:09:54] Writing local files
[06:09:54] Completed 65000 out of 500000 steps  (13%)
[06:29:07] Writing local files
[06:29:07] Completed 70000 out of 500000 steps  (14%)
[06:48:33] Writing local files
[06:48:33] Completed 75000 out of 500000 steps  (15%)

Re: WU completed but not sent, work started on another WU?

Posted: Wed Aug 26, 2009 7:41 am
by Abraham54
Apparently there was some mix-up with the logs.
After restarting my notebook, the log showed this:

Code:

[07:31:14] - Ask before connecting: No
[07:31:14] - User name: Abraham54 (Team 159388)
[07:31:14] - User ID: 619DF08A6D5D77
[07:31:14] - Machine ID: 1
[07:31:14] 
[07:31:14] Loaded queue successfully.
[07:31:14] Initialization complete
[07:31:14] 
[07:31:14] + Processing work unit
[07:31:14] Core required: FahCore_78.exe
[07:31:14] Project: 4461 (Run 205, Clone 1, Gen 31)
[07:31:14] Core found.


[07:31:14] + Attempting to send results [August 26 07:31:14 UTC]
[07:31:14] Working on queue slot 00 [August 26 07:31:14 UTC]
[07:31:14] + Working ...
[07:31:15] 
[07:31:15] *------------------------------*
[07:31:15] Folding@Home Gromacs Core
[07:31:15] Version 1.90 (March 8, 2006)
[07:31:15] 
[07:31:15] Preparing to commence simulation
[07:31:15] - Looking at optimizations...
[07:31:15] - Files status OK
[07:31:16] - Expanded 457269 -> 2243997 (decompressed 490.7 percent)
[07:31:16] 
[07:31:16] Project: 6301 (Run 377, Clone 3, Gen 26)
[07:31:16] 
[07:31:17] + Results successfully sent
[07:31:17] Thank you for your contribution to Folding@Home.
[07:31:17] + Number of Units Completed: 14

[07:31:18] Assembly optimizations on if available.
[07:31:18] Entering M.D.
[07:31:39] (Starting from checkpoint)
[07:31:39] Protein: p6301_sh3_with_ALA_frags
[07:31:39] 
[07:31:39] Writing local files
[07:31:40] Completed 85000 out of 500000 steps  (17%)
[07:31:40] Extra SSE boost OK.

Conclusion: there is no problem; there was no WU kept in reserve!
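For anyone who wants to double-check a log like the ones above, here is a rough Python sketch that lists the projects mentioned, counts the failed transmissions, and reports whether the results were eventually sent. The function name `summarize_log` and the exact strings it searches for are my own assumptions, taken only from the log lines quoted in this thread; this is not a tool that ships with the client.

```python
import re

# Matches lines such as "[01:57:18] Project: 4461 (Run 205, Clone 1, Gen 31)".
# The pattern is inferred from the log excerpts in this thread only.
PROJECT_RE = re.compile(
    r"^\[(\d\d:\d\d:\d\d)\] Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)"
)


def summarize_log(lines):
    """Return (projects, failed, sent) for an iterable of log lines.

    projects: distinct (project, run, clone, gen) tuples in order of first appearance
    failed:   number of "Could not transmit unit" lines
    sent:     True if a "Results successfully sent" line was seen
    """
    projects = []
    failed = 0
    sent = False
    for line in lines:
        match = PROJECT_RE.match(line.strip())
        if match:
            # Skip the timestamp group; keep project, run, clone, gen as ints.
            prcg = tuple(int(x) for x in match.groups()[1:])
            if prcg not in projects:
                projects.append(prcg)
        elif "Could not transmit unit" in line:
            failed += 1
        elif "Results successfully sent" in line:
            sent = True
    return projects, failed, sent
```

Usage would be something like `summarize_log(open("FAHlog.txt"))`, where the file name is a placeholder for wherever your client writes its log.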

Re: WU completed but not sent, work started on another WU?

Posted: Thu Aug 27, 2009 5:33 am
by bruce
This is a normal function of the client. When there is a connection problem, the client keeps the WU in queue and it will automatically try to upload it at a later time (at least every 6 hours). It will also try to upload it if the client is restarted, which is exactly what you did.

If you happen to have this problem again in the future, we recommend that you ignore it and let the automatic features handle it.
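
To illustrate the behaviour described in this reply, here is a minimal Python sketch of a "keep in queue and retry" policy. It is a simplified model and not the client's actual code: the class `PendingResults`, its methods, and the fixed 6-hour interval are assumptions made for illustration (the reply only says "at least every 6 hours", and the log above shows the client also retrying much sooner).

```python
from dataclasses import dataclass, field

# Assumed retry interval, based on "at least every 6 hours" in the reply above.
RETRY_INTERVAL_S = 6 * 60 * 60


@dataclass
class PendingResults:
    """Finished work units waiting for upload (hypothetical model)."""
    units: list = field(default_factory=list)
    last_attempt: float = float("-inf")  # no attempt made yet

    def due(self, now, restarted=False):
        # Retry when something is queued and either the client was just
        # restarted or the retry interval has elapsed since the last attempt.
        return bool(self.units) and (
            restarted or now - self.last_attempt >= RETRY_INTERVAL_S
        )

    def try_upload(self, send, now, restarted=False):
        """Call send(unit) for each queued unit if a retry is due.

        Units for which send() returns False stay in the queue.
        Returns the list of units that were uploaded.
        """
        if not self.due(now, restarted):
            return []
        self.last_attempt = now
        uploaded = [u for u in self.units if send(u)]
        self.units = [u for u in self.units if u not in uploaded]
        return uploaded
```

In this model, a failed upload leaves "unit 09" queued while new work proceeds, and a restart triggers an immediate retry, which matches what the second log shows.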