Hardware configuration: PC: AMD Sempron(tm) Processor 2800+, 1024 MB RAM, Microsoft Windows XP (Home Edition) SP3 Laptop: Mobile AMD Sempron(tm) Processor 3500+, 896 MB RAM, Microsoft Windows Vista (Home Premium)
[06:43:39] Completed 250000 out of 250000 steps (100%)
[06:43:39] Writing final coordinates.
[06:43:43] Past main M.D. loop
[06:44:43]
[06:44:43] Finished Work Unit:
[06:44:43] - Reading up to 2297592 from "work/wudata_09.arc": Read 2297592
[06:44:43] - Reading up to 440440 from "work/wudata_09.xtc": Read 440440
[06:44:43] goefile size: 0
[06:44:43] logfile size: 41851
[06:44:43] Leaving Run
[06:44:46] - Writing 2798059 bytes of core data to disk...
[06:44:46] ... Done.
[06:44:46] - Shutting down core
[06:44:46]
[06:44:46] Folding@home Core Shutdown: FINISHED_UNIT
[06:44:49] CoreStatus = 64 (100)
[06:44:49] Sending work to server
[06:44:49] Project: 2485 (Run 27, Clone 38, Gen 2)
[06:44:49] - Read packet limit of 540015616... Set to 524286976.
[06:44:49] + Attempting to send results [December 19 06:44:49 UTC]
[06:44:56] - Couldn't send HTTP request to server
[06:44:56] + Could not connect to Work Server (results)
[06:44:56] (171.65.103.162:8080)
[06:44:56] + Retrying using alternative port
[06:44:57] - Couldn't send HTTP request to server
[06:44:57] + Could not connect to Work Server (results)
[06:44:57] (171.65.103.162:80)
[06:44:57] - Error: Could not transmit unit 09 (completed December 19) to work server.
[06:44:57] Keeping unit 09 in queue.
[06:44:57] Project: 2485 (Run 27, Clone 38, Gen 2)
[06:44:57] - Read packet limit of 540015616... Set to 524286976.
[06:44:57] + Attempting to send results [December 19 06:44:57 UTC]
[06:44:59] - Couldn't send HTTP request to server
[06:44:59] + Could not connect to Work Server (results)
[06:44:59] (171.65.103.162:8080)
[06:44:59] + Retrying using alternative port
[06:45:00] - Couldn't send HTTP request to server
[06:45:00] + Could not connect to Work Server (results)
[06:45:00] (171.65.103.162:80)
[06:45:00] - Error: Could not transmit unit 09 (completed December 19) to work server.
[06:45:00] - Read packet limit of 540015616... Set to 524286976.
[06:45:00] + Attempting to send results [December 19 06:45:00 UTC]
[06:45:04] - Server does not have record of this unit. Will try again later.
[06:45:04] Could not transmit unit 09 to Collection server; keeping in queue.
[06:45:04] - Preparing to get new work unit...
[06:45:04] + Attempting to get work packet
[06:45:04] - Connecting to assignment server
[06:45:05] - Successful: assigned to (171.67.108.13).
[06:45:05] + News From Folding@Home: Welcome to Folding@Home
[06:45:05] Loaded queue successfully.
[06:45:07] Project: 2485 (Run 27, Clone 38, Gen 2)
[06:45:07] - Read packet limit of 540015616... Set to 524286976.
[06:45:07] + Attempting to send results [December 19 06:45:07 UTC]
[06:45:08] - Couldn't send HTTP request to server
[06:45:08] + Could not connect to Work Server (results)
[06:45:08] (171.65.103.162:8080)
[06:45:08] + Retrying using alternative port
[06:45:10] - Couldn't send HTTP request to server
[06:45:10] + Could not connect to Work Server (results)
[06:45:10] (171.65.103.162:80)
[06:45:10] - Error: Could not transmit unit 09 (completed December 19) to work server.
[06:45:10] - Read packet limit of 540015616... Set to 524286976.
[06:45:10] + Attempting to send results [December 19 06:45:10 UTC]
[06:45:14] - Server does not have record of this unit. Will try again later.
[06:45:14] Could not transmit unit 09 to Collection server; keeping in queue.
[06:45:14] + Closed connections
[06:45:14]
[06:45:14] + Processing work unit
It's tried to upload 3 times since then with no luck. Server status shows that's it's accepting. Could someone check this out please?
Hardware configuration: PC: AMD Sempron(tm) Processor 2800+, 1024 MB RAM, Microsoft Windows XP (Home Edition) SP3 Laptop: Mobile AMD Sempron(tm) Processor 3500+, 896 MB RAM, Microsoft Windows Vista (Home Premium)
[03:10:35] + Attempting to send results [December 20 03:10:35 UTC]
[03:10:38] + Results successfully sent
[03:10:38] Thank you for your contribution to Folding@Home.
[03:10:38] + Number of Units Completed: 108
All I had to do is wait 21 hours. Looks like whatever was causing the problem went away.
This can explain the 21 hours -- apparently the "5 minutes" was far from accurate early this morning (California time). Nothing was uploaded between 22:00 and 02:00 [06:00 and 10:00 GMT]. Since that time there seem to be lots of WUs uploading but you could hit a rough spot or two. Just let it keep trying for a day or so.
Continual problem in connecting to work server. Unable to upload work completed 26 December and 8 January. Computer works with all other internet access including BOINC projects. Log below:
--- Opening Log file [January 17 01:46:46]
# Windows Graphical Edition ###################################################
###############################################################################
Folding@Home Client Version 5.03
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Program Files\Folding@Home
[01:46:46] - Ask before connecting: Yes
[01:46:46] - Use IE connection settings: Yes
[01:46:46] - User name: Alan_Fowler (Team 0)
[01:46:46] - User ID: 6BE3C9C70B17C362
[01:46:46] - Machine ID: 1
[01:46:46]
[01:46:46] Loaded queue successfully.
[01:46:46] Initialization complete
[01:46:46] + Benchmarking ...
[01:46:49]
[01:46:49] + Processing work unit
[01:46:49] + Attempting to send results
[01:46:49] Core required: FahCore_78.exe
[01:46:49] Core found.
[01:46:49] Working on Unit 06 [January 17 01:46:49]
[01:46:49] + Working ...
[01:46:52] - Presenting message box asking to network.
[01:46:53]
[01:46:53] *------------------------------*
[01:46:53] Folding@Home Gromacs Core
[01:46:53] Version 1.90 (March 8, 2006)
[01:46:53]
[01:46:53] Preparing to commence simulation
[01:46:53] - Ensuring status. Please wait.
[01:47:10] - Looking at optimizations...
[01:47:10] - Working with standard loops on this execution.
[01:47:10] - Previous termination of core was improper.
[01:47:10] - Files status OK
[01:47:11] - Expanded 237692 -> 1168241 (decompressed 491.4 percent)
[01:47:12]
[01:47:12] Project: 4438 (Run 57, Clone 0, Gen 5)
[01:47:12]
[01:47:14] Entering M.D.
[01:47:37] (Starting from checkpoint)
[01:47:37] Protein: p4438_Seq41_Amber03
[01:47:37]
[01:47:38] Writing local files
[01:48:48] Error: Got status code 503 from server
[01:48:48] + Could not connect to Work Server (results)
[01:48:48] (171.65.103.162:8080)
[01:48:48] - Error: Could not transmit unit 03 (completed December 26) to work server.
[01:48:48] + Attempting to send results
[01:49:19] Couldn't send HTTP request to server (wininet)
[01:49:19] + Could not connect to Work Server (results)
[01:49:19] (171.65.103.100:8080)
[01:49:19] Could not transmit unit 03 to Collection server; keeping in queue.
[01:49:19] + Attempting to send results
[01:49:22] Error: Got status code 503 from server
[01:49:22] + Could not connect to Work Server (results)
[01:49:22] (171.65.103.162:8080)
[01:49:22] - Error: Could not transmit unit 04 (completed January 8) to work server.
[01:49:22] + Attempting to send results
[01:49:52] Couldn't send HTTP request to server (wininet)
[01:49:52] + Could not connect to Work Server (results)
[01:49:52] (171.65.103.100:8080)
[01:49:52] Could not transmit unit 04 to Collection server; keeping in queue.
[01:51:00] Completed 585000 out of 1500000 steps (39)
[03:24:49] Writing local files
[03:24:49] Completed 600000 out of 1500000 steps (40)
[05:01:07] Writing local files
[05:01:08] Completed 615000 out of 1500000 steps (41)
[15:57:06] + Attempting to send results
[15:57:49] Couldn't send HTTP request to server (wininet)
[15:57:49] + Could not connect to Work Server (results)
[15:57:49] (171.65.103.162:8080)
[15:57:49] - Error: Could not transmit unit 03 (completed December 26) to work server.
[15:57:49] + Attempting to send results
[15:58:19] Couldn't send HTTP request to server (wininet)
[15:58:19] + Could not connect to Work Server (results)
[15:58:19] (171.65.103.100:8080)
[15:58:19] Could not transmit unit 03 to Collection server; keeping in queue.
[15:58:19] + Attempting to send results
[15:58:50] Couldn't send HTTP request to server (wininet)
[15:58:50] + Could not connect to Work Server (results)
[15:58:50] (171.65.103.162:8080)
[15:58:50] - Error: Could not transmit unit 04 (completed January 8) to work server.
[15:58:50] + Attempting to send results
[15:59:20] Couldn't send HTTP request to server (wininet)
[15:59:20] + Could not connect to Work Server (results)
[15:59:20] (171.65.103.100:8080)
[15:59:20] Could not transmit unit 04 to Collection server; keeping in queue.
[17:20:17] Writing local files
[17:20:17] Completed 630000 out of 1500000 steps (42)
[18:56:52] Writing local files
[18:56:52] Completed 645000 out of 1500000 steps (43)
[20:33:15] Writing local files
[20:33:15] Completed 660000 out of 1500000 steps (44)
[21:59:20] + Attempting to send results
[21:59:51] Couldn't send HTTP request to server (wininet)
[21:59:51] + Could not connect to Work Server (results)
[21:59:51] (171.65.103.162:8080)
[21:59:51] - Error: Could not transmit unit 03 (completed December 26) to work server.
[21:59:51] + Attempting to send results
[22:00:21] Couldn't send HTTP request to server (wininet)
[22:00:21] + Could not connect to Work Server (results)
[22:00:21] (171.65.103.100:8080)
[22:00:21] Could not transmit unit 03 to Collection server; keeping in queue.
[22:00:21] + Attempting to send results
[22:00:22] Couldn't send HTTP request to server (wininet)
[22:00:22] + Could not connect to Work Server (results)
[22:00:22] (171.65.103.162:8080)
[22:00:22] - Error: Could not transmit unit 04 (completed January 8) to work server.
[22:00:22] + Attempting to send results
[22:00:52] Couldn't send HTTP request to server (wininet)
[22:00:52] + Could not connect to Work Server (results)
[22:00:52] (171.65.103.100:8080)
[22:00:52] Could not transmit unit 04 to Collection server; keeping in queue.
[22:06:03] Writing local files
[22:06:03] Completed 675000 out of 1500000 steps (45)
[23:42:54] Writing local files
[23:42:54] Completed 690000 out of 1500000 steps (46)
[01:10:32] Writing local files
[01:10:33] Completed 705000 out of 1500000 steps (47)
[02:41:01] Writing local files
[02:41:01] Completed 720000 out of 1500000 steps (48)
[04:00:52] + Attempting to send results
[04:01:23] Couldn't send HTTP request to server (wininet)
[04:01:23] + Could not connect to Work Server (results)
[04:01:23] (171.65.103.162:8080)
[04:01:23] - Error: Could not transmit unit 03 (completed December 26) to work server.
[04:01:23] + Attempting to send results
[04:01:53] Couldn't send HTTP request to server (wininet)
[04:01:53] + Could not connect to Work Server (results)
[04:01:53] (171.65.103.100:8080)
[04:01:53] Could not transmit unit 03 to Collection server; keeping in queue.
[04:01:53] + Attempting to send results
[04:01:54] Couldn't send HTTP request to server (wininet)
[04:01:54] + Could not connect to Work Server (results)
[04:01:54] (171.65.103.162:8080)
[04:01:54] - Error: Could not transmit unit 04 (completed January 8) to work server.
[04:01:54] + Attempting to send results
[04:02:24] Couldn't send HTTP request to server (wininet)
[04:02:24] + Could not connect to Work Server (results)
[04:02:24] (171.65.103.100:8080)
[04:02:24] Could not transmit unit 04 to Collection server; keeping in queue.
[04:11:17] Writing local files
[04:11:17] Completed 735000 out of 1500000 steps (49)
[05:39:57] Writing local files
[05:39:57] Completed 750000 out of 1500000 steps (50)
[07:14:39] Writing local files
[07:14:39] Completed 765000 out of 1500000 steps (51)
[08:41:37] Writing local files
[08:41:37] Completed 780000 out of 1500000 steps (52)
[10:02:24] + Attempting to send results
[10:02:24] Couldn't send HTTP request to server (wininet)
[10:02:24] + Could not connect to Work Server (results)
[10:02:24] (171.65.103.162:8080)
[10:02:24] - Error: Could not transmit unit 03 (completed December 26) to work server.
[10:02:24] + Attempting to send results
[10:02:55] Couldn't send HTTP request to server (wininet)
[10:02:55] + Could not connect to Work Server (results)
[10:02:55] (171.65.103.100:8080)
[10:02:55] Could not transmit unit 03 to Collection server; keeping in queue.
[10:02:55] + Attempting to send results
[10:02:55] Couldn't send HTTP request to server (wininet)
[10:02:55] + Could not connect to Work Server (results)
[10:02:55] (171.65.103.162:8080)
[10:02:55] - Error: Could not transmit unit 04 (completed January 8) to work server.
[10:02:55] + Attempting to send results
[10:03:26] Couldn't send HTTP request to server (wininet)
[10:03:26] + Could not connect to Work Server (results)
[10:03:26] (171.65.103.100:8080)
[10:03:26] Could not transmit unit 04 to Collection server; keeping in queue.
[10:13:36] Writing local files
[10:13:37] Completed 795000 out of 1500000 steps (53)
[11:42:56] Writing local files
[11:42:56] Completed 810000 out of 1500000 steps (54)
[13:12:48] Writing local files
[13:12:48] Completed 825000 out of 1500000 steps (55)
[14:39:41] Writing local files
[14:39:42] Completed 840000 out of 1500000 steps (56)
[16:03:26] + Attempting to send results
[16:03:26] Couldn't send HTTP request to server (wininet)
[16:03:26] + Could not connect to Work Server (results)
[16:03:26] (171.65.103.162:8080)
[16:03:26] - Error: Could not transmit unit 03 (completed December 26) to work server.
[16:03:26] + Attempting to send results
[16:03:56] Couldn't send HTTP request to server (wininet)
[16:03:56] + Could not connect to Work Server (results)
[16:03:56] (171.65.103.100:8080)
[16:03:56] Could not transmit unit 03 to Collection server; keeping in queue.
[16:03:56] + Attempting to send results
[16:04:27] Couldn't send HTTP request to server (wininet)
[16:04:27] + Could not connect to Work Server (results)
[16:04:27] (171.65.103.162:8080)
[16:04:27] - Error: Could not transmit unit 04 (completed January 8) to work server.
[16:04:27] + Attempting to send results
[16:04:57] Couldn't send HTTP request to server (wininet)
[16:04:57] + Could not connect to Work Server (results)
[16:04:57] (171.65.103.100:8080)
[16:04:57] Could not transmit unit 04 to Collection server; keeping in queue.
[16:16:41] Writing local files
[16:16:41] Completed 855000 out of 1500000 steps (57)
[17:44:35] Writing local files
[17:44:35] Completed 870000 out of 1500000 steps (58)
[19:17:06] Writing local files
[19:17:06] Completed 885000 out of 1500000 steps (59)
[20:41:23] Writing local files
[20:41:23] Completed 900000 out of 1500000 steps (60)
[22:00:19] Writing local files
[22:00:19] Completed 915000 out of 1500000 steps (61)
[22:04:57] + Attempting to send results
[22:04:58] Couldn't send HTTP request to server (wininet)
[22:04:58] + Could not connect to Work Server (results)
[22:04:58] (171.65.103.162:8080)
[22:04:58] - Error: Could not transmit unit 03 (completed December 26) to work server.
[22:04:58] + Attempting to send results
[22:05:28] Couldn't send HTTP request to server (wininet)
[22:05:28] + Could not connect to Work Server (results)
[22:05:28] (171.65.103.100:8080)
[22:05:28] Could not transmit unit 03 to Collection server; keeping in queue.
[22:05:28] + Attempting to send results
[22:05:28] Couldn't send HTTP request to server (wininet)
[22:05:28] + Could not connect to Work Server (results)
[22:05:28] (171.65.103.162:8080)
[22:05:28] - Error: Could not transmit unit 04 (completed January 8) to work server.
[22:05:28] + Attempting to send results
[22:05:59] Couldn't send HTTP request to server (wininet)
[22:05:59] + Could not connect to Work Server (results)
[22:05:59] (171.65.103.100:8080)
[22:05:59] Could not transmit unit 04 to Collection server; keeping in queue.
[23:26:42] Writing local files
[23:26:42] Completed 930000 out of 1500000 steps (62)
[00:50:18] Writing local files
[00:50:18] Completed 945000 out of 1500000 steps (63)
[02:19:09] Writing local files
[02:19:10] Completed 960000 out of 1500000 steps (64)
[03:40:28] Writing local files
[03:40:28] Completed 975000 out of 1500000 steps (65)
[04:05:59] + Attempting to send results
[04:06:30] Couldn't send HTTP request to server (wininet)
[04:06:30] + Could not connect to Work Server (results)
[04:06:30] (171.65.103.162:8080)
[04:06:30] - Error: Could not transmit unit 03 (completed December 26) to work server.
[04:06:30] + Attempting to send results
[04:07:00] Couldn't send HTTP request to server (wininet)
[04:07:00] + Could not connect to Work Server (results)
[04:07:00] (171.65.103.100:8080)
[04:07:00] Could not transmit unit 03 to Collection server; keeping in queue.
[04:07:00] + Attempting to send results
[04:07:31] Couldn't send HTTP request to server (wininet)
[04:07:31] + Could not connect to Work Server (results)
[04:07:31] (171.65.103.162:8080)
[04:07:31] - Error: Could not transmit unit 04 (completed January 8) to work server.
[04:07:31] + Attempting to send results
[04:08:01] Couldn't send HTTP request to server (wininet)
[04:08:01] + Could not connect to Work Server (results)
[04:08:01] (171.65.103.100:8080)
[04:08:01] Could not transmit unit 04 to Collection server; keeping in queue.
[05:06:04] Writing local files
[05:06:04] Completed 990000 out of 1500000 steps (66)
[15:09:32] + Attempting to send results
[15:10:14] Couldn't send HTTP request to server (wininet)
[15:10:14] + Could not connect to Work Server (results)
[15:10:14] (171.65.103.162:8080)
[15:10:14] - Error: Could not transmit unit 03 (completed December 26) to work server.
[15:10:14] + Attempting to send results
[15:10:45] Couldn't send HTTP request to server (wininet)
[15:10:45] + Could not connect to Work Server (results)
[15:10:45] (171.65.103.100:8080)
[15:10:45] Could not transmit unit 03 to Collection server; keeping in queue.
[15:10:45] + Attempting to send results
[15:11:15] Couldn't send HTTP request to server (wininet)
[15:11:15] + Could not connect to Work Server (results)
[15:11:15] (171.65.103.162:8080)
[15:11:15] - Error: Could not transmit unit 04 (completed January 8) to work server.
[15:11:15] + Attempting to send results
[15:11:46] Couldn't send HTTP request to server (wininet)
[15:11:46] + Could not connect to Work Server (results)
[15:11:46] (171.65.103.100:8080)
[15:11:46] Could not transmit unit 04 to Collection server; keeping in queue.
[15:57:31] Writing local files
[15:57:32] Completed 1005000 out of 1500000 steps (67)
[17:23:11] Writing local files
[17:23:11] Completed 1020000 out of 1500000 steps (68)
[18:37:59] Writing local files
[18:37:59] Completed 1035000 out of 1500000 steps (69)
[20:09:01] Writing local files
[20:09:01] Completed 1050000 out of 1500000 steps (70)
[21:11:46] + Attempting to send results
[21:12:16] Couldn't send HTTP request to server (wininet)
[21:12:16] + Could not connect to Work Server (results)
[21:12:16] (171.65.103.162:8080)
[21:12:16] - Error: Could not transmit unit 03 (completed December 26) to work server.
[21:12:16] + Attempting to send results
[21:12:47] Couldn't send HTTP request to server (wininet)
[21:12:47] + Could not connect to Work Server (results)
[21:12:47] (171.65.103.100:8080)
[21:12:47] Could not transmit unit 03 to Collection server; keeping in queue.
[21:12:47] + Attempting to send results
[21:13:18] Couldn't send HTTP request to server (wininet)
[21:13:18] + Could not connect to Work Server (results)
[21:13:18] (171.65.103.162:8080)
[21:13:18] - Error: Could not transmit unit 04 (completed January 8) to work server.
[21:13:18] + Attempting to send results
[21:13:18] Couldn't send HTTP request to server (wininet)
[21:13:18] + Could not connect to Work Server (results)
[21:13:18] (171.65.103.100:8080)
[21:13:18] Could not transmit unit 04 to Collection server; keeping in queue.
[21:25:59] Opening C:\Program Files\Folding@Home\MyFolding.html...
[21:37:40] Writing local files
[21:37:40] Completed 1065000 out of 1500000 steps (71)
[23:11:12] Writing local files
[23:11:12] Completed 1080000 out of 1500000 steps (72)
The problem is with Use IE connection settings: Yes. If you have installed the Windows security patches from the last couple years, that feature was broken by one of those patches. If you're NOT behind a proxy, turn that off. If you are behind a proxy, you can try turning it off but that may lead to other problems.
Hi Alan_Fowler (team 0),
Your WU (P4438 R57 C0 G5) was added to the stats database on 2009-01-21 15:38:22 for 225 points of credit.
For me to search for the other WUs, I need you to provide the Project, Run, Clone, and Gen numbers.
I expect that they have been uploaded many, many times, but when the server sends the client a message saying it was successfully received, Internet Explorer intercepts that message and it never gets back to the client so the client assumes there was an error and saves the WU to be uploaded agan, with the same results.
I don't know if this is of importance here but I had problems sending to 103.100
[13:16:14] + Could not connect to Work Server (results)
[13:16:14] (128.59.74.4:8080)
[13:16:14] - Error: Could not transmit unit 02 (completed February 17) to work server.
[13:16:14] + Attempting to send results
[13:16:14] - Expanded 220812 -> 615593 (decompressed 278.7 percent)
[13:16:14]
[13:16:14] Project: 3859 (Run 600, Clone 0, Gen 11)
[13:16:14]
[13:16:14] Assembly optimizations on if available.
[13:16:14] Entering M.D.
[13:16:14] Error: Got status code 503 from server
[13:16:14] + Could not connect to Work Server (results)
[13:16:14] (171.65.103.100:8080)
[13:16:14] Could not transmit unit 02 to Collection server; keeping in queue.
[13:16:20] Will resume from checkpoint file
- but realised, from this thread ,that the server was receiving data but not aknowledging reciept.
The data WAS going out however, as I watched on the old Sygate 5.5 send stream monitor ,( I like that free baby) 2.2 Mb approx go out a number of times on restarting the client but never receiving a final handshake on data sent.
I re-configured the client to use IE settings as a trial....and Yes! it sent out the double amount I'd seen before - on other units ,with similar problems, 5.5 MB.
Still not accepted correctly though.
Now the tricky bit...when this double amount has gone out before the unit has deleted itself and WU lost (as far as I know) so when I reset config back to' NOT IE' settings -I was pleased to see the unit go straight up and get read as uploaded. But...
NB: 2.7MB & the thus the previous data was not completing fully!
[13:21:17] Working on Unit 03 [February 17 13:21:17]
[13:21:17] + Working ...
[13:21:17]
[13:21:17] *------------------------------*
[13:21:17] Folding@Home Double Gromacs Core C
[13:21:17] Version 1.00 (Thu Apr 24 19:12:09 PDT 2008)
[13:21:17]
[13:21:17] Preparing to commence simulation
[13:21:17] - Files status OK
[13:21:18] - Expanded 220812 -> 615593 (decompressed 278.7 percent)
[13:21:18]
[13:21:18] Project: 3859 (Run 600, Clone 0, Gen 11)
[13:21:18]
[13:21:18] Assembly optimizations on if available.
[13:21:18] Entering M.D.
[13:21:24] Will resume from checkpoint file
[13:21:24] Working on p3850_fkbprelative_ligand
[13:21:24] Completed 0 out of 1000000 steps (0)
[13:21:26] Resuming from checkpoint
[13:21:26] Verified work/wudata_03.log
[13:21:26] Verified work/wudata_03.edr
[13:21:26] Verified work/wudata_03.xvg
[13:21:26] Verified work/wudata_03.trr
[13:21:26] Verified work/wudata_03.xtc
[13:21:26] Completed 3300 out of 1000000 steps (0)
[13:22:42] + Results successfully sent
[13:22:42] Thank you for your contribution to Folding@Home.
Whooo...This IS Fun!!
I've lost so many units lately after many weeks of work this is an area that needs a review IMO.
Many can't see the data trying to go and will not realise that it is trying ,or check the server- AOK in this case.
Even so, with this background knowledge, it's far from obvious what's happening.
Did the IE switch clear flags somewhere?
Are uploads being terminated early?
What does the double size data stream mean on the same size WU being sent?
For some reason this server assigned the exact same project to two of the clients running on my system. I have a total of 6 Windows CPU clients running and 1 GPU Client and all of them have previously finished at least one WU so far. Are the servers supposed to hand out identical projects this quickly if at all? Here are the partial logs from each of the clients. They look nearly identical except I gave them separate machine IDs. And by identical projects I mean "Project: 2484 (Run 48, Clone 23, Gen 6)" and "Project: 2484 (Run 48, Clone 23, Gen 6)" are the two projects I received. They did both start within about a minute of each other as shown in the logs below. This seems like a very poor use of resources if identical projects can be handed out this quickly, especially to the same user.
I'm not sure this will apply but here are my system specs:
OS: Windows 7 x64
Memory: 6GB DDR3
Processor: Intel Core i7
GPU: ATI Radeon HD 3870
Anarchist4000 wrote:For some reason this server assigned the exact same project to two of the clients running on my system. I have a total of 6 Windows CPU clients running and 1 GPU Client and all of them have previously finished at least one WU so far. Are the servers supposed to hand out identical projects this quickly if at all? Here are the partial logs from each of the clients. They look nearly identical except I gave them separate machine IDs. And by identical projects I mean "Project: 2484 (Run 48, Clone 23, Gen 6)" and "Project: 2484 (Run 48, Clone 23, Gen 6)" are the two projects I received. They did both start within about a minute of each other as shown in the logs below. This seems like a very poor use of resources if identical projects can be handed out this quickly, especially to the same user.
I've seen a (very) few similar reports. I suspect that there is a race condition that does occasionally issue the same WU to more than one client, particularly when the servers are very busy. My hunch is that you'll get credit for both of them even though they happened to go to the same person so the "especially to the same user" really doesn't apply. (Any Mod can check for you once they've both been returned.)
The server code has been recently rewritten and it's currently being tested. It will include many fixes and hopefully this is one of them.
13:57:33] Finished Work Unit:
[13:57:33] - Reading up to 2294200 from "work/wudata_02.trr": Read 2294200
[13:57:33] - Reading up to 121124 from "work/wudata_02.xtc": Read 121124
[13:57:33] xvg file size: 183025
[13:57:33] logfile size: 75935
[13:57:33] Leaving Run
[13:57:37] - Writing 2760624 bytes of core data to disk...
[13:57:37] ... Done.
[13:57:37] - Shutting down core
[13:57:37]
[13:57:37] Folding@home Core Shutdown: FINISHED_UNIT
[13:57:41] CoreStatus = 64 (100)
[13:57:41] Sending work to server
[13:57:41] Project: 3855 (Run 628, Clone 8, Gen 27)
[13:57:41] + Attempting to send results [February 24 13:57:41 UTC]
[13:58:02] - Couldn't send HTTP request to server
[13:58:02] + Could not connect to Work Server (results)
[13:58:02] (128.59.74.4:8080)
[13:58:02] + Retrying using alternative port
[13:58:23] - Couldn't send HTTP request to server
[13:58:23] + Could not connect to Work Server (results)
[13:58:23] (128.59.74.4:80)
[13:58:23] - Error: Could not transmit unit 02 (completed February 24) to work server.
[13:58:23] Keeping unit 02 in queue.
[13:58:23] Project: 3855 (Run 628, Clone 8, Gen 27)
[13:58:23] + Attempting to send results [February 24 13:58:23 UTC]
[13:58:44] - Couldn't send HTTP request to server
[13:58:44] + Could not connect to Work Server (results)
[13:58:44] (128.59.74.4:8080)
[13:58:44] + Retrying using alternative port
[13:59:05] - Couldn't send HTTP request to server
[13:59:05] + Could not connect to Work Server (results)
[13:59:05] (128.59.74.4:80)
[13:59:05] - Error: Could not transmit unit 02 (completed February 24) to work server.
[13:59:05] + Attempting to send results [February 24 13:59:05 UTC]
[13:59:06] - Couldn't send HTTP request to server
[13:59:06] + Could not connect to Work Server (results)
[13:59:06] (171.65.103.100:8080)
[13:59:06] + Retrying using alternative port
[13:59:15] - Couldn't send HTTP request to server
[13:59:15] + Could not connect to Work Server (results)
[13:59:15] (171.65.103.100:80)
[13:59:15] Could not transmit unit 02 to Collection server; keeping in queue.
[13:59:15] - Preparing to get new work unit...
[13:59:15] + Attempting to get work packet
[13:59:15] - Connecting to assignment server
[13:59:16] - Successful: assigned to (171.67.108.13).
[13:59:16] + News From Folding@Home: Welcome to Folding@Home
[13:59:16] Loaded queue successfully.
[13:59:16] + Could not connect to Work Server
[13:59:16] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[13:59:23] + Attempting to get work packet
[13:59:23] - Connecting to assignment server
[13:59:23] - Successful: assigned to (171.67.108.13).
[13:59:23] + News From Folding@Home: Welcome to Folding@Home
[13:59:23] Loaded queue successfully.
[13:59:24] + Could not connect to Work Server
[13:59:24] - Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[13:59:49] + Attempting to get work packet
[13:59:49] - Connecting to assignment server
[13:59:49] - Successful: assigned to (171.67.108.13).
[13:59:49] + News From Folding@Home: Welcome to Folding@Home
[13:59:49] Loaded queue successfully.
[13:59:50] + Could not connect to Work Server
[13:59:50] - Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[14:00:10] + Attempting to get work packet
[14:00:10] - Connecting to assignment server
[14:00:11] - Successful: assigned to (171.67.108.13).
[14:00:11] + News From Folding@Home: Welcome to Folding@Home
[14:00:11] Loaded queue successfully.
[14:00:11] + Could not connect to Work Server
[14:00:11] - Attempt #4 to get work failed, and no other work to do.
Waiting before retry.
[14:00:56] + Attempting to get work packet
[14:00:56] - Connecting to assignment server
[14:00:57] - Successful: assigned to (128.143.48.226).
[14:00:57] + News From Folding@Home: Welcome to Folding@Home
[14:00:57] Loaded queue successfully.
[14:00:58] Project: 3855 (Run 628, Clone 8, Gen 27)
[14:00:58] + Attempting to send results [February 24 14:00:58 UTC]
[14:01:19] - Couldn't send HTTP request to server
[14:01:19] + Could not connect to Work Server (results)
[14:01:19] (128.59.74.4:8080)
[14:01:19] + Retrying using alternative port
[14:01:40] - Couldn't send HTTP request to server
[14:01:40] + Could not connect to Work Server (results)
[14:01:40] (128.59.74.4:80)
[14:01:40] - Error: Could not transmit unit 02 (completed February 24) to work server.
[14:01:40] + Attempting to send results [February 24 14:01:40 UTC]
[14:04:49] - Couldn't send HTTP request to server
[14:04:49] + Could not connect to Work Server (results)
[14:04:49] (171.65.103.100:8080)
[14:04:49] + Retrying using alternative port
[14:04:58] - Couldn't send HTTP request to server
[14:04:58] + Could not connect to Work Server (results)
[14:04:58] (171.65.103.100:80)
[14:04:58] Could not transmit unit 02 to Collection server; keeping in queue.
[14:04:58] + Closed connections
[14:04:58]
[14:04:58] + Processing work unit
[14:04:58] Core required: FahCore_7c.exe
[14:04:58] Core found.
[14:04:58] Working on queue slot 03 [February 24 14:04:58 UTC]
[14:04:58] + Working ...
[14:04:59]
[14:04:59] *------------------------------*
[14:04:59] Folding@Home Double Gromacs Core C
[14:04:59] Version 1.00 (Thu Apr 24 19:12:09 PDT 2008)
[14:04:59]
[14:04:59] Preparing to commence simulation
[14:04:59] - Files status OK
[14:04:59] - Expanded 113583 -> 641836 (decompressed 565.0 percent)
[14:04:59]
[14:04:59] Project: 3863 (Run 181, Clone 5, Gen 0)
[14:04:59]
[14:04:59] Assembly optimizations on if available.
[14:04:59] Entering M.D.
[14:05:05] Working on p3863_fkbprelative_ligand
[14:05:05] Completed 0 out of 1500000 steps (0%)
[14:05:05] Extra SSE2 boost OK
[14:20:06] Timer requesting checkpoint
[14:28:27] Completed 15000 out of 1500000 steps (1%)
[14:43:28] Timer requesting checkpoint
[14:52:20] Completed 30000 out of 1500000 steps (2%)
[15:07:20] Timer requesting checkpoint
[15:18:00] Completed 45000 out of 1500000 steps (3%)
[15:33:02] Timer requesting checkpoint
[15:44:12] Completed 60000 out of 1500000 steps (4%)
[15:59:12] Timer requesting checkpoint
[16:08:43] Completed 75000 out of 1500000 steps (5%)
[16:23:43] Timer requesting checkpoint
[16:32:55] Completed 90000 out of 1500000 steps (6%)
[16:47:56] Timer requesting checkpoint
[16:57:16] Completed 105000 out of 1500000 steps (7%)
[17:12:16] Timer requesting checkpoint
[17:21:56] Completed 120000 out of 1500000 steps (8%)
[17:36:56] Timer requesting checkpoint
[17:46:27] Completed 135000 out of 1500000 steps (9%)
[18:01:29] Timer requesting checkpoint
[18:10:54] Completed 150000 out of 1500000 steps (10%)
[18:25:55] Timer requesting checkpoint
[18:34:59] Completed 165000 out of 1500000 steps (11%)
[18:42:57] Project: 3855 (Run 628, Clone 8, Gen 27)
[18:42:57] + Attempting to send results [February 24 18:42:57 UTC]
[18:43:19] - Couldn't send HTTP request to server
[18:43:19] + Could not connect to Work Server (results)
[18:43:19] (128.59.74.4:8080)
[18:43:19] + Retrying using alternative port
[18:43:40] - Couldn't send HTTP request to server
[18:43:40] + Could not connect to Work Server (results)
[18:43:40] (128.59.74.4:80)
[18:43:40] - Error: Could not transmit unit 02 (completed February 24) to work server.
[18:43:40] + Attempting to send results [February 24 18:43:40 UTC]
[18:43:40] - Couldn't send HTTP request to server
[18:43:40] + Could not connect to Work Server (results)
[18:43:40] (171.65.103.100:8080)
[18:43:40] + Retrying using alternative port
[18:46:49] - Couldn't send HTTP request to server
[18:46:49] + Could not connect to Work Server (results)
[18:46:49] (171.65.103.100:80)
[18:46:49] Could not transmit unit 02 to Collection server; keeping in queue.
[18:46:49] + Working...
[18:50:00] Timer requesting checkpoint
[19:00:24] Completed 180000 out of 1500000 steps (12%)
[19:15:26] Timer requesting checkpoint
[19:24:35] Completed 195000 out of 1500000 steps (13%)
[19:39:37] Timer requesting checkpoint
[19:49:11] Completed 210000 out of 1500000 steps (14%)
[20:04:12] Timer requesting checkpoint
[20:16:44] Completed 225000 out of 1500000 steps (15%)
[20:31:46] Timer requesting checkpoint
[20:44:32] Completed 240000 out of 1500000 steps (16%)
[20:59:33] Timer requesting checkpoint
[21:11:55] Completed 255000 out of 1500000 steps (17%)
[21:26:57] Timer requesting checkpoint
[21:38:24] Completed 270000 out of 1500000 steps (18%)
[21:53:25] Timer requesting checkpoint
[22:04:03] Completed 285000 out of 1500000 steps (19%)
[22:19:04] Timer requesting checkpoint
[22:31:16] Completed 300000 out of 1500000 steps (20%)
[22:46:17] Timer requesting checkpoint
[22:57:09] Completed 315000 out of 1500000 steps (21%)
[23:12:09] Timer requesting checkpoint
[23:19:08] Completed 330000 out of 1500000 steps (22%)
[23:34:08] Timer requesting checkpoint
[23:41:13] Completed 345000 out of 1500000 steps (23%)
[23:56:14] Timer requesting checkpoint
[00:03:18] Completed 360000 out of 1500000 steps (24%)
[00:18:19] Timer requesting checkpoint
[00:25:29] Completed 375000 out of 1500000 steps (25%)
[00:40:30] Timer requesting checkpoint
[00:41:24] + Paused
[00:42:51] + Working ...
[00:42:51] Suspending work thread...
[00:42:51] Resuming work thread...
[00:43:02] Printing Queue Information
Current Queue:
Slot 00 Empty/Deleted
Project: 3859 (Run 11597, Clone 0, Gen 8), Core: 7c
Work server: 128.59.74.4:8080
Collection server: 171.65.103.100
Download date: February 20 13:49:53
Finished date: February 21 19:42:19
Failed uploads: 1
Slot 01 Empty/Deleted
Project: 3858 (Run 10562, Clone 0, Gen 28), Core: 7c
Work server: 128.59.74.4:8080
Collection server: 171.65.103.100
Download date: February 21 19:53:09
Finished date: February 23 02:02:23
Failed uploads: 2
Slot 02 Done
Project: 3855 (Run 628, Clone 8, Gen 27), Core: 7c
Work server: 128.59.74.4:8080
Collection server: 171.65.103.100
Download date: February 23 02:49:49
Finished date: February 24 13:57:41
Failed uploads: 4
Slot 03 *Ready
Project: 3863 (Run 181, Clone 5, Gen 0), Core: 7c
Work server: 128.143.48.226:8080
Collection server: 128.143.48.227
Download date: February 24 14:00:58
Deadline date: April 25 14:00:58
PF: 0.965870 based on last 4 slot(s)
[00:46:49] Project: 3855 (Run 628, Clone 8, Gen 27)
[00:46:49] + Attempting to send results [February 25 00:46:49 UTC]
[00:47:11] - Couldn't send HTTP request to server
[00:47:11] + Could not connect to Work Server (results)
[00:47:11] (128.59.74.4:8080)
[00:47:11] + Retrying using alternative port
[00:47:32] - Couldn't send HTTP request to server
[00:47:32] + Could not connect to Work Server (results)
[00:47:32] (128.59.74.4:80)
[00:47:32] - Error: Could not transmit unit 02 (completed February 24) to work server.
[00:47:32] + Attempting to send results [February 25 00:47:32 UTC]
[00:47:41] - Couldn't send HTTP request to server
[00:47:41] + Could not connect to Work Server (results)
[00:47:41] (171.65.103.100:8080)
[00:47:41] + Retrying using alternative port
[00:47:51] - Couldn't send HTTP request to server
[00:47:51] + Could not connect to Work Server (results)
[00:47:51] (171.65.103.100:80)
[00:47:51] Could not transmit unit 02 to Collection server; keeping in queue.
[00:47:51] + Working...
Have there been problems today with 171.65.103.100. Or is it me? My last two WUs (Slots 0 & 1) uploaded eventually after a couple of failures. But slot 2 is sure seems to be taking its sweet time. Still, I'm concerned it might be something on my end since there hasn't been a lot of talk about this server in a while.
DrBB1 wrote:
Have there been problems today with 171.65.103.100. Or is it me? My last two WUs (Slots 0 & 1) uploaded eventually after a couple of failures. But slot 2 is sure seems to be taking its sweet time. Still, I'm concerned it might be something on my end since there hasn't been a lot of talk about this server in a while.
[20:57:13] + Attempting to send results [February 24 20:57:13 UTC]
[20:57:34] - Couldn't send HTTP request to server
[20:57:34] + Could not connect to Work Server (results)
[20:57:34] (128.59.74.4:8080)
[20:57:34] + Retrying using alternative port
[20:57:55] - Couldn't send HTTP request to server
[20:57:55] + Could not connect to Work Server (results)
[20:57:55] (128.59.74.4:80)
[20:57:55] - Error: Could not transmit unit 09 (completed February 24) to work server.
[20:57:55] + Attempting to send results [February 24 20:57:55 UTC]
[20:57:55] - Couldn't send HTTP request to server
[20:57:55] + Could not connect to Work Server (results)
[20:57:55] (171.65.103.100:8080)
[20:57:55] + Retrying using alternative port
[20:57:55] - Couldn't send HTTP request to server
[20:57:55] (Got status 503)
[20:57:55] + Could not connect to Work Server (results)
[20:57:55] (171.65.103.100:80)
[20:57:55] Could not transmit unit 09 to Collection server; keeping in queue.
[20:57:55] + Closed connections
But what about 171.65.103.100...? It's been over a day and I am up to 8+ failed uploads.
toTOW wrote:Re: 171.65.103.162/171.65.103.100
Post by toTOW on Wed Feb 25, 2009 8:16 am
Please see announcement about 128.59.74.4: viewtopic.php?f=24&t=8601
Please see announcement about 128.59.74.4 : viewtopic.php?f=24&t=8601
bruce wrote:See my comment on all of the collection servers here which applies specifically to 171.65.103.100 and its friends.
Thanks Bruce. At least it's not me this time. Am I correct in inferring that it's simply a matter of time before the upload is accepted? Pardon me if the answer is obvious--it's been a long day....
Yes, it's simply a matter of time (and whatever statistics are associated with somebody finishing their upload just as your client tries to start an upload). There's nothing you can do about it except be patient.