Page 3 of 5
Re: 171.67.108.21 is Reject;
Posted: Wed Feb 17, 2010 10:52 pm
by Ragnar Dan
I've got another WU - Project: 5785 (Run 9, Clone 85, Gen 23) - that has failed to upload to this server 4 times in the last ~2 hours and 20 minutes. What is exasperating about it, though, is this waste of the GPU:
Code: Select all
[20:18:42] Folding@home Core Shutdown: FINISHED_UNIT
[20:18:46] CoreStatus = 64 (100)
[20:18:46] Unit 4 finished with 99 percent of time to deadline remaining.
[20:18:46] Updated performance fraction: 0.985027
[20:18:46] Sending work to server
[20:18:46] Project: 5785 (Run 9, Clone 85, Gen 23)
[20:18:46] - Read packet limit of 540015616... Set to 524286976.
[20:18:46] + Attempting to send results [February 17 20:18:46 UTC]
[20:18:46] - Reading file work/wuresults_04.dat from core
[20:18:46] (Read 167488 bytes from disk)
[20:18:46] Connecting to http://171.67.108.21:8080/
[20:18:47] - Couldn't send HTTP request to server
[20:18:47] + Could not connect to Work Server (results)
[20:18:47] (171.67.108.21:8080)
[20:18:47] + Retrying using alternative port
[20:18:47] Connecting to http://171.67.108.21:80/
[20:19:08] - Couldn't send HTTP request to server
[20:19:08] + Could not connect to Work Server (results)
[20:19:08] (171.67.108.21:80)
[20:19:08] - Error: Could not transmit unit 04 (completed February 17) to work server.
[20:19:08] - 1 failed uploads of this unit.
[20:19:08] Keeping unit 04 in queue.
[20:19:08] Trying to send all finished work units
[20:19:08] Project: 5785 (Run 9, Clone 85, Gen 23)
[20:19:08] - Read packet limit of 540015616... Set to 524286976.
[20:19:08] + Attempting to send results [February 17 20:19:08 UTC]
[20:19:08] - Reading file work/wuresults_04.dat from core
[20:19:08] (Read 167488 bytes from disk)
[20:19:08] Connecting to http://171.67.108.21:8080/
[20:19:10] - Couldn't send HTTP request to server
[20:19:10] + Could not connect to Work Server (results)
[20:19:10] (171.67.108.21:8080)
[20:19:10] + Retrying using alternative port
[20:19:10] Connecting to http://171.67.108.21:80/
[20:19:31] - Couldn't send HTTP request to server
[20:19:31] + Could not connect to Work Server (results)
[20:19:31] (171.67.108.21:80)
[20:19:31] - Error: Could not transmit unit 04 (completed February 17) to work server.
[20:19:31] - 2 failed uploads of this unit.
[20:19:31] - Read packet limit of 540015616... Set to 524286976.
[20:19:31] + Attempting to send results [February 17 20:19:31 UTC]
[20:19:31] - Reading file work/wuresults_04.dat from core
[20:19:31] (Read 167488 bytes from disk)
[20:19:31] Connecting to http://171.67.108.26:8080/
[20:29:43] Posted data.
[20:49:15] - Autosending finished units... [February 17 20:49:15 UTC]
[20:49:15] Trying to send all finished work units
[20:49:15] - Already sending work
[20:49:15] + Sent 0 of 1 completed units to the server
[20:49:15] - Autosend completed
[20:49:43] Initial: 001A; + Could not connect to Work Server (results)
[21:00:18] (171.67.108.26:8080)
[21:00:18] + Retrying using alternative port
[21:00:18] Connecting to http://171.67.108.26:80/
[21:00:18] - Couldn't send HTTP request to server
[21:00:18] + Could not connect to Work Server (results)
[21:00:18] (171.67.108.26:80)
[21:00:18] Could not transmit unit 04 to Collection server; keeping in queue.
[21:00:18] + Sent 0 of 1 completed units to the server
[21:00:18] - Preparing to get new work unit...
[21:00:18] + Attempting to get work packet
[21:00:18] - Will indicate memory of 256 MB
[21:00:18] - Connecting to assignment server
[21:00:18] Connecting to http://assign-GPU.stanford.edu:8080/
[21:00:19] Posted data.
[21:00:19] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[21:00:19] + News From Folding@Home: Welcome to Folding@Home
[21:00:19] Loaded queue successfully.
[21:00:19] Connecting to http://171.67.108.11:8080/
[21:00:19] Posted data.
[21:00:19] Initial: 0000; - Receiving payload (expected size: 45866)
[21:00:20] - Downloaded at ~44 kB/s
[21:00:20] - Averaged speed for that direction ~65 kB/s
[21:00:20] + Received work.
[21:00:20] Trying to send all finished work units
[21:00:20] Project: 5785 (Run 9, Clone 85, Gen 23)
[21:00:20] - Read packet limit of 540015616... Set to 524286976.
[21:00:20] + Attempting to send results [February 17 21:00:20 UTC]
[21:00:20] - Reading file work/wuresults_04.dat from core
[21:00:20] (Read 167488 bytes from disk)
[21:00:20] Connecting to http://171.67.108.21:8080/
[21:00:21] - Couldn't send HTTP request to server
[21:00:21] + Could not connect to Work Server (results)
[21:00:21] (171.67.108.21:8080)
[21:00:21] + Retrying using alternative port
[21:00:21] Connecting to http://171.67.108.21:80/
[21:00:42] - Couldn't send HTTP request to server
[21:00:42] + Could not connect to Work Server (results)
[21:00:42] (171.67.108.21:80)
[21:00:42] - Error: Could not transmit unit 04 (completed February 17) to work server.
[21:00:42] - 3 failed uploads of this unit.
[21:00:42] - Read packet limit of 540015616... Set to 524286976.
It tries less frequently than the SMP clients. Even though it could have been folding and completed most of a WU in the wasted 40+ minutes it took. Then it got a WU, and uploaded its results but got the "expected ACK for result" error, and then waited 10 more minutes for a new WU.
There is need for a change here, and quickly.
Re: 171.67.108.21 is Reject;
Posted: Thu Feb 18, 2010 4:04 am
by Ragnar Dan
Same WU, all this time later, except this time it uploaded and was rejected because the "Server does not have record of this unit.":
Code: Select all
[03:55:55] + Attempting to send results [February 18 03:55:55 UTC]
[03:55:55] - Reading file work/wuresults_04.dat from core
[03:55:55] (Read 167488 bytes from disk)
[03:55:55] Connecting to http://171.67.108.21:8080/
[03:55:56] - Couldn't send HTTP request to server
[03:55:56] + Could not connect to Work Server (results)
[03:55:56] (171.67.108.21:8080)
[03:55:56] + Retrying using alternative port
[03:55:56] Connecting to http://171.67.108.21:80/
[03:56:17] - Couldn't send HTTP request to server
[03:56:17] + Could not connect to Work Server (results)
[03:56:17] (171.67.108.21:80)
[03:56:17] - Error: Could not transmit unit 04 (completed February 17) to work server.
[03:56:17] - 10 failed uploads of this unit.
[03:56:17] - Read packet limit of 540015616... Set to 524286976.
[03:56:17] + Attempting to send results [February 18 03:56:17 UTC]
[03:56:17] - Reading file work/wuresults_04.dat from core
[03:56:17] (Read 167488 bytes from disk)
[03:56:17] Connecting to http://171.67.108.26:8080/
[03:56:25] Posted data.
[03:56:25] Initial: 0000; - Uploaded at ~20 kB/s
[03:56:25] - Averaged speed for that direction ~64 kB/s
[03:56:25] - Server does not have record of this unit. Will try again later.
[03:56:25] Could not transmit unit 04 to Collection server; keeping in queue.
[03:56:25] + Sent 0 of 1 completed units to the server
[03:56:25] + Closed connections
Re: 171.67.108.21 is Reject;
Posted: Thu Feb 18, 2010 12:12 pm
by noorman
.
I remember this happening in the past too; it 's about tha miscommunication (or lack of communication) between certain servers so that the receiving server doesn't have the correct list of sent out Work Units; it has no reference for the Wu that 's presented and does refuse the data ...
This will be fixed soon, I hope (too).
EDIT: Passed on this problem to make sure it gets seen to ...
.
Re: 171.67.108.21 is Reject;
Posted: Thu Feb 18, 2010 6:07 pm
by Ragnar Dan
This one and .26 are both failing HTTP requests. 27 failed uploads so far.
Re: 171.67.108.21 is Reject;
Posted: Thu Feb 18, 2010 7:34 pm
by tobor
But the board shows they're not t in rejct but I got a few that wont upload...???
My daily average just keeps goen down and down...
Re: 171.67.108.21 is Reject;
Posted: Thu Feb 18, 2010 7:42 pm
by noorman
.
What is the error message from the log of the computer that can't upload ?
Is it connecting to the WS (x.x.108.21) or to a CS; what is the IP of the machine it calls to upload ?
.
Re: 171.67.108.21 is Reject;
Posted: Thu Feb 18, 2010 7:56 pm
by dschief
Have multiple clients hung with the same problem. one has been locked for 30+ min trying to upload to a server supposedly accepting.
Code: Select all
[19:07:34] Folding@home Core Shutdown: FINISHED_UNIT
[19:07:37] CoreStatus = 64 (100)
[19:07:37] Sending work to server
[19:07:37] Project: 3469 (Run 18, Clone 106, Gen 1)
[19:07:37] - Read packet limit of 540015616... Set to 524286976.
[19:07:37] + Attempting to send results [February 18 19:07:37 UTC]
[19:07:39] - Couldn't send HTTP request to server
[19:07:39] + Could not connect to Work Server (results)
[19:07:39] (171.67.108.21:8080)
[19:07:39] + Retrying using alternative port
[19:10:48] - Couldn't send HTTP request to server
[19:10:48] + Could not connect to Work Server (results)
[19:10:48] (171.67.108.21:80)
[19:10:48] - Error: Could not transmit unit 02 (completed February 18) to work server.
[19:10:48] Keeping unit 02 in queue.
[19:10:48] Project: 3469 (Run 18, Clone 106, Gen 1)
[19:10:48] - Read packet limit of 540015616... Set to 524286976.
[19:10:48] + Attempting to send results [February 18 19:10:48 UTC]
[19:10:50] - Couldn't send HTTP request to server
[19:10:50] + Could not connect to Work Server (results)
[19:10:50] (171.67.108.21:8080)
[19:10:50] + Retrying using alternative port
[19:13:59] - Couldn't send HTTP request to server
[19:13:59] + Could not connect to Work Server (results)
[19:13:59] (171.67.108.21:80)
[19:13:59] - Error: Could not transmit unit 02 (completed February 18) to work server.
[19:13:59] - Read packet limit of 540015616... Set to 524286976.
[19:13:59] + Attempting to send results [February 18 19:13:59 UTC]
[19:46:43] + Could not connect to Work Server (results)
[19:46:43] (171.67.108.26:8080)
[19:46:43] + Retrying using alternative port
[19:46:43] - Couldn't send HTTP request to server
[19:46:43] + Could not connect to Work Server (results)
[19:46:43] (171.67.108.26:80)
[19:46:43] Could not transmit unit 02 to Collection server; keeping in queue.
[19:46:43] - Preparing to get new work unit...
[19:46:43] + Attempting to get work packet
[19:46:43] - Connecting to assignment server
[19:46:43] - Successful: assigned to (171.67.108.11).
[19:46:43] + News From Folding@Home: Welcome to Folding@Home
[19:46:43] Loaded queue successfully.
[19:46:44] Project: 3469 (Run 18, Clone 106, Gen 1)
[19:46:44] - Read packet limit of 540015616... Set to 524286976.
[19:46:44] + Attempting to send results [February 18 19:46:44 UTC]
[19:46:46] - Couldn't send HTTP request to server
[19:46:46] + Could not connect to Work Server (results)
[19:46:46] (171.67.108.21:8080)
[19:46:46] + Retrying using alternative port
Re: 171.67.108.21 is Reject;
Posted: Thu Feb 18, 2010 8:09 pm
by tobor
noorman wrote:.
What is the error message from the log of the computer that can't upload ?
Is it connecting to the WS (x.x.108.21) or to a CS; what is the IP of the machine it calls to upload ?
.
Code: Select all
5:59] + Could not connect to Work Server (results)
[17:35:59] (171.67.108.26:80)
[17:35:59] Could not transmit unit 06 to Collection server; keeping in queue.
[17:35:59] + Closed connections
[17:35:59]
[17:35:59] + Processing work unit
[17:35:59] Core required: FahCore_11.exe
[17:35:59] Core found.
[17:35:59] Working on queue slot 04 [February 18 17:35:59 UTC]
[17:35:59] + Working ...
[17:35:59]
[17:35:59] *------------------------------*
[17:35:59] Folding@Home GPU Core
[17:35:59] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[17:35:59]
[17:35:59] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[17:35:59] Build host: amoeba
[17:35:59] Board Type: Nvidia
[17:35:59] Core :
[17:35:59] Preparing to commence simulation
[17:35:59] - Looking at optimizations...
[17:35:59] DeleteFrameFiles: successfully deleted file=work/wudata_04.ckp
[17:35:59] - Created dyn
[17:35:59] - Files status OK
[17:35:59] - Expanded 88651 -> 447307 (decompressed 504.5 percent)
[17:35:59] Called DecompressByteArray: compressed_data_size=88651 data_size=447307, decompressed_data_size=447307 diff=0
[17:35:59] - Digital signature verified
[17:35:59]
[17:35:59] Project: 10105 (Run 49, Clone 5, Gen 12)
[17:35:59]
[17:35:59] Assembly optimizations on if available.
[17:35:59] Entering M.D.
[17:36:05] Tpr hash work/wudata_04.tpr: 2897199944 2596071054 3035341778 2015047456 3323095055
[17:36:05]
[17:36:05] Calling fah_main args: 14 usage=100
[17:36:05]
[17:36:06] Working on p10105_lambda_370K
[17:36:07] Client config found, loading data.
[17:36:08] Starting GUI Server
[17:37:34] Completed 1%
[17:39:00] Completed 2%
[17:40:26] Completed 3%
[17:41:52] Completed 4%
[17:43:19] Completed 5%
[17:44:45] Completed 6%
[17:46:11] Completed 7%
[17:47:37] Completed 8%
[17:49:04] Completed 9%
[17:50:30] Completed 10%
[17:51:56] Completed 11%
[17:53:22] Completed 12%
[17:54:48] Completed 13%
[17:56:15] Completed 14%
[17:57:41] Completed 15%
[17:59:07] Completed 16%
[18:00:33] Completed 17%
[18:01:59] Completed 18%
[18:03:26] Completed 19%
[18:04:52] Completed 20%
[18:06:18] Completed 21%
[18:07:44] Completed 22%
[18:09:11] Completed 23%
[18:10:37] Completed 24%
[18:12:03] Completed 25%
[18:13:32] Completed 26%
[18:14:59] Completed 27%
[18:16:26] Completed 28%
[18:17:27] Completed 29%
[18:17:27] mdrun_gpu returned
[18:17:27] NANs detected on GPU
[18:17:27]
[18:17:27] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:17:32] CoreStatus = 7A (122)
[18:17:32] Sending work to server
[18:17:32] Project: 10105 (Run 49, Clone 5, Gen 12)
[18:17:32] - Error: Could not get length of results file work/wuresults_04.dat
[18:17:32] - Error: Could not read unit 04 file. Removing from queue.
[18:17:32] Project: 3469 (Run 20, Clone 164, Gen 1)
[18:17:32] + Attempting to send results [February 18 18:17:32 UTC]
[18:17:32] - Couldn't send HTTP request to server
[18:17:32] + Could not connect to Work Server (results)
[18:17:32] (171.67.108.21:8080)
[18:17:32] + Retrying using alternative port
[18:17:53] - Couldn't send HTTP request to server
[18:17:53] + Could not connect to Work Server (results)
[18:17:53] (171.67.108.21:80)
[18:17:53] - Error: Could not transmit unit 02 (completed February 17) to work server.
[18:17:53] + Attempting to send results [February 18 18:17:53 UTC]
Folding@Home Client Shutdown.
--- Opening Log file [February 18 18:24:17 UTC]
# Windows GPU Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.23
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Documents and Settings\steve\Application Data\Folding@home-gpu2
Arguments: -gpu 1 -forcegpu nvidia_g80"
[18:24:17] - Ask before connecting: No
[18:24:17] - User name: stv911 (Team 4)
[18:24:17] - User ID: BE4069840F022A4
[18:24:17] - Machine ID: 3
[18:24:17]
[18:24:18] Loaded queue successfully.
[18:24:18] Initialization complete
[18:24:18] - Preparing to get new work unit...
[18:24:18] + Attempting to get work packet
[18:24:18] Project: 3469 (Run 20, Clone 164, Gen 1)
[18:24:18] + Attempting to send results [February 18 18:24:18 UTC]
[18:24:18] - Connecting to assignment server
[18:24:18] - Successful: assigned to (171.64.65.71).
[18:24:18] + News From Folding@Home: Welcome to Folding@Home
[18:24:18] Loaded queue successfully.
[18:24:18] - Couldn't send HTTP request to server
[18:24:18] + Could not connect to Work Server (results)
[18:24:18] (171.67.108.21:8080)
[18:24:18] + Retrying using alternative port
[18:24:19] + Closed connections
[18:24:19]
[18:24:19] + Processing work unit
[18:24:19] Core required: FahCore_11.exe
[18:24:19] Core found.
[18:24:19] Working on queue slot 05 [February 18 18:24:19 UTC]
[18:24:19] + Working ...
[18:24:19]
[18:24:19] *------------------------------*
[18:24:19] Folding@Home GPU Core
[18:24:19] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[18:24:19]
[18:24:19] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[18:24:19] Build host: amoeba
[18:24:19] Board Type: Nvidia
[18:24:19] Core :
[18:24:19] Preparing to commence simulation
[18:24:19] - Looking at optimizations...
[18:24:19] DeleteFrameFiles: successfully deleted file=work/wudata_05.ckp
[18:24:19] - Created dyn
[18:24:19] - Files status OK
[18:24:19] - Expanded 88615 -> 447307 (decompressed 504.7 percent)
[18:24:19] Called DecompressByteArray: compressed_data_size=88615 data_size=447307, decompressed_data_size=447307 diff=0
[18:24:19] - Digital signature verified
[18:24:19]
[18:24:19] Project: 10104 (Run 79, Clone 4, Gen 43)
[18:24:19]
[18:24:19] Assembly optimizations on if available.
[18:24:19] Entering M.D.
[18:24:25] Tpr hash work/wudata_05.tpr: 1869719903 3846018668 3746366035 1708418399 581598269
[18:24:25]
[18:24:25] Calling fah_main args: 14 usage=100
[18:24:25]
[18:24:26] Working on p10104_lambda_370K
[18:24:27] Client config found, loading data.
[18:24:28] Starting GUI Server
[18:24:39] - Couldn't send HTTP request to server
[18:24:39] + Could not connect to Work Server (results)
[18:24:39] (171.67.108.21:80)
[18:24:39] - Error: Could not transmit unit 02 (completed February 17) to work server.
[18:24:39] + Attempting to send results [February 18 18:24:39 UTC]
[18:25:54] Completed 1%
[18:27:20] Completed 2%
[18:28:46] Completed 3%
[18:30:12] Completed 4%
[18:31:38] Completed 5%
[18:33:04] Completed 6%
[18:34:30] Completed 7%
[18:35:57] Completed 8%
[18:37:23] Completed 9%
[18:38:49] Completed 10%
[18:40:15] Completed 11%
[18:41:41] Completed 12%
[18:43:07] Completed 13%
[18:44:33] Completed 14%
[18:45:59] Completed 15%
[18:47:25] Completed 16%
[18:48:51] Completed 17%
[18:50:18] Completed 18%
[18:51:44] Completed 19%
[18:53:08] Completed 20%
[18:53:08] mdrun_gpu returned
[18:53:08] NANs detected on GPU
[18:53:08]
[18:53:08] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:53:12] CoreStatus = 7A (122)
[18:53:12] Sending work to server
[18:53:12] - Preparing to get new work unit...
[18:53:12] + Attempting to get work packet
[18:53:12] - Connecting to assignment server
[18:53:12] - Successful: assigned to (171.64.65.71).
[18:53:12] + News From Folding@Home: Welcome to Folding@Home
[18:53:12] Loaded queue successfully.
[18:53:13] + Closed connections
[18:53:18]
[18:53:18] + Processing work unit
[18:53:18] Core required: FahCore_11.exe
[18:53:18] Core found.
[18:53:18] Working on queue slot 07 [February 18 18:53:18 UTC]
[18:53:18] + Working ...
[18:53:18]
[18:53:18] *------------------------------*
[18:53:18] Folding@Home GPU Core
[18:53:18] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[18:53:18]
[18:53:18] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[18:53:18] Build host: amoeba
[18:53:18] Board Type: Nvidia
[18:53:18] Core :
[18:53:18] Preparing to commence simulation
[18:53:18] - Looking at optimizations...
[18:53:18] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[18:53:18] - Created dyn
[18:53:18] - Files status OK
[18:53:18] - Expanded 88528 -> 447307 (decompressed 505.2 percent)
[18:53:18] Called DecompressByteArray: compressed_data_size=88528 data_size=447307, decompressed_data_size=447307 diff=0
[18:53:18] - Digital signature verified
[18:53:18]
[18:53:18] Project: 10105 (Run 258, Clone 5, Gen 9)
[18:53:18]
[18:53:18] Assembly optimizations on if available.
[18:53:18] Entering M.D.
[18:53:24] Tpr hash work/wudata_07.tpr: 1026159378 4078678355 1865917681 3691378718 161214339
[18:53:24]
[18:53:24] Calling fah_main args: 14 usage=100
[18:53:24]
[18:53:25] Working on p10105_lambda_370K
[18:53:26] Client config found, loading data.
[18:53:27] Starting GUI Server
[18:54:53] Completed 1%
[18:55:32] - Unknown packet returned from server, expected ACK for results
[18:55:32] Could not transmit unit 02 to Collection server; keeping in queue.
[18:55:32] Project: 10104 (Run 79, Clone 4, Gen 43)
[18:55:32] - Error: Could not get length of results file work/wuresults_05.dat
[18:55:32] - Error: Could not read unit 05 file. Removing from queue.
[18:55:32] Project: 10102 (Run 743, Clone 1, Gen 11)
[18:55:32] + Attempting to send results [February 18 18:55:32 UTC]
[18:55:33] - Couldn't send HTTP request to server
[18:55:33] + Could not connect to Work Server (results)
[18:55:33] (171.64.65.71:8080)
[18:55:33] + Retrying using alternative port
[18:55:35] - Couldn't send HTTP request to server
[18:55:35] + Could not connect to Work Server (results)
[18:55:35] (171.64.65.71:80)
[18:55:35] - Error: Could not transmit unit 06 (completed February 18) to work server.
[18:55:35] + Attempting to send results [February 18 18:55:35 UTC]
[18:55:37] - Server does not have record of this unit. Will try again later.
[18:55:37] Could not transmit unit 06 to Collection server; keeping in queue.
[18:56:19] Completed 2%
[18:57:45] Completed 3%
Folding@Home Client Shutdown.
--- Opening Log file [February 18 18:59:02 UTC]
# Windows GPU Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.23
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Documents and Settings\steve\Application Data\Folding@home-gpu2
Arguments: -gpu 1 -forcegpu nvidia_g80"
[18:59:02] - Ask before connecting: No
[18:59:02] - User name: stv911 (Team 4)
[18:59:02] - User ID: BE4069840F022A4
[18:59:02] - Machine ID: 3
[18:59:02]
[18:59:02] Loaded queue successfully.
[18:59:02] Initialization complete
[18:59:02]
[18:59:02] + Processing work unit
[18:59:02] Project: 3469 (Run 20, Clone 164, Gen 1)
[18:59:02] + Attempting to send results [February 18 18:59:02 UTC]
[18:59:02] Core required: FahCore_11.exe
[18:59:02] Core found.
[18:59:02] Working on queue slot 07 [February 18 18:59:02 UTC]
[18:59:02] + Working ...
[18:59:02]
[18:59:02] *------------------------------*
[18:59:02] Folding@Home GPU Core
[18:59:02] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[18:59:02]
[18:59:02] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[18:59:02] Build host: amoeba
[18:59:02] Board Type: Nvidia
[18:59:02] Core :
[18:59:02] Preparing to commence simulation
[18:59:02] - Looking at optimizations...
[18:59:02] - Files status OK
[18:59:02] - Expanded 88528 -> 447307 (decompressed 505.2 percent)
[18:59:02] Called DecompressByteArray: compressed_data_size=88528 data_size=447307, decompressed_data_size=447307 diff=0
[18:59:02] - Digital signature verified
[18:59:02]
[18:59:02] Project: 10105 (Run 258, Clone 5, Gen 9)
[18:59:02]
[18:59:02] Assembly optimizations on if available.
[18:59:02] Entering M.D.
[18:59:03] - Couldn't send HTTP request to server
[18:59:03] + Could not connect to Work Server (results)
[18:59:03] (171.67.108.21:8080)
[18:59:03] + Retrying using alternative port
[18:59:08] Will resume from checkpoint file
[18:59:08] Tpr hash work/wudata_07.tpr: 1026159378 4078678355 1865917681 3691378718 161214339
[18:59:08]
[18:59:08] Calling fah_main args: 14 usage=100
[18:59:08]
[18:59:09] Working on p10105_lambda_370K
[18:59:10] Client config found, loading data.
[18:59:11] Resuming from checkpoint
[18:59:11] fcCheckPointResume: retreived and current tpr file hash:
[18:59:11] 0 1026159378 1026159378
[18:59:11] 1 4078678355 4078678355
[18:59:11] 2 1865917681 1865917681
[18:59:11] 3 3691378718 3691378718
[18:59:11] 4 161214339 161214339
[18:59:11] fcCheckPointResume: file hashes same.
[18:59:11] fcCheckPointResume: state restored.
[18:59:11] Verified work/wudata_07.log
[18:59:11] Verified work/wudata_07.edr
[18:59:11] Verified work/wudata_07.xtc
[18:59:11] Starting GUI Server
[18:59:11] Completed 3%
[18:59:24] - Couldn't send HTTP request to server
[18:59:24] + Could not connect to Work Server (results)
[18:59:24] (171.67.108.21:80)
[18:59:24] - Error: Could not transmit unit 02 (completed February 17) to work server.
[18:59:24] + Attempting to send results [February 18 18:59:24 UTC]
[19:00:39] Completed 4%
[19:02:07] Completed 5%
[19:03:36] Completed 6%
[19:05:07] Completed 7%
[19:06:38] Completed 8%
[19:08:10] Completed 9%
[19:09:41] Completed 10%
[19:11:09] Completed 11%
[19:12:37] Completed 12%
[19:14:05] Completed 13%
[19:15:33] Completed 14%
[19:17:01] Completed 15%
[19:18:29] Completed 16%
[19:19:57] Completed 17%
[19:21:25] Completed 18%
[19:22:53] Completed 19%
[19:24:21] Completed 20%
[19:25:49] Completed 21%
[19:27:17] Completed 22%
[19:28:46] Completed 23%
[19:30:14] Completed 24%
[19:31:42] Completed 25%
[19:33:10] Completed 26%
[19:34:37] Completed 27%
[19:36:05] Completed 28%
[19:37:33] Completed 29%
[19:39:01] Completed 30%
[19:40:29] Completed 31%
[19:41:57] Completed 32%
[19:43:25] Completed 33%
[19:44:48] + Could not connect to Work Server (results)
[19:44:48] (171.67.108.26:8080)
[19:44:48] + Retrying using alternative port
[19:44:48] - Couldn't send HTTP request to server
[19:44:48] + Could not connect to Work Server (results)
[19:44:48] (171.67.108.26:80)
[19:44:48] Could not transmit unit 02 to Collection server; keeping in queue.
[19:44:48] Project: 10102 (Run 743, Clone 1, Gen 11)
[19:44:48] + Attempting to send results [February 18 19:44:48 UTC]
[19:44:50] - Couldn't send HTTP request to server
[19:44:50] + Could not connect to Work Server (results)
[19:44:50] (171.64.65.71:8080)
[19:44:50] + Retrying using alternative port
[19:44:51] - Couldn't send HTTP request to server
[19:44:51] + Could not connect to Work Server (results)
[19:44:51] (171.64.65.71:80)
[19:44:51] - Error: Could not transmit unit 06 (completed February 18) to work server.
[19:44:51] + Attempting to send results [February 18 19:44:51 UTC]
[19:44:53] Completed 34%
[19:45:12] - Couldn't send HTTP request to server
[19:45:12] + Could not connect to Work Server (results)
[19:45:12] (171.67.108.26:8080)
[19:45:12] + Retrying using alternative port
[19:45:12] - Couldn't send HTTP request to server
[19:45:12] (Got status 503)
[19:45:12] + Could not connect to Work Server (results)
[19:45:12] (171.67.108.26:80)
[19:45:12] Could not transmit unit 06 to Collection server; keeping in queue.
[19:46:21] Completed 35%
[19:47:49] Completed 36%
[19:49:17] Completed 37%
[19:50:44] Completed 38%
Re: 171.67.108.21 is Reject;
Posted: Fri Feb 19, 2010 4:56 am
by Ravage7779
Also still having issues sending wu's back to 171.67.108.21. It's still broken.
Re: 171.67.108.21 is Reject;
Posted: Fri Feb 19, 2010 8:42 am
by noorman
Ravage7779 wrote:Also still having issues sending wu's back to 171.67.108.21. It's still broken.
..
Passed that prblem on (yesterday my time - UTC/GMT+1)
.
Re: 171.67.108.21 is Reject;
Posted: Fri Feb 19, 2010 2:17 pm
by tobor
noorman wrote:Ravage7779 wrote:Also still having issues sending wu's back to 171.67.108.21. It's still broken.
..
Passed that prblem on (yesterday my time - UTC/GMT+1)
.
Well you must be special...lol
Code: Select all
GPU
[11:22:47] + Attempting to send results [February 19 11:22:47 UTC]
[11:22:48] - Couldn't send HTTP request to server
[11:22:48] + Could not connect to Work Server (results)
[11:22:48] (171.67.108.21:8080)
[11:22:48] + Retrying using alternative port
[11:23:08] - Couldn't send HTTP request to server
[11:23:08] + Could not connect to Work Server (results)
[11:23:08] (171.67.108.21:80)
[11:23:08] - Error: Could not transmit unit 02 (completed February 17) to work server.
[11:23:08] + Attempting to send results [February 19 11:23:08 UTC]
[11:47:33] - Unknown packet returned from server, expected ACK for results
[11:47:33] Could not transmit unit 02 to Collection server; keeping in queue.
[11:47:33] Project: 10102 (Run 743, Clone 1, Gen 11)
[11:47:33] + Attempting to send results [February 19 11:47:33 UTC]
[11:47:34] - Couldn't send HTTP request to server
[11:47:34] + Could not connect to Work Server (results)
[11:47:34] (171.64.65.71:8080)
[11:47:34] + Retrying using alternative port
[11:47:35] - Couldn't send HTTP request to server
[11:47:35] + Could not connect to Work Server (results)
[11:47:35] (171.64.65.71:80)
[11:47:35] - Error: Could not transmit unit 06 (completed February 18) to work server.
[11:47:35] + Attempting to send results [February 19 11:47:35 UTC]
[11:48:03] - Couldn't send HTTP request to server
[11:48:03] + Could not connect to Work Server (results)
[11:48:03] (171.67.108.26:8080)
[11:48:03] + Retrying using alternative port
[11:48:03] - Couldn't send HTTP request to server
[11:48:03] + Could not connect to Work Server (results)
[11:48:03] (171.67.108.26:80)
[11:48:03] Could not transmit unit 06 to Collection server; keeping in queue.
[11:48:03] - Preparing to get new work unit...
[11:48:03] + Attempting to get work packet
[11:48:03] - Connecting to assignment server
[11:48:03] - Successful: assigned to (171.64.65.71).
[11:48:03] + News From Folding@Home: Welcome to Folding@Home
[11:48:04] Loaded queue successfully.
[11:48:04] - Couldn't send HTTP request to server
[11:48:04] + Could not connect to Work Server
[11:48:04] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[11:48:17] + Attempting to get work packet
[11:48:17] - Connecting to assignment server
[11:48:17] - Successful: assigned to (171.64.65.71).
[11:48:17] + News From Folding@Home: Welcome to Folding@Home
[11:48:18] Loaded queue successfully.
[11:48:18] - Couldn't send HTTP request to server
[11:48:18] + Could not connect to Work Server
[11:48:18] - Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[11:48:36] + Attempting to get work packet
[11:48:36] - Connecting to assignment server
[11:48:37] - Successful: assigned to (171.64.65.71).
[11:48:37] + News From Folding@Home: Welcome to Folding@Home
[11:48:37] Loaded queue successfully.
[11:48:37] - Couldn't send HTTP request to server
[11:48:37] + Could not connect to Work Server
[11:48:37] - Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[11:49:05] + Attempting to get work packet
[11:49:05] - Connecting to assignment server
[11:49:05] - Successful: assigned to (171.64.65.71).
[11:49:05] + News From Folding@Home: Welcome to Folding@Home
[11:49:05] Loaded queue successfully.
[11:49:06] - Couldn't send HTTP request to server
[11:49:06] + Could not connect to Work Server
[11:49:06] - Attempt #4 to get work failed, and no other work to do.
Waiting before retry.
[11:49:49] Project: 3469 (Run 20, Clone 164, Gen 1)
[11:49:49] + Attempting to send results [February 19 11:49:49 UTC]
[11:49:49] - Couldn't send HTTP request to server
[11:49:49] + Could not connect to Work Server (results)
[11:49:49] (171.67.108.21:8080)
[11:49:49] + Retrying using alternative port
[11:49:51] + Attempting to get work packet
[11:49:51] - Connecting to assignment server
[11:49:52] - Successful: assigned to (171.64.65.71).
[11:49:52] + News From Folding@Home: Welcome to Folding@Home
[11:49:52] Loaded queue successfully.
[11:49:53] + Closed connections
[11:49:53]
[11:49:53] + Processing work unit
[11:49:53] Core required: FahCore_11.exe
[11:49:53] Core found.
[11:49:53] Working on queue slot 07 [February 19 11:49:53 UTC]
[11:49:53] + Working ...
[11:49:53]
[11:49:53] *------------------------------*
[11:49:53] Folding@Home GPU Core
[11:49:53] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[11:49:53]
[11:49:53] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[11:49:53] Build host: amoeba
[11:49:53] Board Type: Nvidia
[11:49:53] Core :
[11:49:53] Preparing to commence simulation
[11:49:53] - Looking at optimizations...
[11:49:53] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[11:49:53] - Created dyn
[11:49:53] - Files status OK
[11:49:53] - Expanded 88668 -> 447307 (decompressed 504.4 percent)
[11:49:53] Called DecompressByteArray: compressed_data_size=88668 data_size=447307, decompressed_data_size=447307 diff=0
[11:49:53] - Digital signature verified
[11:49:53]
[11:49:53] Project: 10104 (Run 7, Clone 0, Gen 49)
[11:49:53]
[11:49:53] Assembly optimizations on if available.
[11:49:53] Entering M.D.
[11:49:59] Tpr hash work/wudata_07.tpr: 1562808168 1296928566 3965288469 2490880732 3476058361
[11:49:59]
[11:49:59] Calling fah_main args: 14 usage=100
[11:49:59]
[11:49:59] Working on p10104_lambda_370K
[11:50:01] Client config found, loading data.
[11:50:01] Starting GUI Server
[11:50:10] - Couldn't send HTTP request to server
[11:50:10] + Could not connect to Work Server (results)
[11:50:10] (171.67.108.21:80)
[11:50:10] - Error: Could not transmit unit 02 (completed February 17) to work server.
[11:50:10] + Attempting to send results [February 19 11:50:10 UTC]
[11:51:29] Completed 1%
[11:52:57] Completed 2%
[11:54:25] Completed 3%
[11:55:53] Completed 4%
[11:57:21] Completed 5%
[11:58:49] Completed 6%
[12:00:17] Completed 7%
[12:01:45] Completed 8%
[12:03:13] Completed 9%
[12:04:41] Completed 10%
[12:06:09] Completed 11%
[12:07:37] Completed 12%
[12:09:05] Completed 13%
[12:10:33] Completed 14%
[12:12:01] Completed 15%
[12:13:29] Completed 16%
[12:14:57] Completed 17%
[12:16:25] Completed 18%
[12:17:53] Completed 19%
[12:18:47] - Unknown packet returned from server, expected ACK for results
[12:18:47] Could not transmit unit 02 to Collection server; keeping in queue.
[12:18:47] Project: 10102 (Run 743, Clone 1, Gen 11)
[12:18:47] + Attempting to send results [February 19 12:18:47 UTC]
[12:18:48] - Couldn't send HTTP request to server
[12:18:48] + Could not connect to Work Server (results)
[12:18:48] (171.64.65.71:8080)
[12:18:48] + Retrying using alternative port
[12:18:49] - Couldn't send HTTP request to server
[12:18:49] + Could not connect to Work Server (results)
[12:18:49] (171.64.65.71:80)
[12:18:49] - Error: Could not transmit unit 06 (completed February 18) to work server.
[12:18:49] + Attempting to send results [February 19 12:18:49 UTC]
[12:19:21] Completed 20%
[12:20:49] Completed 21%
[12:22:17] Completed 22%
[12:23:45] Completed 23%
[12:25:13] Completed 24%
[12:26:41] Completed 25%
[12:28:09] Completed 26%
[12:29:37] Completed 27%
[12:31:05] Completed 28%
[12:32:33] Completed 29%
[12:34:01] Completed 30%
[12:35:29] Completed 31%
[12:36:57] Completed 32%
[12:38:25] Completed 33%
[12:39:53] Completed 34%
[12:40:59] - Unknown packet returned from server, expected ACK for results
[12:40:59] Could not transmit unit 06 to Collection server; keeping in queue.
[12:41:21] Completed 35%
[12:42:49] Completed 36%
[12:44:17] Completed 37%
[12:45:45] Completed 38%
[12:47:13] Completed 39%
[12:48:41] Completed 40%
[12:50:09] Completed 41%
[12:51:37] Completed 42%
[12:53:05] Completed 43%
[12:54:33] Completed 44%
[12:56:01] Completed 45%
[12:57:29] Completed 46%
[12:58:57] Completed 47%
[13:00:25] Completed 48%
[13:01:53] Completed 49%
[13:03:21] Completed 50%
[13:04:49] Completed 51%
[13:06:17] Completed 52%
[13:07:45] Completed 53%
[13:09:13] Completed 54%
[13:10:41] Completed 55%
[13:12:09] Completed 56%
[13:13:37] Completed 57%
[13:15:05] Completed 58%
[13:16:33] Completed 59%
[13:18:01] Completed 60%
[13:19:29] Completed 61%
[13:20:57] Completed 62%
[13:22:25] Completed 63%
[13:23:53] Completed 64%
[13:25:21] Completed 65%
[13:26:49] Completed 66%
[13:28:17] Completed 67%
[13:29:45] Completed 68%
[13:31:13] Completed 69%
[13:32:41] Completed 70%
[13:34:09] Completed 71%
[13:35:37] Completed 72%
[13:37:05] Completed 73%
[13:38:33] Completed 74%
[13:40:01] Completed 75%
[13:41:29] Completed 76%
GPU2
[12:42:37] - Preparing to get new work unit...
[12:42:37] + Attempting to get work packet
[12:42:37] - Connecting to assignment server
[12:42:37] - Successful: assigned to (171.64.65.71).
[12:42:37] + News From Folding@Home: Welcome to Folding@Home
[12:42:37] Loaded queue successfully.
[12:42:38] - Couldn't send HTTP request to server
[12:42:38] + Could not connect to Work Server
[12:42:38] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[12:42:47] + Attempting to get work packet
[12:42:47] - Connecting to assignment server
[12:42:48] - Successful: assigned to (171.64.65.71).
[12:42:48] + News From Folding@Home: Welcome to Folding@Home
[12:42:48] Loaded queue successfully.
[12:42:48] - Couldn't send HTTP request to server
[12:42:48] + Could not connect to Work Server
[12:42:48] - Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[12:43:07] + Attempting to get work packet
[12:43:07] - Connecting to assignment server
[12:43:07] - Successful: assigned to (171.64.65.71).
[12:43:07] + News From Folding@Home: Welcome to Folding@Home
[12:43:08] Loaded queue successfully.
[12:43:08] - Couldn't send HTTP request to server
[12:43:08] + Could not connect to Work Server
[12:43:08] - Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[12:43:36] + Attempting to get work packet
[12:43:36] - Connecting to assignment server
[12:43:36] - Successful: assigned to (171.64.65.71).
[12:43:36] + News From Folding@Home: Welcome to Folding@Home
[12:43:36] Loaded queue successfully.
[12:43:38] + Closed connections
[12:43:38]
[12:43:38] + Processing work unit
[12:43:38] Core required: FahCore_11.exe
[12:43:38] Core found.
[12:43:38] Working on queue slot 03 [February 19 12:43:38 UTC]
[12:43:38] + Working ...
[12:43:38]
[12:43:38] *------------------------------*
[12:43:38] Folding@Home GPU Core
[12:43:38] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[12:43:38]
[12:43:38] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[12:43:38] Build host: amoeba
[12:43:38] Board Type: Nvidia
[12:43:38] Core :
[12:43:38] Preparing to commence simulation
[12:43:38] - Looking at optimizations...
[12:43:38] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[12:43:38] - Created dyn
[12:43:38] - Files status OK
[12:43:38] - Expanded 88585 -> 447307 (decompressed 504.9 percent)
[12:43:38] Called DecompressByteArray: compressed_data_size=88585 data_size=447307, decompressed_data_size=447307 diff=0
[12:43:38] - Digital signature verified
[12:43:38]
[12:43:38] Project: 10105 (Run 143, Clone 8, Gen 17)
[12:43:38]
[12:43:38] Assembly optimizations on if available.
[12:43:38] Entering M.D.
[12:43:44] Tpr hash work/wudata_03.tpr: 854742982 885193027 1353723476 570622627 4271095551
[12:43:44]
[12:43:44] Calling fah_main args: 14 usage=100
[12:43:44]
[12:43:45] Working on p10105_lambda_370K
[12:43:46] Client config found, loading data.
[12:43:46] Starting GUI Server
[12:44:39] Completed 1%
[12:45:32] Completed 2%
[12:46:25] Completed 3%
[12:47:18] Completed 4%
[12:48:11] Completed 5%
[12:49:04] Completed 6%
[12:49:57] Completed 7%
[12:50:50] Completed 8%
[12:51:43] Completed 9%
[12:52:36] Completed 10%
[12:53:29] Completed 11%
[12:54:22] Completed 12%
[12:55:15] Completed 13%
[12:56:08] Completed 14%
[12:57:00] Completed 15%
[12:57:53] Completed 16%
[12:58:46] Completed 17%
[12:59:39] Completed 18%
[13:00:32] Completed 19%
Re: 171.67.108.21 is Reject;
Posted: Fri Feb 19, 2010 2:56 pm
by noorman
.
@ tobor
I 'm just trying to help Pande Group the best I can LOL
Could you (or have you tested) test the following by entering (preferably from a blank page) the IP addresses in to your webbrowser's URL bar and then push ENTER ...
171.67.108.21:8080
then 171.67.108.21:80
then 171.64.65.71:8080
then 171.64.65.71:80
then 171.67.108.26:8080
then 171.67.108.26:80
If none of these give you a blank page with in the top left corner an OK, you have a connection problem between your PC and Stanford (or your LAN /Firewall !
My GPU-folder has been able - in the same timeframe - to connect to 171.67.108.21:8080 within 1 attempt and upload a Results file
Also it has been able to connect to 171.64.65.71:8080 within 1 attempt to do the same there and again for the following WU, it got the Results back to 171.64.65.71:8080 within 1 attempt.
Inbetween it had no prblems to get Work from 171.67.108.21:8080 within the 1st attempt and from 171.64.65.71:8080 in the 3rd attempt to do so ...
It is therefor very unlikely that your system keeps failing to connect to any of the named servers at any of their 2 ports ( used by F@H, being port 8080 and port 80 ) / port indicated by :number
.
Re: 171.67.108.21 is Reject;
Posted: Fri Feb 19, 2010 4:12 pm
by tobor
Cannot connect to any of um..
The :8080s says (Web page cannot be displayed) The :80s just sits there trying to connect but never does..
thnx
Re: 171.67.108.21 is Reject;
Posted: Fri Feb 19, 2010 4:28 pm
by tobor
Re: 171.67.108.21 is Reject;
Posted: Fri Feb 19, 2010 4:35 pm
by noorman
tobor wrote:Cannot connect to any of um..
The :8080s says (Web page cannot be displayed) The :80s just sits there trying to connect but never does..
thnx
.
Can you run a tracert (traceroute) to one of the IP addresses ?
I:\Documents and Settings\x>tracert 171.67.108.26
Tracing route to vsp09a.Stanford.EDU [171.67.108.26]
over a maximum of 30 hops:
1 <1 ms <1 ms <1 ms 192.168.2.1
2 <1 ms <1 ms <1 ms 192.168.3.1
3 7 ms 7 ms 5 ms d51537001.access.telenet.be [81.83.112.1]
4 10 ms 11 ms 8 ms dD5E0C521.access.telenet.be [213.224.197.33]
5 9 ms 10 ms 9 ms dD5E0FD59.access.telenet.be [213.224.253.89]
6 8 ms 11 ms 8 ms dD5E0FDB9.access.telenet.be [213.224.253.185]
7 10 ms 9 ms 9 ms xe-0-2-0.anr11.ip4.tinet.net [77.67.65.177]
8 19 ms 31 ms 17 ms xe-0-0-0.lon11.ip4.tinet.net [89.149.185.166]
9 19 ms 19 ms 19 ms te7-6.mpd02.lon01.atlas.cogentco.com [130.117.15.49]
10 97 ms 97 ms 98 ms te0-2-0-1.mpd21.jfk02.atlas.cogentco.com [66.28.4.189]
11 126 ms 125 ms 126 ms te1-8.mpd01.ord01.atlas.cogentco.com [154.54.29.157]
12 149 ms 150 ms 150 ms te0-4-0-0.mpd21.mci01.atlas.cogentco.com [154.54.30.170]
13 169 ms 168 ms 168 ms te2-2.mpd01.sfo01.atlas.cogentco.com [154.54.6.38]
14 168 ms 170 ms 169 ms te4-2.mpd01.sjc04.atlas.cogentco.com [154.54.2.166]
15 170 ms 171 ms 169 ms Stanford_University2.demarc.cogentco.com [66.250.7.138]
16 183 ms 182 ms 181 ms bnda-rtr-1.Stanford.EDU [68.65.168.33]
17 * * * Request timed out.
18 * * * Request timed out.
19 183 ms 181 ms 182 ms vsp09a.Stanford.EDU [171.67.108.26]
Trace complete.
.
This checks if there is any connection possible at all, not only a HTTP connection ...
( the time-out is on the Stanford network, probably a Firewall, so you can ignore that )
It can display the source/location of the break in your connection to Stanford (too) when it starts doing what you see in hops #17 and 18 in my traceroute copy.
.