Page 4 of 4
Re: 171.65.103.100 ports 80 and 8080 down
Posted: Tue Jun 16, 2009 4:16 pm
by 7im
Just a reminder, since this server is only a backup collection server, please also post which primary work server your clients are failing to connect to. Thanks.
Edit by Mod:
All of the posts in the thread "171.65.103.100 ports 80 and 8080 down" also involve Work Server 171.64.122.70 so I'm going to merge this with the other threads that discuss that pair of servers.
Re: 171.65.103.100 ports 80 and 8080 down
Posted: Tue Jun 16, 2009 4:36 pm
by shdbcamping
7im wrote:Just a reminder, since this server is only a backup collection server, please also post which primary work server your clients are failing to connect to. Thanks.
It should be the other one in my code posted
Re: 171.65.103.100 ports 80 and 8080 down
Posted: Tue Jun 16, 2009 4:41 pm
by Coolpplse
I have Project: 5905 (Run 7, Clone 792, Gen 5), Core: 14 completed and its work server was 171.64.122.70:8080, and the collection server was this one... 171.65.103.100 and they're both down....
Re: 171.65.103.100 ports 80 and 8080 down
Posted: Tue Jun 16, 2009 4:54 pm
by Shadowtester
Code: Select all
[14:36:36] Completed 100%
[14:36:36] Successful run
[14:36:36] DynamicWrapper: Finished Work Unit: sleep=10000
[14:36:46] Reserved 113424 bytes for xtc file; Cosm status=0
[14:36:46] Allocated 113424 bytes for xtc file
[14:36:46] - Reading up to 113424 from "work/wudata_06.xtc": Read 113424
[14:36:46] Read 113424 bytes from xtc file; available packet space=786317040
[14:36:46] xtc file hash check passed.
[14:36:46] Reserved 33528 33528 786317040 bytes for arc file=<work/wudata_06.trr> Cosm status=0
[14:36:46] Allocated 33528 bytes for arc file
[14:36:46] - Reading up to 33528 from "work/wudata_06.trr": Read 33528
[14:36:46] Read 33528 bytes from arc file; available packet space=786283512
[14:36:46] trr file hash check passed.
[14:36:46] Allocated 560 bytes for edr file
[14:36:46] Read bedfile
[14:36:46] edr file hash check passed.
[14:36:46] Allocated 31068 bytes for logfile
[14:36:46] Read logfile
[14:36:46] GuardedRun: success in DynamicWrapper
[14:36:46] GuardedRun: done
[14:36:46] Run: GuardedRun completed.
[14:36:48] - Writing 179092 bytes of core data to disk...
[14:36:48] Done: 178580 -> 156092 (compressed to 87.4 percent)
[14:36:48] ... Done.
[14:36:48] - Shutting down core
[14:36:48]
[14:36:48] Folding@home Core Shutdown: FINISHED_UNIT
[14:36:52] CoreStatus = 64 (100)
[14:36:52] Unit 6 finished with 94 percent of time to deadline remaining.
[14:36:52] Updated performance fraction: 0.944638
[14:36:52] Sending work to server
[14:36:52] Project: 5755 (Run 12, Clone 268, Gen 293)
[14:36:52] - Read packet limit of 540015616... Set to 524286976.
[14:36:52] + Attempting to send results [June 16 14:36:52 UTC]
[14:36:52] - Reading file work/wuresults_06.dat from core
[14:36:52] (Read 156604 bytes from disk)
[14:36:52] Connecting to http://171.67.108.11:8080/
[14:36:58] Posted data.
[14:36:58] Initial: 0000; - Uploaded at ~25 kB/s
[14:36:58] - Averaged speed for that direction ~20 kB/s
[14:36:58] + Results successfully sent
[14:36:58] Thank you for your contribution to Folding@Home.
[14:36:58] + Number of Units Completed: 1363
[14:37:02] Trying to send all finished work units
[14:37:02] Project: 5905 (Run 0, Clone 441, Gen 6)
[14:37:02] - Read packet limit of 540015616... Set to 524286976.
[14:37:02] + Attempting to send results [June 16 14:37:02 UTC]
[14:37:02] - Reading file work/wuresults_00.dat from core
[14:37:02] (Read 70355 bytes from disk)
[14:37:02] Connecting to http://171.64.122.70:8080/
[14:40:11] - Couldn't send HTTP request to server
[14:40:11] + Could not connect to Work Server (results)
[14:40:11] (171.64.122.70:8080)
[14:40:11] + Retrying using alternative port
[14:40:11] Connecting to http://171.64.122.70:80/
[14:43:20] - Couldn't send HTTP request to server
[14:43:20] + Could not connect to Work Server (results)
[14:43:20] (171.64.122.70:80)
[14:43:20] - Error: Could not transmit unit 00 (completed June 15) to work server.
[14:43:20] - 30 failed uploads of this unit.
[14:43:20] - Read packet limit of 540015616... Set to 524286976.
[14:43:20] + Attempting to send results [June 16 14:43:20 UTC]
[14:43:20] - Reading file work/wuresults_00.dat from core
[14:43:20] (Read 70355 bytes from disk)
[14:43:20] Connecting to http://171.65.103.100:8080/
[14:46:29] - Couldn't send HTTP request to server
[14:46:29] + Could not connect to Work Server (results)
[14:46:29] (171.65.103.100:8080)
[14:46:29] + Retrying using alternative port
[14:46:29] Connecting to http://171.65.103.100:80/
[14:49:38] - Couldn't send HTTP request to server
[14:49:38] + Could not connect to Work Server (results)
[14:49:38] (171.65.103.100:80)
[14:49:38] Could not transmit unit 00 to Collection server; keeping in queue.
[14:49:38] Project: 5905 (Run 0, Clone 441, Gen 6)
[14:49:38] - Read packet limit of 540015616... Set to 524286976.
[14:49:38] + Attempting to send results [June 16 14:49:38 UTC]
[14:49:38] - Reading file work/wuresults_00.dat from core
[14:49:38] (Read 70355 bytes from disk)
[14:49:38] Connecting to http://171.64.122.70:8080/
[14:52:47] - Couldn't send HTTP request to server
[14:52:47] + Could not connect to Work Server (results)
[14:52:47] (171.64.122.70:8080)
[14:52:47] + Retrying using alternative port
[14:52:47] Connecting to http://171.64.122.70:80/
[14:55:56] - Couldn't send HTTP request to server
[14:55:56] + Could not connect to Work Server (results)
[14:55:56] (171.64.122.70:80)
[14:55:56] - Error: Could not transmit unit 00 (completed June 15) to work server.
[14:55:56] - 31 failed uploads of this unit.
[14:55:56] - Read packet limit of 540015616... Set to 524286976.
[14:55:56] + Attempting to send results [June 16 14:55:56 UTC]
[14:55:56] - Reading file work/wuresults_00.dat from core
[14:55:56] (Read 70355 bytes from disk)
[14:55:56] Connecting to http://171.65.103.100:8080/
[14:59:05] - Couldn't send HTTP request to server
[14:59:05] + Could not connect to Work Server (results)
[14:59:05] (171.65.103.100:8080)
[14:59:05] + Retrying using alternative port
[14:59:05] Connecting to http://171.65.103.100:80/
[15:02:14] - Couldn't send HTTP request to server
[15:02:14] + Could not connect to Work Server (results)
[15:02:14] (171.65.103.100:80)
[15:02:14] Could not transmit unit 00 to Collection server; keeping in queue.
[15:02:14] + Sent 0 of 2 completed units to the server
[15:02:14] - Preparing to get new work unit...
[15:02:14] + Attempting to get work packet
[15:02:14] - Will indicate memory of 1024 MB
[15:02:14] - Connecting to assignment server
[15:02:14] Connecting to http://assign-GPU.stanford.edu:8080/
[15:02:17] Posted data.
[15:02:17] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[15:02:17] + News From Folding@Home: Welcome to Folding@Home
[15:02:17] Loaded queue successfully.
[15:02:17] Connecting to http://171.67.108.11:8080/
[15:02:19] Posted data.
[15:02:19] Initial: 0000; - Receiving payload (expected size: 45933)
[15:02:20] - Downloaded at ~44 kB/s
[15:02:20] - Averaged speed for that direction ~72 kB/s
[15:02:20] + Received work.
[15:02:20] Trying to send all finished work units
[15:02:20] Project: 5905 (Run 0, Clone 441, Gen 6)
[15:02:20] - Read packet limit of 540015616... Set to 524286976.
[15:02:20] + Attempting to send results [June 16 15:02:20 UTC]
[15:02:20] - Reading file work/wuresults_00.dat from core
[15:02:20] (Read 70355 bytes from disk)
[15:02:20] Connecting to http://171.64.122.70:8080/
[15:05:29] - Couldn't send HTTP request to server
[15:05:29] + Could not connect to Work Server (results)
[15:05:29] (171.64.122.70:8080)
[15:05:29] + Retrying using alternative port
[15:05:29] Connecting to http://171.64.122.70:80/
[15:08:38] - Couldn't send HTTP request to server
[15:08:38] + Could not connect to Work Server (results)
[15:08:38] (171.64.122.70:80)
[15:08:38] - Error: Could not transmit unit 00 (completed June 15) to work server.
[15:08:38] - 32 failed uploads of this unit.
[15:08:38] - Read packet limit of 540015616... Set to 524286976.
[15:08:38] + Attempting to send results [June 16 15:08:38 UTC]
[15:08:38] - Reading file work/wuresults_00.dat from core
[15:08:38] (Read 70355 bytes from disk)
[15:08:38] Connecting to http://171.65.103.100:8080/
[15:11:47] - Couldn't send HTTP request to server
[15:11:47] + Could not connect to Work Server (results)
[15:11:47] (171.65.103.100:8080)
[15:11:47] + Retrying using alternative port
[15:11:47] Connecting to http://171.65.103.100:80/
[15:14:56] - Couldn't send HTTP request to server
[15:14:56] + Could not connect to Work Server (results)
[15:14:56] (171.65.103.100:80)
[15:14:56] Could not transmit unit 00 to Collection server; keeping in queue.
[15:14:56] Project: 5905 (Run 0, Clone 441, Gen 6)
[15:14:56] - Read packet limit of 540015616... Set to 524286976.
[15:14:56] + Attempting to send results [June 16 15:14:56 UTC]
[15:14:56] - Reading file work/wuresults_00.dat from core
[15:14:56] (Read 70355 bytes from disk)
[15:14:56] Connecting to http://171.64.122.70:8080/
[15:18:05] - Couldn't send HTTP request to server
[15:18:05] + Could not connect to Work Server (results)
[15:18:05] (171.64.122.70:8080)
[15:18:05] + Retrying using alternative port
[15:18:05] Connecting to http://171.64.122.70:80/
[15:21:14] - Couldn't send HTTP request to server
[15:21:14] + Could not connect to Work Server (results)
[15:21:14] (171.64.122.70:80)
[15:21:14] - Error: Could not transmit unit 00 (completed June 15) to work server.
[15:21:14] - 33 failed uploads of this unit.
[15:21:14] - Read packet limit of 540015616... Set to 524286976.
[15:21:14] + Attempting to send results [June 16 15:21:14 UTC]
[15:21:14] - Reading file work/wuresults_00.dat from core
[15:21:14] (Read 70355 bytes from disk)
[15:21:14] Connecting to http://171.65.103.100:8080/
[15:24:23] - Couldn't send HTTP request to server
[15:24:23] + Could not connect to Work Server (results)
[15:24:23] (171.65.103.100:8080)
[15:24:23] + Retrying using alternative port
[15:24:23] Connecting to http://171.65.103.100:80/
[15:27:32] - Couldn't send HTTP request to server
[15:27:32] + Could not connect to Work Server (results)
[15:27:32] (171.65.103.100:80)
[15:27:32] Could not transmit unit 00 to Collection server; keeping in queue.
[15:27:32] + Sent 0 of 2 completed units to the server
[15:27:32] + Closed connections
[15:27:32]
[15:27:32] + Processing work unit
[15:27:32] Core required: FahCore_11.exe
[15:27:32] Core found.
[15:27:32] Working on queue slot 07 [June 16 15:27:32 UTC]
[15:27:32] + Working ...
[15:27:32] - Calling '.\FahCore_11.exe -dir work/ -suffix 07 -priority 96 -nocpulock -checkpoint 15 -forceasm -verbose -lifeline 8 -version 623'
[15:27:32]
[15:27:32] *------------------------------*
[15:27:32] Folding@Home GPU Core - Beta
[15:27:32] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[15:27:32]
[15:27:32] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[15:27:32] Build host: amoeba
[15:27:32] Board Type: Nvidia
[15:27:32] Core :
[15:27:32] Preparing to commence simulation
[15:27:32] - Assembly optimizations manually forced on.
[15:27:32] - Not checking prior termination.
[15:27:32] - Expanded 45421 -> 251112 (decompressed 552.8 percent)
[15:27:32] Called DecompressByteArray: compressed_data_size=45421 data_size=251112, decompressed_data_size=251112 diff=0
[15:27:32] - Digital signature verified
[15:27:32]
[15:27:32] Project: 5772 (Run 2, Clone 1, Gen 417)
[15:27:32]
[15:27:32] Assembly optimizations on if available.
[15:27:32] Entering M.D.
[15:27:39] Working on Protein
[15:27:40] Client config found, loading data.
[15:27:40] Starting GUI Server
[15:29:01] Completed 1%
There is the FAH log from the time of the last completed wu till the start of folding for the latest wu. It seems the primary collection for these wu' is 171.64.122.70 which is down as well so both the primary and backup wu collection servers are down not a good situation!
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 5:01 pm
by Shadowtester
I have been having problems with this server since 16:30:37 GMT June 15th the client has tried to connect on both port 80 and 8080 multiple time I currently have 2 completed wu's for this server and what compounds the problem is that the backup collection server for these wu's is also down right now any idea when either this server or 171.65.103.100 might be back up this problem is already 24 hours old and with these wu they have a short time limit!!!!!!
Re: 171.65.103.100 ports 80 and 8080 down
Posted: Tue Jun 16, 2009 5:15 pm
by 7im
That's why it's good document both servers for each pending WU.
I see that Pande Group is aware of the problem, and working on a fix.
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 6:13 pm
by Trotador
Same problem here, no way to send a 5904 WU
Trotador
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 7:54 pm
by jimerickson
getting quite a large queue of work units. is this server going to be up and running soon? or should i just quit folding for awhile?
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 8:09 pm
by 7im
No need to quit folding until you hit a full queue of 10 WUs.
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 8:34 pm
by jimerickson
i am at10 work units in queue now. going offline....
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 8:38 pm
by bruce
jimerickson wrote:i am at10 work units in queue now. going offline....
What are the project numbers and/or server numbers? Which version of the client are you running?
Please post the output from either -queueinfo or from qd.
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 9:39 pm
by jimerickson
now down to 1 WU in queue!
Code: Select all
Slot 02 Done
Project: 5905 (Run 3, Clone 609, Gen 6), Core: 14
Work server: 171.64.122.70:8080
Collection server: 171.65.103.100
Download date: June 15 04:17:06
Finished date: June 15 14:38:22
Failed uploads: 32
Re: 171.64.122.70
Posted: Tue Jun 16, 2009 10:01 pm
by jimerickson
by the way i am running the windows 6.23 gpu console client on 64-bit gentoo on a nvidia gtx8800. thanks for all you guys do to keep things running.
Re: 171.64.122.70
Posted: Wed Jun 17, 2009 4:59 am
by anko1
My unit just uploaded after 26 tries. Looks like the server is good right now. Thanks for fixing it.
Re: 171.64.122.70
Posted: Wed Jun 17, 2009 6:11 am
by Teddy
Just got home from work, Yep all good here now, no unsent units remaining!
Teddy