Page 1 of 1

171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 7:31 am
by artoar_11
Server Status (171.64.65.54) - Jun 29/2011/19:50:10 PDT; STATUS - full ; CONNECT - DOWN.

Code: Select all

[04:52:38] Completed 495000 out of 500000 steps  (99%)
[04:56:09] Completed 500000 out of 500000 steps  (100%)
[04:56:09] DynamicWrapper: Finished Work Unit: sleep=10000
[04:56:19] 
[04:56:19] Finished Work Unit:
[04:56:19] - Reading up to 3700560 from "work/wudata_02.trr": Read 3700560
[04:56:19] trr file hash check passed.
[04:56:19] edr file hash check passed.
[04:56:19] logfile size: 58610
[04:56:19] Leaving Run
[04:56:21] - Writing 3795130 bytes of core data to disk...
[04:56:21]   ... Done.
[04:56:21] - Shutting down core
[04:56:21] 
[04:56:21] Folding@home Core Shutdown: FINISHED_UNIT
[04:56:25] CoreStatus = 64 (100)
[04:56:25] Unit 2 finished with 96 percent of time to deadline remaining.
[04:56:25] Updated performance fraction: 0.966493
[04:56:25] Sending work to server
[04:56:25] Project: 6057 (Run 0, Clone 174, Gen 293)


[04:56:25] + Attempting to send results [June 30 04:56:25 UTC]
[04:56:25] - Reading file work/wuresults_02.dat from core
[04:56:25]   (Read 3795130 bytes from disk)
[04:56:25] Connecting to http://171.64.65.54:8080/
[04:56:46] - Couldn't send HTTP request to server
[04:56:46] + Could not connect to Work Server (results)
[04:56:46]     (171.64.65.54:8080)
[04:56:46] + Retrying using alternative port
[04:56:46] Connecting to http://171.64.65.54:80/
[04:57:07] - Couldn't send HTTP request to server
[04:57:07] + Could not connect to Work Server (results)
[04:57:07]     (171.64.65.54:80)
[04:57:07] - Error: Could not transmit unit 02 (completed June 30) to work server.
[04:57:07] - 1 failed uploads of this unit.
[04:57:07]   Keeping unit 02 in queue.
[04:57:07] Trying to send all finished work units
[04:57:07] Project: 6057 (Run 0, Clone 174, Gen 293)


[04:57:07] + Attempting to send results [June 30 04:57:07 UTC]
[04:57:07] - Reading file work/wuresults_02.dat from core
[04:57:07]   (Read 3795130 bytes from disk)
[04:57:07] Connecting to http://171.64.65.54:8080/
[04:57:28] - Couldn't send HTTP request to server
[04:57:28] + Could not connect to Work Server (results)
[04:57:28]     (171.64.65.54:8080)
[04:57:28] + Retrying using alternative port
[04:57:28] Connecting to http://171.64.65.54:80/
[04:57:49] - Couldn't send HTTP request to server
[04:57:49] + Could not connect to Work Server (results)
[04:57:49]     (171.64.65.54:80)
[04:57:49] - Error: Could not transmit unit 02 (completed June 30) to work server.
[04:57:49] - 2 failed uploads of this unit.


[04:57:49] + Attempting to send results [June 30 04:57:49 UTC]
[04:57:49] - Reading file work/wuresults_02.dat from core
[04:57:49]   (Read 3795130 bytes from disk)
[04:57:49] Connecting to http://171.67.108.25:8080/
[04:57:50] - Couldn't send HTTP request to server
[04:57:50] + Could not connect to Work Server (results)
[04:57:50]     (171.67.108.25:8080)
[04:57:50] + Retrying using alternative port
[04:57:50] Connecting to http://171.67.108.25:80/
[04:57:52] - Couldn't send HTTP request to server
[04:57:52] + Could not connect to Work Server (results)
[04:57:52]     (171.67.108.25:80)
[04:57:52]   Could not transmit unit 02 to Collection server; keeping in queue.
[04:57:52] + Sent 0 of 1 completed units to the server
[04:57:52] - Preparing to get new work unit...
[04:57:52] Cleaning up work directory
[04:57:52] + Attempting to get work packet
[04:57:52] Passkey found
[04:57:52] - Will indicate memory of 4072 MB
[04:57:52] - Connecting to assignment server
[04:57:52] Connecting to http://assign.stanford.edu:8080/
[04:57:53] Posted data.
[04:57:53] Initial: 4081; - Successful: assigned to (129.64.95.82).
[04:57:53] + News From Folding@Home: Welcome to Folding@Home
[04:57:53] Loaded queue successfully.
[04:57:53] Sent data
[04:57:53] Connecting to http://129.64.95.82:8080/
[04:57:54] Posted data.
[04:57:54] Initial: 0000; - Receiving payload (expected size: 663145)
[04:57:56] - Downloaded at ~323 kB/s
[04:57:56] - Averaged speed for that direction ~259 kB/s
[04:57:56] + Received work.
[04:57:56] Trying to send all finished work units
[04:57:56] Project: 6057 (Run 0, Clone 174, Gen 293)


[04:57:56] + Attempting to send results [June 30 04:57:56 UTC]
[04:57:56] - Reading file work/wuresults_02.dat from core
[04:57:56]   (Read 3795130 bytes from disk)
[04:57:56] Connecting to http://171.64.65.54:8080/
[04:58:17] - Couldn't send HTTP request to server
[04:58:17] + Could not connect to Work Server (results)
[04:58:17]     (171.64.65.54:8080)
[04:58:17] + Retrying using alternative port
[04:58:17] Connecting to http://171.64.65.54:80/
[04:58:38] - Couldn't send HTTP request to server
[04:58:38] + Could not connect to Work Server (results)
[04:58:38]     (171.64.65.54:80)
[04:58:38] - Error: Could not transmit unit 02 (completed June 30) to work server.
[04:58:38] - 3 failed uploads of this unit.


[04:58:38] + Attempting to send results [June 30 04:58:38 UTC]
[04:58:38] - Reading file work/wuresults_02.dat from core
[04:58:38]   (Read 3795130 bytes from disk)
[04:58:38] Connecting to http://171.67.108.25:8080/
[04:58:40] - Couldn't send HTTP request to server
[04:58:40] + Could not connect to Work Server (results)
[04:58:40]     (171.67.108.25:8080)
[04:58:40] + Retrying using alternative port
[04:58:40] Connecting to http://171.67.108.25:80/
[04:58:42] - Couldn't send HTTP request to server
[04:58:42] + Could not connect to Work Server (results)
[04:58:42]     (171.67.108.25:80)
[04:58:42]   Could not transmit unit 02 to Collection server; keeping in queue.
[04:58:42] + Sent 0 of 1 completed units to the server
[04:58:42] + Closed connections
After restart WU:

Code: Select all

###############################################################################

Launch directory: E:\_SMP_FAH-v. 6.34
Executable: E:\_SMP_FAH-v. 6.34\FAH6.34-win32-SMP.exe
Arguments: -smp -verbosity 9 

[05:55:57] - Ask before connecting: No
[05:55:57] - User name: artoar_home (Team 32435)
[05:55:57] - User ID: xxxx
[05:55:57] - Machine ID: 1
[05:55:57] 
[05:55:57] Loaded queue successfully.
[05:55:57] 
[05:55:57] - Autosending finished units... [June 30 05:55:57 UTC]
[05:55:57] + Processing work unit
[05:55:57] Trying to send all finished work units
[05:55:57] A4 will attempt to use 4 threads.
[05:55:57] Project: 6057 (Run 0, Clone 174, Gen 293)
[05:55:57] Core required: FahCore_a4.exe


[05:55:57] Core found.
[05:55:57] + Attempting to send results [June 30 05:55:57 UTC]
[05:55:57] - Reading file work/wuresults_02.dat from core
[05:55:57] Working on queue slot 03 [June 30 05:55:57 UTC]
[05:55:57]   (Read 3795130 bytes from disk)
[05:55:57] + Working ...
[05:55:57] Connecting to http://171.64.65.54:8080/
[05:55:57] - Calling '.\FahCore_a4.exe -dir work/ -nice 19 -suffix 03 -np 4 -nocpulock -checkpoint 9 -verbose -lifeline 3620 -version 634'

[05:55:57] 
[05:55:57] *------------------------------*
[05:55:57] Folding@Home Gromacs GB Core
[05:55:57] Version 2.27 (Dec. 15, 2010)
[05:55:57] 
[05:55:57] Preparing to commence simulation
[05:55:57] - Ensuring status. Please wait.
[05:56:06] - Looking at optimizations...
[05:56:06] - Working with standard loops on this execution.
[05:56:06] - Previous termination of core was improper.
[05:56:06] - Going to use standard loops.
[05:56:06] - Files status OK
[05:56:06] - Expanded 662633 -> 1297016 (decompressed 195.7 percent)
[05:56:06] Called DecompressByteArray: compressed_data_size=662633 data_size=1297016, decompressed_data_size=1297016 diff=0
[05:56:06] - Digital signature verified
[05:56:06] 
[05:56:06] Project: 7200 (Run 52, Clone 33, Gen 4)
[05:56:06] 
[05:56:06] Entering M.D.
[05:56:12] Using Gromacs checkpoints
[05:56:12] Mapping NT from 4 to 4 
[05:56:13] Resuming from checkpoint
[05:56:13] Verified work/wudata_03.log
[05:56:13] Verified work/wudata_03.trr
[05:56:13] Verified work/wudata_03.xtc
[05:56:13] Verified work/wudata_03.edr
[05:56:13] Completed 104070 out of 750000 steps  (13%)
[05:56:18] - Couldn't send HTTP request to server
[05:56:18] + Could not connect to Work Server (results)
[05:56:18]     (171.64.65.54:8080)
[05:56:18] + Retrying using alternative port
[05:56:18] Connecting to http://171.64.65.54:80/
[05:56:37] Completed 105000 out of 750000 steps  (14%)
[05:56:39] - Couldn't send HTTP request to server
[05:56:39] + Could not connect to Work Server (results)
[05:56:39]     (171.64.65.54:80)
[05:56:39] - Error: Could not transmit unit 02 (completed June 30) to work server.
[05:56:39] - 5 failed uploads of this unit.


[05:56:39] + Attempting to send results [June 30 05:56:39 UTC]
[05:56:39] - Reading file work/wuresults_02.dat from core
[05:56:39]   (Read 3795130 bytes from disk)
[05:56:39] Connecting to http://171.67.108.25:8080/
[05:56:40] - Couldn't send HTTP request to server
[05:56:40] + Could not connect to Work Server (results)
[05:56:40]     (171.67.108.25:8080)
[05:56:40] + Retrying using alternative port
[05:56:40] Connecting to http://171.67.108.25:80/
[05:56:42] - Couldn't send HTTP request to server
[05:56:42] + Could not connect to Work Server (results)
[05:56:42]     (171.67.108.25:80)
[05:56:42]   Could not transmit unit 02 to Collection server; keeping in queue.
[05:56:42] + Sent 0 of 1 completed units to the server
[05:56:42] - Autosend completed
[05:59:53] Completed 112500 out of 750000 steps  (15%)

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 10:32 am
by Grimsrud
I have the same problem since this morning, can not send results from client.

Code: Select all

[10:23:32] + Attempting to send results [June 30 10:23:32 UTC]
[10:23:32] Core found.
[10:23:32] Working on queue slot 07 [June 30 10:23:32 UTC]
[10:23:32] + Working ...
[10:23:33]
[10:23:33] *------------------------------*
[10:23:33] Folding@Home Gromacs SMP Core
[10:23:33] Version 2.27 (Dec. 15, 2010)
[10:23:33]
[10:23:33] Preparing to commence simulation
[10:23:33] - Ensuring status. Please wait.
[10:23:42] - Looking at optimizations...
[10:23:42] - Working with standard loops on this execution.
[10:23:42] - Previous termination of core was improper.
[10:23:42] - Going to use standard loops.
[10:23:42] - Files status OK
[10:23:43] - Expanded 1768557 -> 1957708 (decompressed 110.6 percent)
[10:23:43] Called DecompressByteArray: compressed_data_size=1768557 data_size=1957708, decompressed_data_size=1957708 diff=0
[10:23:43] - Digital signature verified
[10:23:43]
[10:23:43] Project: 6970 (Run 0, Clone 86, Gen 101)
[10:23:43]
[10:23:43] Entering M.D.
[10:23:49] Using Gromacs checkpoints
[10:23:49] Mapping NT from 2 to 2
[10:23:49] Resuming from checkpoint
[10:23:49] Verified work/wudata_07.log
[10:23:49] Verified work/wudata_07.trr
[10:23:49] Verified work/wudata_07.edr
[10:23:49] Completed 155586 out of 500000 steps  (31%)
[10:23:53] - Couldn't send HTTP request to server
[10:23:53] + Could not connect to Work Server (results)
[10:23:53]     (171.64.65.54:8080)
[10:23:53] + Retrying using alternative port
[10:24:15] - Couldn't send HTTP request to server
[10:24:15] + Could not connect to Work Server (results)
[10:24:15]     (171.64.65.54:80)
[10:24:15] - Error: Could not transmit unit 06 (completed June 30) to work server.


[10:24:15] + Attempting to send results [June 30 10:24:15 UTC]
[10:24:28] - Couldn't send HTTP request to server
[10:24:28] + Could not connect to Work Server (results)
[10:24:28]     (171.67.108.25:8080)
[10:24:28] + Retrying using alternative port
[10:24:29] - Couldn't send HTTP request to server
[10:24:29] + Could not connect to Work Server (results)
[10:24:29]     (171.67.108.25:80)
[10:24:29]   Could not transmit unit 06 to Collection server; keeping in queue.

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 10:35 am
by ThunderRd
Confirmed here as well on one of my machines. Server needs a kick ;)

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 11:40 am
by kasson
The machine is currently unresponsive. We're working on it.

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 11:42 am
by Foxbat
My Mac Pro would like to send its last WU. The server is not responding:

Code: Select all

[11:27:19] Attempting to return result(s) to server...
[11:27:19] Trying to send all finished work units
[11:27:19] Project: 6059 (Run 0, Clone 97, Gen 394)


[11:27:19] + Attempting to send results [June 30 11:27:19 UTC]
[11:27:19] - Reading file work/wuresults_04.dat from core
[11:27:19]   (Read 3795333 bytes from disk)
[11:27:19] Connecting to http://171.64.65.54:8080/
[11:28:34] - Couldn't send HTTP request to server
[11:28:34] + Could not connect to Work Server (results)
[11:28:34]     (171.64.65.54:8080)
[11:28:34] + Retrying using alternative port
[11:28:34] Connecting to http://171.64.65.54:80/
[11:29:49] - Couldn't send HTTP request to server
[11:29:49] + Could not connect to Work Server (results)
[11:29:49]     (171.64.65.54:80)
[11:29:49] - Error: Could not transmit unit 04 (completed June 30) to work server.
[11:29:49] - 4 failed uploads of this unit.


[11:29:49] + Attempting to send results [June 30 11:29:49 UTC]
[11:29:49] - Reading file work/wuresults_04.dat from core
[11:29:49]   (Read 3795333 bytes from disk)
[11:29:49] Connecting to http://171.67.108.25:8080/
[11:29:49] - Couldn't send HTTP request to server
[11:29:49] + Could not connect to Work Server (results)
[11:29:49]     (171.67.108.25:8080)
[11:29:49] + Retrying using alternative port
[11:29:49] Connecting to http://171.67.108.25:80/
[11:29:49] - Couldn't send HTTP request to server
[11:29:49] + Could not connect to Work Server (results)
[11:29:49]     (171.67.108.25:80)
[11:29:49]   Could not transmit unit 04 to Collection server; keeping in queue.
[11:29:49] + Sent 0 of 1 completed units to the server
kasson wrote:The machine is currently unresponsive. We're working on it.
Sorry, kasson, I didn't see your post before I hit send on mine. Thanks! :D

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 11:49 am
by kasson
Unfortunately it's something I can't fix from here, so it will have to wait until someone wakes up and gets to work at Stanford. We'll try to get it up and running as soon as we can, though.

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 12:34 pm
by BonaSwirl
Same problem. Glad to hear it'll be sorted though! I was worried I'd managed to mess something up at my end!

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 5:55 pm
by TomJohnson
This Server is still down.

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 8:28 pm
by 314159
How late do those folks at Stanford sleep? :)
Still down.

Re: 171.64.65.54 - DOWN

Posted: Thu Jun 30, 2011 10:24 pm
by kasson
Still haven't heard a response from the networking folks, but one of my students noticed it was back up this afternoon. Server is accepting and assigning again. Thanks for your patience.