Page 1 of 1

130.237.232.141 and 130.237.232.237 offline

Posted: Sun Sep 04, 2011 7:48 pm
by kasson
servers 130.237.232.141 and 130.237.232.237 are currently offline. We're investigating. They're located in Sweden, so I'm guessing uptime will be no sooner than Monday morning CET. We'll do the best we can.

Thanks for your patience.

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 8:10 am
by Amaruk
Looks like the folks over in Sweden are working on it. Just switched two SR2s back over to bigadv and both received work from 130.237.232.237 - although the status page does show it in standby, not accepting. Hopefully it all gets sorted soon.

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 1:40 pm
by bollix47
Just sent one to .237 and it was accepted and the client was assigned another from the same server.

Server .141 however is still not accepting uploads. Same problem as:

viewtopic.php?p=194668#p194668

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 1:50 pm
by DrSpalding
I am still having the same issue as well on .141 as of now (06:50 PDT).

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 3:55 pm
by itsmekirill
Noon EST still can't upload. Bigadv unit completed over 24 hours ago now looks like it's going to expire.

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 4:14 pm
by rexrzer
I've been having this issue since yesterday afternoon (Sunday, 9/4/2011) and I'm afraid
that my Big Adv WU is going to be worth next to nothing, if, when, it finally gets uploaded.
I'm on the 7th failed upload of the WU above, and apparently the same server is supposed
to be working with the assignment server for Big Adv, and I'm getting nothing but A3 Core WU's since
this whole scene started happening.

I have two instances of FAHome to source on my PC No.1, and both come up with the same
A3 piddly WU's since the server started malfunctioning, or went offline, whatever. Here's
what the situation is like with one of those Clients:

Code: Select all

[15:42:05] - Ask before connecting: No
[15:42:05] - User name: rexrzer (Team 111065)
[15:42:05] - User ID: 2CCA92EE4AB7901C
[15:42:05] - Machine ID: 1
[15:42:05] 
[15:42:05] Loaded queue successfully.
[15:42:05] 
[15:42:05] - Autosending finished units... [September 5 15:42:05 UTC]
[15:42:05] + Processing work unit
[15:42:05] Trying to send all finished work units
[15:42:05] Core required: FahCore_a3.exe
[15:42:05] Project: 6900 (Run 83, Clone 3, Gen 2)
[15:42:05] Core found.


[15:42:05] Working on queue slot 04 [September 5 15:42:05 UTC]
[15:42:05] + Attempting to send results [September 5 15:42:05 UTC]
[15:42:05] + Working ...
[15:42:05] - Reading file work/wuresults_02.dat from core
[15:42:05] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 10 -checkpoint 15 -verbose -lifeline 1436 -version 634'

[15:42:05]   (Read 100152494 bytes from disk)
[15:42:05] Connecting to http://130.237.232.141:80/
[15:42:05] 
[15:42:05] *------------------------------*
[15:42:05] Folding@Home Gromacs SMP Core
[15:42:05] Version 2.27 (Dec. 15, 2010)
[15:42:05] 
[15:42:05] Preparing to commence simulation
[15:42:05] - Looking at optimizations...
[15:42:05] - Created dyn
[15:42:05] - Files status OK
[15:42:06] - Expanded 1765427 -> 2257001 (decompressed 127.8 percent)
[15:42:06] Called DecompressByteArray: compressed_data_size=1765427 data_size=2257001, decompressed_data_size=2257001 diff=0
[15:42:06] - Digital signature verified
[15:42:06] 
[15:42:06] Project: 6060 (Run 1, Clone 8, Gen 392)
[15:42:06] 
[15:42:06] Assembly optimizations on if available.
[15:42:06] Entering M.D.
[15:42:12] Mapping NT from 10 to 10 
[15:42:12] Completed 0 out of 500000 steps  (0%)
[15:42:26] - Couldn't send HTTP request to server
[15:42:28] + Could not connect to Work Server (results)
[15:42:28]     (130.237.232.141:80)
[15:42:28] + Retrying using alternative port
[15:42:28] Connecting to http://130.237.232.141:8080/
[15:42:49] - Couldn't send HTTP request to server
[15:42:49] + Could not connect to Work Server (results)
[15:42:49]     (130.237.232.141:8080)
[15:42:49] - Error: Could not transmit unit 02 (completed September 5) to work server.
[15:42:49] - 6 failed uploads of this unit.


[15:42:49] + Attempting to send results [September 5 15:42:49 UTC]
[15:42:49] - Reading file work/wuresults_02.dat from core
[15:42:49]   (Read 100152494 bytes from disk)
[15:42:49] Connecting to http://130.237.165.141:80/
[15:42:49] - Couldn't send HTTP request to server
[15:42:49] + Could not connect to Work Server (results)
[15:42:49]     (130.237.165.141:80)
[15:42:49] + Retrying using alternative port
[15:42:49] Connecting to http://130.237.165.141:8080/
[15:42:50] - Couldn't send HTTP request to server
[15:42:50] + Could not connect to Work Server (results)
[15:42:50]     (130.237.165.141:8080)
[15:42:50]   Could not transmit unit 02 to Collection server; keeping in queue.
[15:42:50] + Sent 0 of 1 completed units to the server
[15:42:50] - Autosend completed
[15:44:29] Completed 5000 out of 500000 steps  (1%)
I pray this gets resolved this AM or I'm going to either lose (expired) the WU above
or it will be worth something like 40K Points whenever, if, when, it gets uploaded finally. :(

Thanks for noting this irregularity Dr. Kasson, it's appreciated.

rexrzer 8-)

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 4:32 pm
by kasson
We've brought 130.237.232.237 back online; 141 should be coming online shortly. There may be a few more problems with 141 to debug, though.

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 5:50 pm
by bollix47
Thank you sir!

The WU finally uploaded after a client restart. :D

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 6:22 pm
by stevew
130.237.232.141 is back up. Thanks.

Code: Select all

[18:18:58] + Processing work unit
[18:18:58] Core required: FahCore_a5.exe
[18:18:58] Core found.
[18:18:58] Working on queue slot 01 [September 5 18:18:58 UTC]
[18:18:58] + Working ...
[18:18:58] - Calling '.\FahCore_a5.exe -dir work/ -nice 19 -suffix 01 -np 16 -nocpulock -checkpoint 30 -verbose -lifeline 8 -version 634'
[18:19:06] Project: 6900 (Run 45, Clone 0, Gen 36)

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Mon Sep 05, 2011 10:04 pm
by rexrzer
kasson wrote:We've brought 130.237.232.237 back online; 141 should be coming online shortly. There may be a few more problems with 141 to debug, though.
I don't know *why* it took all afternoon for my 6900 WU to get uploaded, but in any case
it just accomplished the task and got another 6900 WU assigned to it also, Woooot! :mrgreen: :mrgreen: :biggrin:

Thanks so much for being so on the spot and getting this resolved in record time on a
Holiday weekend here in the USA, Dr. Kasson. :!: :!: :D

All's well that ends well.

rexrzer 8-)

Re: 130.237.232.141 and 130.237.232.237 offline

Posted: Tue Sep 06, 2011 1:49 am
by augie
Give those Swedes some schnapps man! They deserve it.:)