Page 1 of 1

171.67.108.26 server does not want my finished units

Posted: Fri Feb 05, 2010 9:14 pm
by Teddy
I have three machines with completed GPU units & none of them want to go home to this server.
I did a quick search on the server status page & could not see a listing for this server.

Can somebody see there way to having at look at this, I would hate my units to miss their deadline which is usually very short.

Teddy

Edit added more info

Code: Select all

[20:56:44] 
[20:56:44] - Couldn't send HTTP request to server
[20:56:44] + Could not connect to Work Server (results)
[20:56:44]     (171.67.108.21:8080)
[20:56:44] + Retrying using alternative port
[20:56:44] Connecting to http://171.67.108.21:80/
[20:56:44] Working on Protein
[20:56:45] Client config found, loading data.
[20:56:45] Starting GUI Server
[20:57:05] - Couldn't send HTTP request to server
[20:57:05] + Could not connect to Work Server (results)
[20:57:05]     (171.67.108.21:80)
[20:57:05] - Error: Could not transmit unit 01 (completed February 5) to work server.
[20:57:05] - 3 failed uploads of this unit.
[20:57:05] - Read packet limit of 540015616... Set to 524286976.


[20:57:05] + Attempting to send results [February 5 20:57:05 UTC]
[20:57:05] - Reading file work/wuresults_01.dat from core
[20:57:05]   (Read 166403 bytes from disk)
[20:57:05] Connecting to http://171.67.108.26:8080/
[20:57:23] Completed 1%
[20:58:04] Completed 2%
[20:58:43] Completed 3%
[20:59:20] Completed 4%
[20:59:54] Completed 5%
[21:00:29] Completed 6%
[21:01:09] Completed 7%
[21:01:46] Completed 8%
[21:01:51] - Couldn't send HTTP request to server
[21:01:51] + Could not connect to Work Server (results)
[21:01:51]     (171.67.108.26:8080)
[21:01:51] + Retrying using alternative port
[21:01:51] Connecting to http://171.67.108.26:80/
[21:01:51] - Couldn't send HTTP request to server
[21:01:51]   (Got status 503)
[21:01:51] + Could not connect to Work Server (results)
[21:01:51]     (171.67.108.26:80)
[21:01:51]   Could not transmit unit 01 to Collection server; keeping in queue.
[21:01:51] + Sent 0 of 1 completed units to the server
[21:01:51] - Autosend completed
[21:02:27] Completed 9%
[21:03:04] Completed 10%

Re: 171.67.108.26 server does not want my finished units

Posted: Fri Feb 05, 2010 9:32 pm
by PantherX
I checked the server status (link is on the top of the page) and according to it it is fully operational so in theory should be accepting WUs. In my case, i would simply restart the client and the WU would be successfully sent.
Hope this was useful.

Re: 171.67.108.26 server does not want my finished units

Posted: Fri Feb 05, 2010 9:53 pm
by brityank
It shows up in my three systems as the CS server to return to if the WU server is down or overloaded, but only shows on the Status Page as the CS for 171.67.108.20. If I open it through the browser (http://171.67.108.26:8080) it pops back an OK; with a blank port or an :80 it fails to connect.

With the main WU server being down, there is no way to get the valid WUs over to the CS except manually, and that will take a lot of time and work.

Is Stanford aware that these units are still down? I've seen nothing in the forum or in Dr. Pande's News blog since Wednesday.

Re: 171.67.108.26 server does not want my finished units

Posted: Fri Feb 05, 2010 10:00 pm
by brityank
Also, http://171.67.108.26:8080/ goes to vsp09a according to my tracert list - that system doesn't appear in the listings.

Re: 171.67.108.26 server does not want my finished units

Posted: Fri Feb 05, 2010 11:21 pm
by Teddy
I always try restarting the client which rarely works. No matter one of them has gone off on its own accord.
Still we have had a pretty good run with the servers for a while, problems were bound to strike as workloads for them increase.

Teddy

Re: 171.67.108.26 server does not want my finished units

Posted: Sat Feb 06, 2010 12:25 am
by Teddy
All sent back now thank-you all for your help!

CHeers Teddy