Server - 171.64.65.106

Moderators: Site Moderators, FAHC Science Team

dedgar
Posts: 9
Joined: Mon Jan 21, 2008 12:36 am

Re: Server - 171.64.65.106

Post by dedgar »

My only two GPU's are trying to get work from the 106 server. One of them has been trying for more than an hour to get work.
My status logs tell me:

[18:13:17] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[18:13:17] + Couldn't get work instructions.

When I go to the server status page, that tells me that the 106 server is accepting. So what's up (or down)?
sbinh
Posts: 14
Joined: Mon Feb 04, 2008 4:28 am

Re: Server - 171.64.65.106

Post by sbinh »

9 out of 10 gpu clients are waiting for WUs .. the last one is at 91% complete .... When would this issue got fix?
shdbcamping
Posts: 81
Joined: Mon Nov 10, 2008 7:57 am
Hardware configuration: XPS 720 Q6600 9800GX2 3gig RAM
750W primary PSU 650W Aux VGA PSU

Re: Server - 171.64.65.106

Post by shdbcamping »

sbinh wrote:9 out of 10 gpu clients are waiting for WUs .. the last one is at 91% complete .... When would this issue got fix?
+++ lots of 1's here.
10 of 11 here, but my electric meter is still spinning waiting at "idle" :cry:
Plazzman
Posts: 28
Joined: Sat Jul 19, 2008 12:21 pm

Re: Server - 171.64.65.106

Post by Plazzman »

5 Client's here waiting for work....
Plazzman
Posts: 28
Joined: Sat Jul 19, 2008 12:21 pm

Re: Server - 171.64.65.106

Post by Plazzman »

Just received unit's for all 5 from 171.67.108.11...
MichaelO
Posts: 50
Joined: Tue Jan 01, 2008 8:59 pm

Re: Server - 171.64.65.106

Post by MichaelO »

My Nvidia clients have now all received work, however based on the performance this week, my guess is that there will be huge amonts of downtime this 3-day holiday weekend. :roll: :roll:
Image
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Server - 171.64.65.106

Post by bruce »

shdbcamping wrote:Can we get an ETA from someone with authrity at Pande on how long the "Backlog" will take to get cleared up?
Somebody can probably estimate how many downloads per hour that a server can handle but how do you propose they count the number of clients that need work? (Then, of course, somebody can also estimate how many of them will return a result and be back for more work in X hours and what happens to them.)
dempaSD
Posts: 17
Joined: Tue Nov 18, 2008 2:16 am
Location: Sweden and US

Re: Server - 171.64.65.106

Post by dempaSD »

Was working nicely for a few days until now:

Code: Select all

[01:34:14] - Preparing to get new work unit...
[01:34:14] + Attempting to get work packet
[01:34:14] - Will indicate memory of 2815 MB
[01:34:14] - Connecting to assignment server
[01:34:14] Connecting to http://assign-GPU.stanford.edu:8080/
[01:34:15] Posted data.
[01:34:15] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:34:15] + News From Folding@Home: Welcome to Folding@Home
[01:34:16] Loaded queue successfully.
[01:34:16] Connecting to http://171.64.65.106:8080/
[01:34:29] - Couldn't send HTTP request to server
[01:34:29] + Could not connect to Work Server
[01:34:29] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[01:34:48] + Attempting to get work packet
[01:34:48] - Will indicate memory of 2815 MB
[01:34:48] - Connecting to assignment server
[01:34:48] Connecting to http://assign-GPU.stanford.edu:8080/
[01:34:49] Posted data.
[01:34:49] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:34:49] + News From Folding@Home: Welcome to Folding@Home
[01:34:50] Loaded queue successfully.
[01:34:50] Connecting to http://171.64.65.106:8080/
[01:34:51] - Couldn't send HTTP request to server
[01:34:51] + Could not connect to Work Server
[01:34:51] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[01:35:11] + Attempting to get work packet
[01:35:11] - Will indicate memory of 2815 MB
[01:35:11] - Connecting to assignment server
[01:35:11] Connecting to http://assign-GPU.stanford.edu:8080/
[01:35:12] Posted data.
[01:35:12] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:35:12] + News From Folding@Home: Welcome to Folding@Home
[01:35:12] Loaded queue successfully.
[01:35:12] Connecting to http://171.64.65.106:8080/
[01:35:26] - Couldn't send HTTP request to server
[01:35:26] + Could not connect to Work Server
[01:35:26] - Attempt #3  to get work failed, and no other work to do.
Waiting before retry.
[01:35:53] + Attempting to get work packet
[01:35:53] - Will indicate memory of 2815 MB
[01:35:53] - Connecting to assignment server
[01:35:53] Connecting to http://assign-GPU.stanford.edu:8080/
[01:35:54] Posted data.
[01:35:54] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:35:54] + News From Folding@Home: Welcome to Folding@Home
[01:35:54] Loaded queue successfully.
[01:35:54] Connecting to http://171.64.65.106:8080/
[01:35:55] Posted data.
[01:35:55] Initial: 0000; + Could not connect to Work Server
[01:35:55] - Attempt #4  to get work failed, and no other work to do.
Waiting before retry.
[01:36:39] + Attempting to get work packet
[01:36:39] - Will indicate memory of 2815 MB
[01:36:39] - Connecting to assignment server
[01:36:39] Connecting to http://assign-GPU.stanford.edu:8080/
[01:36:40] Posted data.
[01:36:40] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:36:40] + News From Folding@Home: Welcome to Folding@Home
[01:36:40] Loaded queue successfully.
[01:36:40] Connecting to http://171.64.65.106:8080/
[01:36:41] Posted data.
[01:36:41] Initial: 0000; + Could not connect to Work Server
[01:36:41] - Attempt #5  to get work failed, and no other work to do.
Waiting before retry.
[01:38:03] + Attempting to get work packet
[01:38:03] - Will indicate memory of 2815 MB
[01:38:03] - Connecting to assignment server
[01:38:03] Connecting to http://assign-GPU.stanford.edu:8080/
[01:38:05] Posted data.
[01:38:05] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:38:05] + News From Folding@Home: Welcome to Folding@Home
[01:38:05] Loaded queue successfully.
[01:38:05] Connecting to http://171.64.65.106:8080/
[01:38:06] Posted data.
[01:38:06] Initial: 0000; + Could not connect to Work Server
[01:38:06] - Attempt #6  to get work failed, and no other work to do.
Waiting before retry.
EDIT: Luckily seems like it was a short one:

Code: Select all

[01:40:51] - Attempt #7  to get work failed, and no other work to do.
Waiting before retry.
[01:46:18] + Attempting to get work packet
[01:46:18] - Will indicate memory of 2815 MB
[01:46:18] - Connecting to assignment server
[01:46:18] Connecting to http://assign-GPU.stanford.edu:8080/
[01:46:19] Posted data.
[01:46:19] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:46:19] + News From Folding@Home: Welcome to Folding@Home
[01:46:19] Loaded queue successfully.
[01:46:19] Connecting to http://171.64.65.106:8080/
[01:46:20] Posted data.
[01:46:20] Initial: 0000; - Receiving payload (expected size: 66761)
[01:46:22] - Downloaded at ~32 kB/s
[01:46:22] - Averaged speed for that direction ~69 kB/s
[01:46:22] + Received work.
[01:46:22] Trying to send all finished work units
[01:46:22] + No unsent completed units remaining.
[01:46:22] + Closed connections
[01:46:22] 
[01:46:22] + Processing work unit
[01:46:22] Core required: FahCore_11.exe
[01:46:22] Core found.
[01:46:22] Working on queue slot 00 [May 26 01:46:22 UTC]
[01:46:22] + Working ...
[01:46:22] - Calling '.\FahCore_11.exe -dir work/ -suffix 00 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 4360 -version 623'
All goody now.
Image
vsekh
Posts: 12
Joined: Mon Mar 03, 2008 8:52 pm

Re: Server - 171.64.65.106

Post by vsekh »

The server has 118 work units available. It will most likely run out. I have several GPU clients trying to get a work unit from this server, but it is not able to get a work unit. It is now saying "No appropriate work server was available; will try again in a bit".
ihaque
Pande Group Member
Posts: 74
Joined: Mon Dec 03, 2007 4:20 am
Location: Stanford
Contact:

Re: Server - 171.64.65.106

Post by ihaque »

Yes, this server looks like it's running low. I'm working on bringing up another server with a couple new projects to handle the load. Continuing network issues are causing problems for GPU assigns.
vsekh
Posts: 12
Joined: Mon Mar 03, 2008 8:52 pm

Re: Server - 171.64.65.106

Post by vsekh »

ihaque wrote:Yes, this server looks like it's running low. I'm working on bringing up another server with a couple new projects to handle the load. Continuing network issues are causing problems for GPU assigns.
Thanks ihaque. One of my gpu clients just received a 5911 project from server 171.64.65.20.
Post Reply