Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 6:14 pm
by Shadowtester
Still seems to be down

[18:00:41] + Attempting to get work packet
[18:00:41] - Will indicate memory of 2048 MB
[18:00:41] - Connecting to assignment server
[18:00:41] Connecting to http://assign.stanford.edu:8080/
[18:00:42] Posted data.
[18:00:42] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[18:00:42] + News From Folding@Home: Welcome to Folding@Home
[18:00:42] Loaded queue successfully.
[18:00:42] Connecting to http://171.64.65.56:8080/
[18:00:42] - Couldn't send HTTP request to server
[18:00:42]   (Got status 503)
[18:00:42] + Could not connect to Work Server
[18:00:42] - Attempt #9  to get work failed, and no other work to do.

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 6:42 pm
by MichaelO
Do you think if we post enough times someone will take a look at this server and at least give us some response? Why in the devil can Stanford not provide at least a semblance of failover? If this was a corporation someone would already have been fired.

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 7:27 pm
by Teddy
Same problem here: 2 machines idle, can't send or receive new work & it's stuck on server 56!

Teddy

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 7:38 pm
by BrokenWolf
I don't know about posting more when there is an error, but they will realize that if contributors cannot return the WU then the results they want to analyze could be delayed. I now have 4 clients getting 503 errors while trying to upload their WU's to this server, and I am sure that some of the other clients I have at work are erroring out as well. Luckily most of them have ~1 to 2 days until the final deadline, but other contributors may not.

Broken

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 7:51 pm
by Teddy
& you see that's the problem, if you are sailing close to the wind on deadlines & if the server is unavailable to upload, well you get the picture.. no credit for the work done through no fault of their own.
The GPU servers themselves are giving enough grief to contributors & now we have a Linux server playing up. Yesterday it was the ATI servers problem.
I can imagine that a lot of contributors will be reassessing their contributions to the project after this.

Not to mention the dive in points that a lot of teams are experiencing at the moment.... something not right there.

Teddy

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 8:01 pm
by stevew
After several hours and many attempts (14+), I did get another WU #4433. The last WU is still in the queue.

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 8:29 pm
by P5-133XL
Still not accepting WU's, nor is it giving me any ...

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 8:54 pm
by Sahkuhnder
stevew wrote: After several hours and many attempts (14+), I did get another WU #4433. The last WU is still in the queue.
Same here. Got a new 4433 to work on but the old 2668 WU has been trying to send for 17 hours now.

MichaelO wrote: ...If this was a corporation someone would already have been fired.
True, but if Stanford were a corporation it's doubtful that all of us would be as enthusiastic and persistent as we are in our efforts to contribute to the progress of the project. Let's hope they get things running again as soon as they can.

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 9:01 pm
by rexrzer
-------- FAHlog.txt tail of Unit at ~/Library/Folding@home --------
[20:13:28] Could not transmit unit 01 to Collection server; keeping in queue.
[20:13:28] - Preparing to get new work unit...
[20:13:28] + Attempting to get work packet
[20:13:28] - Connecting to assignment server
[20:13:28] - Successful: assigned to (171.64.122.82).
[20:13:28] + News From Folding@Home: Welcome to Folding@Home
[20:13:28] Loaded queue successfully.
[20:13:29] Project: 2605 (Run 6, Clone 288, Gen 93)


[20:13:29] + Attempting to send results [October 5 20:13:29 UTC]
[20:13:29] - Couldn't send HTTP request to server
[20:13:29] + Could not connect to Work Server (results)
[20:13:29] (171.64.65.56:8080)
[20:13:29] + Retrying using alternative port
[20:13:30] - Couldn't send HTTP request to server
[20:13:30] + Could not connect to Work Server (results)
[20:13:30] (171.64.65.56:80)
[20:13:30] - Error: Could not transmit unit 01 (completed October 5) to work server.


[20:13:30] + Attempting to send results [October 5 20:13:30 UTC]
[20:15:02] - Server does not have record of this unit. Will try again later.
[20:15:02] Could not transmit unit 01 to Collection server; keeping in queue.
[20:15:02] + Closed connections
[20:15:02]


Well guys and gals, I am far from humored by the lack of attention to this out-of-commission server system, both primary and backup not accepting anything! :x :x :!:

The above is the result from my FAST iMac, which usually completes an SMP in less than 20 hours (it has a 3.06 GHz C2Duo Xtreme CPU). It has completed 2 SMP's, and the GDMF server rejects the latest one with a "could not connect" response from both the primary and the secondary server, and again answers the 1st SMP, completed yesterday in the early evening, with "Server does not have record of this unit".

This is very, very troubling! :shock: :!:

WHY are Dr. Kasson and staff not responding this weekend? :?: :?: :?:

I have been through problems before with servers in my 3.5 years with the Stanford.edu FAH program, many of them in fact, and there's always been at least some kind of response from Dr. Kasson and his staff EVEN ON WEEKENDS, BEFORE THIS TIME THAT IS!! :!:

I just don't know what my FAST iMac is going to do when it finishes its 3rd SMP of the weekend later today. I don't know if it can continue folding if there are 3 SMP's in the queue and it's working on a 4th SMP. Is this even possible? Are my Prefs going to get corrupted, and then I'll have another issue in the Mac OS to boot? Who knows.... :?:

I am just extremely disappointed, that's the word that fits here. :roll: :( :?

My other 5 Intel iMacs are in the same boat as the FAST iMac here, by the way, I've just chosen to not post from them because what is the point if we can't even get a response from Dr. Kasson & staff? :?: :?:

I am rendered speechless by this whole episode, and that takes some doing because I'm usually very "Upbeat" about my Folding program, and the results I can get out of this little bevy of Macs I have on the program. I'm not tooting my own horn, but I am one of the key contributors on my small (44 active folders) team, and they are surely missing my results about now with no SMP's counted after 3PM on Saturday the 4th of October.

Somebody with a connection here, send Dr. Kasson a PM and let him know what is going on with his server, as ultimately the responsibility for this disaster rests with him and his staff. I am going to PM Paula myself, and I think others should follow suit and get something happening before we start LOSING our SMP points, or worse....

I've written enough at this point; I'd just be redundant by going further. Help me out, nice people! Start something happening, ok? We need this server FIXED and fixed right now, not when it just tanks belly up and we lose our accumulated points. If the server has had a hardware failure of some kind, please get it addressed and handled, sooner rather than later.

Have a nice evening, all you folders in this situation with me, and please PM somebody to get things FIXED around here! :!: :!: :!:

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 9:19 pm
by Teddy
You think that's bad? I just got a File I/O error @ 100%, checksum not what was expected, blah, blah, back to 0% for that little protein..

My suggestion is just to be patient & wait a little longer & someone will hopefully fix that server, I still have 2 machines here out of work as well. 2 unsent proteins (Gromacs 33 cored ones) to 42 that I have had for days, at least my GPU ones are happy for now..

Teddy

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 9:23 pm
by torswin
No offense, but it has only been a day. It's not like the world is going to end. They are probably going to fix it as soon as practically possible. I guess they probably don't have staff ready at all hours of the day.

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 9:36 pm
by Aardvark
For those that have feelings that Stanford has not acknowledged this problem, Paula did post this, in this thread, earlier today:
Re: 171.64.65.56 is in Reject status
by ppetrone on Sun Oct 05, 2008 2:12 am

Ok. Sorry about that.
I will take a look and if I cannot fix it I will contact Peter.

Paula
I think you can rest assured that PG/Stanford is aware of the problem. How good a handle they have on it is subject to serious question. What would Leland Stanford say about this mess???

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 10:54 pm
by MichaelO
Aardvark wrote: For those that have feelings that Stanford has not acknowledged this problem, Paula did post this, in this thread, earlier today:
Re: 171.64.65.56 is in Reject status
by ppetrone on Sun Oct 05, 2008 2:12 am

Ok. Sorry about that.
I will take a look and if I cannot fix it I will contact Peter.

Paula
I think you can rest assured that PG/Stanford is aware of the problem. How good a handle they have on it is subject to serious question. What would Leland Stanford say about this mess???
If they are aware of the problem then they are treating us like mushrooms and all that that implies !!!

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 11:00 pm
by toTOW
I've notified the person in charge of this server.

Re: 171.64.65.56 is in Reject status

Posted: Sun Oct 05, 2008 11:06 pm
by Shadowtester
I have had a smp client sitting for about 6 hours now waiting to upload and get a new wu. :(