Page 2 of 7

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Sat Jun 21, 2014 4:47 pm
by toTOW
herbak, look at the server status page, many of them have been down since Thursday ... :(

I also have two 9800 GTX+ sitting idle since then.

And I guess the issue is pretty serious since it's taking a lot of time to bring them back online :?

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Sat Jun 21, 2014 7:30 pm
by Luscious
toTOW wrote:herbak, look at the server status page, many of them have been down since Thursday ... :(

I also have two 9800 GTX+ sitting idle since then.

And I guess the issue is pretty serious since it's taking a lot of time to bring them back online :?
Similar problem here - both my GPU's have been dormant since Tuesday waiting on .21, and I've been watching the thread here since Wednesday. Sure takes a hit on my PPD with just a CPU folding.

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Sun Jun 22, 2014 4:20 am
by bruce
Unfortunately I suspect that projects that have been put on End-Of-Life status probably don't the same level of support as newer projects. :(

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Sun Jun 22, 2014 6:11 am
by John_Weatherman
So the tired, poor, huddled masses with old PCs can wait at the back of the line ...

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Sun Jun 22, 2014 8:03 pm
by Nick200
Kia ora, any update on the 171.67.108.201 server, as it still has no work units available?

I have tried altering my configuration by adding client-type=advanced to the GPU slot, but that was not enough to shift it to another server. Any ideas on what changes would work - or PG's time-frames for fixing the server?

Nga mihi, Nick

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Sun Jun 22, 2014 9:39 pm
by herbak
toTOW wrote:herbak, look at the server status page, many of them have been down since Thursday ... :(
Sure, have done/did do. It still doesn't tell me if WU servers for GPU Core 11 are down or not, AFAICS. Maybe I've missed something?

Anyhow, the issue seems to be well-flagged by now, I think, and I guess I'll find out in due time whether the Core 11 WUs themselves are End-Of-Life - or just the servers the WUs were being distributed from ;-)
I'll just sit tight and wait for news, can live with that.

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Sun Jun 22, 2014 10:09 pm
by bollix47
171.67.108.11 vsp07v vvoelz GPU full DOWN
171.67.108.21 vsp07b vvoelz GPU full DOWN

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Mon Jun 23, 2014 4:44 pm
by VijayPande
Sorry for the delay here. The sysadmins started work on this on Friday but it didn't come back up. They are working on it today. We don't have 24x7 sysadmin support.

BTW, in the new infrastructure we are building, we have a striped architecture such that all servers will serve all projects. In that case, if one server goes down, it doesn't impact what WUs will get served. That will be a huge boon to our ability to run 24x7 with the limited sysadmin support we have. Right now, that new infrastructure is still in the testing phases, but we're hoping to roll it out soon.

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Mon Jun 23, 2014 9:32 pm
by herbak
bollix47 wrote:171.67.108.11 vsp07v vvoelz GPU full DOWN
171.67.108.21 vsp07b vvoelz GPU full DOWN
Right. Nothing new there and I still don't see anything about Core 11.
Should I be able to telepathically intuit that these are serving (or not, as the case may be) Core 11 WUs? ("GPU": it doesn't mean Core 11, it just means, well, GPU)

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Mon Jun 23, 2014 10:16 pm
by bollix47
Since both core 11 work servers are DOWN there are none of their projects listed in the Project Summary ... if they were listed you would see that they came from the work servers in this thread's title. One way to find out which server(s) is/are involved is to look at your previous logs to determine which one(s) your work was coming from or you could just trust the OP that they already knew which servers give out core 11 work and that's why they're in the title.

The Server Status page does not show the actual core of the projects a particular server is serving. There can actually be different projects coming from the same server using different cores. The Project Summary shows both the core and the server a particular project is working with but that's not useful in this case due to both core 11 work servers being down and the PS being dynamic in that it gets it's current info from the working servers. The previous logs, assuming they exist, are the easiest way to determine where the work is coming from when this situation arises.

From what Prof. Pande has said above this will change in the future.

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Wed Jun 25, 2014 12:59 am
by VijayPande
Looks like these servers should be back up now.

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Wed Jun 25, 2014 1:56 am
by Luscious
VijayPande wrote:Looks like these servers should be back up now.
Confirmed: both of my G94 GPU's successfully connected to .11 and are folding now.

Curious - what was the issue?

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Wed Jun 25, 2014 5:59 am
by Nick200
Likewise, but I needed to quit FAH, restart it - and, when that wouldn't work, do a full power-off/power-on cycle before it would download a new WU. That aside, now happily folding.

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Wed Jun 25, 2014 6:54 pm
by VijayPande
These servers are a bit old and their fs had some issues that needed fsck-ing to clean up.

Re: 171.67.108.11 & 171.67.108.21 down

Posted: Thu Jun 26, 2014 2:40 am
by Calibrator
It's an honor Dr. Pande ("The Father of Folding at Home") to have you personally address these issues here. You are a dedicated professional!