171.67.108.11 & 171.67.108.21 down

Moderators: Site Moderators, FAHC Science Team

toTOW
Site Moderator
Posts: 6349
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: 171.67.108.11 & 171.67.108.21 down

Post by toTOW »

herbak, look at the server status page, many of them have been down since Thursday ... :(

I also have two 9800 GTX+ sitting idle since then.

And I guess the issue is pretty serious since it's taking a lot of time to bring them back online :?

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Luscious
Posts: 49
Joined: Sat Oct 13, 2012 6:38 am

Re: 171.67.108.11 & 171.67.108.21 down

Post by Luscious »

toTOW wrote:herbak, look at the server status page, many of them have been down since Thursday ... :(

I also have two 9800 GTX+ sitting idle since then.

And I guess the issue is pretty serious since it's taking a lot of time to bring them back online :?
Similar problem here - both my GPUs have been dormant since Tuesday waiting on .21, and I've been watching the thread here since Wednesday. Sure takes a hit on my PPD with just a CPU folding.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 171.67.108.11 & 171.67.108.21 down

Post by bruce »

Unfortunately, I suspect that projects that have been put on End-Of-Life status probably don't get the same level of support as newer projects. :(
John_Weatherman
Posts: 289
Joined: Sun Dec 02, 2007 4:31 am
Location: Carrizo Plain National Monument, California

Re: 171.67.108.11 & 171.67.108.21 down

Post by John_Weatherman »

So the tired, poor, huddled masses with old PCs can wait at the back of the line ...
Nick200
Posts: 86
Joined: Fri Jun 20, 2014 6:40 am
Location: New Zealand

Re: 171.67.108.11 & 171.67.108.21 down

Post by Nick200 »

Kia ora, any update on the 171.67.108.201 server, as it still has no work units available?

I have tried altering my configuration by adding client-type=advanced to the GPU slot, but that was not enough to shift it to another server. Any ideas on what changes would work - or PG's time-frames for fixing the server?

Nga mihi, Nick
herbak
Posts: 6
Joined: Fri Mar 12, 2010 10:09 am
Hardware configuration: Pentium III, 384 MB RAM, Linux Xubuntu 10.04

Re: 171.67.108.11 & 171.67.108.21 down

Post by herbak »

toTOW wrote:herbak, look at the server status page, many of them have been down since Thursday ... :(
Sure, have done/did do. It still doesn't tell me if WU servers for GPU Core 11 are down or not, AFAICS. Maybe I've missed something?

Anyhow, the issue seems to be well-flagged by now, I think, and I guess I'll find out in due time whether the Core 11 WUs themselves are End-Of-Life - or just the servers the WUs were being distributed from ;-)
I'll just sit tight and wait for news, can live with that.
bollix47
Posts: 2957
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: 171.67.108.11 & 171.67.108.21 down

Post by bollix47 »

171.67.108.11 vsp07v vvoelz GPU full DOWN
171.67.108.21 vsp07b vvoelz GPU full DOWN
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: 171.67.108.11 & 171.67.108.21 down

Post by VijayPande »

Sorry for the delay here. The sysadmins started work on this on Friday but it didn't come back up. They are working on it today. We don't have 24x7 sysadmin support.

BTW, in the new infrastructure we are building, we have a striped architecture such that all servers will serve all projects. In that case, if one server goes down, it doesn't impact what WUs will get served. That will be a huge boon to our ability to run 24x7 with the limited sysadmin support we have. Right now, that new infrastructure is still in the testing phases, but we're hoping to roll it out soon.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
herbak
Posts: 6
Joined: Fri Mar 12, 2010 10:09 am
Hardware configuration: Pentium III, 384 MB RAM, Linux Xubuntu 10.04

Re: 171.67.108.11 & 171.67.108.21 down

Post by herbak »

bollix47 wrote:171.67.108.11 vsp07v vvoelz GPU full DOWN
171.67.108.21 vsp07b vvoelz GPU full DOWN
Right. Nothing new there and I still don't see anything about Core 11.
Should I be able to telepathically intuit that these are serving (or not, as the case may be) Core 11 WUs? ("GPU": it doesn't mean Core 11, it just means, well, GPU)
bollix47
Posts: 2957
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: 171.67.108.11 & 171.67.108.21 down

Post by bollix47 »

Since both core 11 work servers are DOWN, none of their projects are listed in the Project Summary ... if they were listed, you would see that they came from the work servers in this thread's title. One way to find out which server(s) is/are involved is to look at your previous logs to determine which one(s) your work was coming from. Or you could just trust that the OP already knew which servers give out core 11 work and that's why they're in the title.

The Server Status page does not show the actual core of the projects a particular server is serving. There can actually be different projects coming from the same server using different cores. The Project Summary shows both the core and the server a particular project is working with, but that's not useful in this case because both core 11 work servers are down and the PS is dynamic: it gets its current info from the working servers. The previous logs, assuming they exist, are the easiest way to determine where the work is coming from when this situation arises.
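
For anyone who wants to automate the log check described above, here is a minimal Python sketch. It rests on an assumption: that the client log contains lines mentioning "work server" followed by an IP address. The exact wording may differ between client versions, so adjust the pattern to match your own logs. The function name and the sample lines are hypothetical and made up for illustration; they are not taken from a real log.

```python
import re
from collections import Counter

# ASSUMPTION: log lines name the server as "work server <IPv4 address>".
# Adjust this pattern if your client words it differently.
WORK_SERVER = re.compile(r"work server\s+(\d{1,3}(?:\.\d{1,3}){3})", re.IGNORECASE)


def count_work_servers(lines):
    """Count how often each work-server IP appears in the given log lines."""
    counts = Counter()
    for line in lines:
        match = WORK_SERVER.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical sample lines, invented for this sketch.
    sample = [
        "12:00:01:WU00:FS01:Assigned to work server 171.67.108.11",
        "12:00:02:WU00:FS01:Downloading core",
        "18:30:10:WU01:FS01:Assigned to work server 171.67.108.21",
        "23:45:00:WU00:FS01:Assigned to work server 171.67.108.11",
    ]
    for ip, n in count_work_servers(sample).most_common():
        print(ip, n)
```

To run it against a real log, pass an open file instead of the sample list, e.g. `count_work_servers(open("log.txt"))`, where `log.txt` stands in for wherever your client keeps its log.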

From what Prof. Pande has said above this will change in the future.
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: 171.67.108.11 & 171.67.108.21 down

Post by VijayPande »

Looks like these servers should be back up now.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
Luscious
Posts: 49
Joined: Sat Oct 13, 2012 6:38 am

Re: 171.67.108.11 & 171.67.108.21 down

Post by Luscious »

VijayPande wrote:Looks like these servers should be back up now.
Confirmed: both of my G94 GPUs successfully connected to .11 and are folding now.

Curious - what was the issue?
Nick200
Posts: 86
Joined: Fri Jun 20, 2014 6:40 am
Location: New Zealand

Re: 171.67.108.11 & 171.67.108.21 down

Post by Nick200 »

Likewise, but I needed to quit FAH, restart it - and, when that wouldn't work, do a full power-off/power-on cycle before it would download a new WU. That aside, now happily folding.
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: 171.67.108.11 & 171.67.108.21 down

Post by VijayPande »

These servers are a bit old and their fs had some issues that needed fsck-ing to clean up.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
Calibrator
Posts: 17
Joined: Tue Aug 26, 2008 12:50 am
Hardware configuration: Intel i7-7820X 3.6 GHz
Nvidia GTX 1080 Ti 11 GB
Nvidia GTX 1080 Ti 11 GB (in SLI)
64 GB 3200 MHz Corsair Dominator Platinum
Liquid-cooled CPU and GPUs
Location: Indiana, USA

Re: 171.67.108.11 & 171.67.108.21 down

Post by Calibrator »

It's an honor, Dr. Pande ("The Father of Folding at Home"), to have you personally address these issues here. You are a dedicated professional!
Folding 24/7/365 since 2008.