GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Moderators: Site Moderators, FAHC Science Team

*hondo*
Posts: 102
Joined: Sat Mar 08, 2008 9:50 am
Hardware configuration: Geoforce 250 GTS Active for F@H

4 Core Intel not used at all for F@H
Location: England UK

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by *hondo* »

Same problem with at least 3 team members from team 51078, I'd normally complete between 5 & 10 WUs per day. My PC as normal was Folding yesterday @ 8:30 GMT I checked the # of completed WUs on the stats page I do know for a fact that I saw 2 WUs go but now approx 23 F@H hours later nothing has been added to my stats score total. Also for the last 12 F@H hours my PC hasn't been able to download a single WU.

Come on Stanford tell us. WHAT IS GOING ON?
noorman
Posts: 270
Joined: Sun Dec 02, 2007 2:26 pm
Hardware configuration: Folders: Intel C2D E6550 @ 3.150 GHz + GPU XFX 9800GTX+ @ 765 MHZ w. WinXP-GPU
AMD A2X64 3800+ @ stock + GPU XFX 9800GTX+ @ 775 MHZ w. WinXP-GPU
Main rig: an old Athlon Barton 2500+ @2.25 GHz & 2* 512 MB RAM Apacer, Radeon 9800Pro, WinXP SP3+
Location: Belgium, near the International Sea-Port of Antwerp

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by noorman »

toTOW wrote:I've just sent an email to Vijay ... I hope well get more informations soon ...
.


I already sent him a PM about 171.67.108.21 almost 2 hours before you did ...

Nor even read yet !

.
- stopped Linux SMP w. HT on i7-860@3.5 GHz
....................................
Folded since 10-06-04 till 09-2010
Russ_64
Posts: 47
Joined: Wed Dec 05, 2007 4:31 pm
Hardware configuration: Dual Xeon E5645 (12C/24T) / 24Gb DDR3 - VMware ESXi 6.7.0
FAH v7.5.1
Location: London, UK

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Russ_64 »

Today (15th) is a holiday in USA, so don't expect a quick answer or solution.

I shutdown my clients yesterday and tried again earlier today, both my GPU's and SMP recieved new WU's today.....
ImageImageImage
Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Tobit »

I've had work for the past few hours but I'm still having the original "Server has already received unit" problem.
noorman
Posts: 270
Joined: Sun Dec 02, 2007 2:26 pm
Hardware configuration: Folders: Intel C2D E6550 @ 3.150 GHz + GPU XFX 9800GTX+ @ 765 MHZ w. WinXP-GPU
AMD A2X64 3800+ @ stock + GPU XFX 9800GTX+ @ 775 MHZ w. WinXP-GPU
Main rig: an old Athlon Barton 2500+ @2.25 GHz & 2* 512 MB RAM Apacer, Radeon 9800Pro, WinXP SP3+
Location: Belgium, near the International Sea-Port of Antwerp

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by noorman »

.

Just did another shutdown and restart of F@H and got a WU from server 171.64.65.71 in stead of 171.67.108.21 at which all former requests for work were directed ...

It 's a new P10105 jobby (first one for me)


.
- stopped Linux SMP w. HT on i7-860@3.5 GHz
....................................
Folded since 10-06-04 till 09-2010
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by VijayPande »

Thanks for the posts. It's early AM in Califorina (that's why this went unfixed for several hours), but I think we've got everything going again. I've contacted Joe regarding this issue: there was a WS bug.

I've also balanced the weights so the other NV WS's can get into the mix better and improve the redundancy.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
chriskwarren
Posts: 28
Joined: Sun Nov 30, 2008 2:13 am

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by chriskwarren »

Thanks Dr. Pande. Can you confirm that the "Server has already received unit" problem means that our WUs were accepted by the server and not wasted? From our end it looks like the server rejects our work, and our WU gets wasted.
Image
ikerekes
Posts: 94
Joined: Thu Nov 13, 2008 4:18 pm
Hardware configuration: q6600 @ 3.3Ghz windows xp-sp3 one SMP2 (2.15 core) + 1 9800GT native GPU2
Athlon x2 6000+ @ 3.0Ghz ubuntu 8.04 smp + asus 9600GSO gpu2 in wine wrapper
5600X2 @ 3.19Ghz ubuntu 8.04 smp + asus 9600GSO gpu2 in wine wrapper
E5200 @ 3.7Ghz ubuntu 8.04 smp2 + asus 9600GT silent gpu2 in wine wrapper
E5200 @ 3.65Ghz ubuntu 8.04 smp2 + asus 9600GSO gpu2 in wine wrapper
E6550 vmware ubuntu 8.4.1
q8400 @ 3.3Ghz windows xp-sp3 one SMP2 (2.15 core) + 1 9800GT native GPU2
Athlon II 620 @ 2.6 Ghz windows xp-sp3 one SMP2 (2.15 core) + 1 9800GT native GPU2
Location: Calgary, Canada

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by ikerekes »

Before I went to sleep last night, all of my GPU's were working (on units from 171.64.65.71). Woke up Today morning to a picture where all my GPU's are down, the assignment server reassigned all of them to 108.21 and it was dead in the water.
As of 9:31 PST I restarted every GPU's and they are all loaded 3 from 65.71, 3 from 108.11 and one from 108.21 :)

Hurray!!! Apparently the assignment server needed the biggest kick. (Valentine's day is over, for whoever did the kick doesn't have to feel bad)
Image
tonic
Posts: 42
Joined: Sat Aug 02, 2008 4:05 am
Location: Seattle, WA

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by tonic »

I just got WUs on all 5 of my clients...so perhaps the problem is fixed.
Image
Pette Broad
Posts: 128
Joined: Mon Dec 03, 2007 9:38 pm
Hardware configuration: CPU folding on only one machine a laptop

GPU Hardware..
3 x 460
1 X 260
4 X 250

+ 1 X 9800GT (3 days a week)
Location: Chester U.K

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Pette Broad »

Early days yet, but I've just uploaded 3 units (and got some more) :)

Pete
Image
noorman
Posts: 270
Joined: Sun Dec 02, 2007 2:26 pm
Hardware configuration: Folders: Intel C2D E6550 @ 3.150 GHz + GPU XFX 9800GTX+ @ 765 MHZ w. WinXP-GPU
AMD A2X64 3800+ @ stock + GPU XFX 9800GTX+ @ 775 MHZ w. WinXP-GPU
Main rig: an old Athlon Barton 2500+ @2.25 GHz & 2* 512 MB RAM Apacer, Radeon 9800Pro, WinXP SP3+
Location: Belgium, near the International Sea-Port of Antwerp

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by noorman »

.

The redistribution is the main thing that helped the NetLoad to come down drastically on 171.67.108.21 (I 've seen).
It was extremely high compared to 'normal' figures, it 's come down to the usual levels already !

I got a WU the minute I restarted my GPU-Client after the news from another member (living in the U.K.) that he 'd got a WU, just before.
I thought that the problem for me was the high network delays I watched when pinging the server.
Since that U.K.member is across the Pond from the U.S. too, I tried my luck again and got a P10105 straight away :D

A fellow Folder (from the W. of the U.S.) got some WU's, 1 about every 2 hrs ...
I knew the server wasn't completely down because it had responded to a check by webbrowser with 'OK' and it was pingable all of the time.
Because of that, I sent a PM to Vijay. I only forgot to calculate the time difference and I also didn't know about the Public Holiday.

By the way, Uploading was no problem for my GPU-Client; that was done already, it just couldn't get new Work.


.
- stopped Linux SMP w. HT on i7-860@3.5 GHz
....................................
Folded since 10-06-04 till 09-2010
Nathan_P
Posts: 1164
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 x5670@3.2 Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 E5-2665@2.3 Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Nathan_P »

chriskwarren wrote:Thanks Dr. Pande. Can you confirm that the "Server has already received unit" problem means that our WUs were accepted by the server and not wasted? From our end it looks like the server rejects our work, and our WU gets wasted.
Yes i'd like to know as well, are we going to have to refold all those wu or is there a way to force the upload, i have about a dozen that the server says it has already received
Image
Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Tobit »

Nathan_P wrote:Yes i'd like to know as well, are we going to have to refold all those wu or is there a way to force the upload, i have about a dozen that the server says it has already received
Unfortunately, there is nothing left to force. When the client receives the message that the server has already received the work unit, the slot in queue.dat the work was assigned to is "emptied". Some of us still have some wuresults.dat files. However, this problem had gone on for so long, many of mine were over written several times with newer work. The clients have only so many slots and once the slot is cleared, there is no way to send any lingering work files back to Stanford.
Nathan_P
Posts: 1164
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 x5670@3.2 Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 E5-2665@2.3 Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Nathan_P »

Tobit wrote:
Nathan_P wrote:Yes i'd like to know as well, are we going to have to refold all those wu or is there a way to force the upload, i have about a dozen that the server says it has already received
Unfortunately, there is nothing left to force. When the client receives the message that the server has already received the work unit, the slot in queue.dat the work was assigned to is "emptied". Some of us still have some wuresults.dat files. However, this problem had gone on for so long, many of mine were over written several times with newer work. The clients have only so many slots and once the slot is cleared, there is no way to send any lingering work files back to Stanford.
Thats a shame as i still have the files in my work directories. At least i'm folding again, and seeing my gtx 275 beaten by my gts 250 is indeed a sight to behold :lol:

I'd post a fahmon shot but i can't grab a screenie
Image
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by VijayPande »

Nathan_P wrote:
chriskwarren wrote:Thanks Dr. Pande. Can you confirm that the "Server has already received unit" problem means that our WUs were accepted by the server and not wasted? From our end it looks like the server rejects our work, and our WU gets wasted.
Yes i'd like to know as well, are we going to have to refold all those wu or is there a way to force the upload, i have about a dozen that the server says it has already received
It depends on the nature of the WS bug that's causing this, but I'm worried that these won't go back. I've escalated this bug to the highest level on our bug tracker and Joe's on it. I'll post more when we know more.

Note that, as far as we can tell so far, this is only an issue for people with multiple GPUs in the same box. If you're seeing it in some other case, please let us know.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
Post Reply