171.64.65.56 not responding

Moderators: Site Moderators, FAHC Science Team

ArVee
Posts: 121
Joined: Sun Dec 02, 2007 9:25 am

171.64.65.56

Post by ArVee »

171.64.65.56 is in Reject mode and the CS is failing to accept as well. Again.
314159
Posts: 232
Joined: Sun Dec 02, 2007 2:46 am
Location: http://www.teammacosx.org/

Re: 171.64.65.56 is in Reject status

Post by 314159 »

Rejecting again. :roll:

Any troubleshooters awake? :wink:

Thanks,

John
John (from the central part of the Commonwealth of Virginia, U.S.A.)

A friendly visitor to what hopefully will remain a friendly Forum.
With thanks to all of the dedicated volunteers on the staff here!!
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: 171.64.65.56 is in Reject status

Post by kasson »

Restarting the server code now...
ppetrone
Pande Group Member
Posts: 115
Joined: Wed Dec 12, 2007 6:20 pm
Location: Stanford

Re: 171.64.65.56

Post by ppetrone »

It's up now. Thanks for the report.

Paula
314159
Posts: 232
Joined: Sun Dec 02, 2007 2:46 am
Location: http://www.teammacosx.org/

Re: 171.64.65.56 is in Reject status

Post by 314159 »

Thanks for the quick resolution. :)
This server has been "behaving" fairly well recently.

Serverstats' DL column looks a bit thin.
Should a Linux/OSX folder be concerned? :?

John
Foxbat
Posts: 94
Joined: Wed Dec 05, 2007 10:23 pm
Hardware configuration: Apple Mac Pro 1,1 2x2.66 GHz Dual-Core Xeon w/10 GB RAM | EVGA GTX 960, Zotac GTX 750 Ti | Ubuntu 14.04 LTS
Dell Precision T7400 2x3.0 GHz Quad-Core Xeon w/16 GB RAM | Zotac GTX 970 | Ubuntu 14.04 LTS
Apple iMac Retina 5K 4.00 GHz Core i7 w/8 GB RAM | OS X 10.11.3 (El Capitan)
Location: Michiana, USA

Re: 171.64.65.56 is in Reject status

Post by Foxbat »

My Mac Mini stalled waiting on 171.64.65.56 this evening. It took about 30 minutes before it downloaded another WU. Must have been rush hour; the server status showed Accepting, but I got reject about a dozen times. I try not to take it personally... ;)
Ragnar Dan
Posts: 52
Joined: Fri Dec 07, 2007 3:21 am
Location: U.S. (TechReport.com's Team 2630)

Re: 171.64.65.56 is in Reject status

Post by Ragnar Dan »

This server, and all the servers my machines use, are obviously not behaving properly. Either add more servers or quit allowing new downloads of the client so you're not so overwhelmed.
susato
Site Moderator
Posts: 511
Joined: Fri Nov 30, 2007 4:57 am
Location: Team MacOSX

Re: 171.64.65.56 is in Reject status

Post by susato »

Peace, Ragnar. Balky servers are just as troublesome for the PandeGroup scientists as they are for us donors.

Got a question though, Paula and Peter - for the past week or so the "WU To Go" and "WU Available" numbers on this server have hovered around 100 or less except for some VERY short intervals (for two hours there were between 100 and 300 WU available, and for another hour around 1600 WU were available).

In the past a low number of "WU Available" on a server meant a shortage of work units for the machines provisioned by that server.
Is this still true or has the WU supply behind the work servers been adjusted so that units will continue to flow freely to donors even if the numbers available on the work server are low?

TIA for your answers.
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: 171.64.65.56 is in Reject status

Post by kasson »

We had a problem with the server code on that machine that was causing it to "leak" jobs--jobs that should be available for assignment were being marked as not available. We've fixed the code, but we need to reclaim more of the jobs. We're hoping to improve the situation.
susato
Site Moderator
Posts: 511
Joined: Fri Nov 30, 2007 4:57 am
Location: Team MacOSX

Re: 171.64.65.56 is in Reject status

Post by susato »

Thanks, Peter. It's now 12 days later and the server is still low on WU. Over the last week my three older Linux dual-core machines have been unable to get any work at all from this server. They are folding WU from the .65.64 server, which is designed for quad-core machines, at an average of 1.67x minimum speed. Similar reports are coming in on team forum pages from other donors whose Mac minis, Linux servers and Mac laptops are also struggling to finish these units on time.

All donors know by heart that the PG needs work units returned promptly in order to keep the average generation time down and move the research forward swiftly. The original assignment-server logic diverting dual-core machines to the .65.64 server was supposed to be a stopgap to keep duallies in work during brief upsets of the .65.56 server. Three weeks is not brief. Is this still a server problem or is there a shortage of duallie work units?

The serverstats page also indicates that very few work units are returning to 65.56 -- this has to be related to the server's failure to distribute WU in the first place, because dual core machines are out there ready to fold them.

Looking forward to an update on this situation. Thanks.
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: 171.64.65.56 is in Reject status

Post by kasson »

Thanks for the ping. We currently have 3644 jobs available on vspg4; hopefully that will help somewhat.
susato
Site Moderator
Posts: 511
Joined: Fri Nov 30, 2007 4:57 am
Location: Team MacOSX

Re: 171.64.65.56 is in Reject status

Post by susato »

Helps? Definitely! Those jobs are quickly being snapped up - a moment ago there were only 2136. At this rate the Minis will be hungry again by noon Thursday.
It can't be easy keeping up with the demand for all kinds of work units around the clock. Thanks for keeping us "provisioned".
Aardvark
Posts: 143
Joined: Sat Jul 12, 2008 4:22 pm
Location: Team MacResource

Re: 171.64.65.56 is in Reject status

Post by Aardvark »

Server will not accept WU and CS is not accepting either. Serverstats indicate the 171.64.65.56 is REJECT. Am sending SOS...
What is past is prologue!
AgrFan
Posts: 63
Joined: Sat Mar 15, 2008 8:07 pm

Re: 171.64.65.56 is in Reject status

Post by AgrFan »

171.64.65.56 has been having issues for almost a month now. It looks to have run out of disk space. I noticed the DL column on the server stat page was showing a zero last night.

Peter, any ETA on when this server will be functioning normally again?

Code:

[15:01:07] + Attempting to send results
[15:01:07] - Reading file work/wuresults_05.dat from core
[15:01:07]   (Read 26035859 bytes from disk)
[15:01:07] Connecting to http://171.64.65.56:8080/
[15:01:07] - Couldn't send HTTP request to server
[15:01:07] + Could not connect to Work Server (results)
[15:01:07]     (171.64.65.56:8080)
[15:01:07] - Error: Could not transmit unit 05 (completed November 16) to work server.
[15:01:07] - 4 failed uploads of this unit.


[15:01:07] + Attempting to send results
[15:01:07] - Reading file work/wuresults_05.dat from core
[15:01:07]   (Read 26035859 bytes from disk)
[15:01:07] Connecting to http://171.67.108.25:8080/
[15:01:08] - Couldn't send HTTP request to server
[15:01:08] + Could not connect to Work Server (results)
[15:01:08]     (171.67.108.25:8080)
[15:01:08]   Could not transmit unit 05 to Collection server; keeping in queue.
[15:01:08] + Sent 0 of 1 completed units to the server
[15:01:08] - Autosend completed
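
For anyone who wants to tally these failures across a longer log, here is a minimal sketch in Python. It assumes the client log keeps the line shapes shown in the excerpt above ("Connecting to http://host:port/" followed by "Couldn't send HTTP request to server"). The function name `failed_connections` and the sample string are made up for illustration and are not part of the client.

```python
import re
from collections import Counter

# Patterns modelled on the log excerpt above; other client versions may differ.
CONNECT_RE = re.compile(r"^\[\d{2}:\d{2}:\d{2}\] Connecting to http://([\d.]+):(\d+)/")
FAIL_RE = re.compile(r"^\[\d{2}:\d{2}:\d{2}\] - Couldn't send HTTP request to server")


def failed_connections(log_text):
    """Count failed HTTP requests per server (host:port) in a client log."""
    failures = Counter()
    current = None  # server named by the most recent "Connecting to" line
    for raw_line in log_text.splitlines():
        line = raw_line.strip()
        match = CONNECT_RE.match(line)
        if match:
            current = f"{match.group(1)}:{match.group(2)}"
            continue
        if FAIL_RE.match(line) and current is not None:
            failures[current] += 1
            current = None  # wait for the next connection attempt
    return failures


# Hypothetical sample built from lines in the excerpt above.
SAMPLE_LOG = """\
[15:01:07] Connecting to http://171.64.65.56:8080/
[15:01:07] - Couldn't send HTTP request to server
[15:01:07] + Could not connect to Work Server (results)
[15:01:07] Connecting to http://171.67.108.25:8080/
[15:01:08] - Couldn't send HTTP request to server
[15:01:08] + Could not connect to Work Server (results)
"""

if __name__ == "__main__":
    for server, count in failed_connections(SAMPLE_LOG).items():
        print(server, count)
```

Run against a full log file, this would show at a glance whether both the work server and the collection server are refusing uploads, which is what the excerpt suggests here.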
noorman
Posts: 270
Joined: Sun Dec 02, 2007 2:26 pm
Hardware configuration: Folders: Intel C2D E6550 @ 3.150 GHz + GPU XFX 9800GTX+ @ 765 MHZ w. WinXP-GPU
AMD A2X64 3800+ @ stock + GPU XFX 9800GTX+ @ 775 MHZ w. WinXP-GPU
Main rig: an old Athlon Barton 2500+ @2.25 GHz & 2* 512 MB RAM Apacer, Radeon 9800Pro, WinXP SP3+
Location: Belgium, near the International Sea-Port of Antwerp

Re: 171.64.65.56 is in Reject status

Post by noorman »

Aardvark wrote:Server will not accept WU and CS is not accepting either. Serverstats indicate the 171.64.65.56 is REJECT. Am sending SOS...

SOS to whom?

I made a new thread on this and sent a PM to Kasson.
- stopped Linux SMP w. HT on i7-860@3.5 GHz
....................................
Folded since 10-06-04 till 09-2010