Page 28 of 28
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 11:37 am
by bollix47
171.64.65.71 - vspg11a gbowman standby Not Accept
Collection server is 108.26 which is still in Reject so the WU can't upload anywhere.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 12:45 pm
by noorman
bollix47 wrote:171.64.65.71 - vspg11a gbowman standby Not Accept
Collection server is 108.26 which is still in Reject so the WU can't upload anywhere.
.
passed on to Pande Group
.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 5:33 pm
by Cajun_Don
noorman wrote:bollix47 wrote:171.64.65.71 - vspg11a gbowman standby Not Accept
Collection server is 108.26 which is still in Reject so the WU can't upload anywhere.
.
passed on to Pande Group
.
Same issue here, tried 25 times now with no uploaded WU.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 5:46 pm
by ikerekes
noorman wrote:bollix47 wrote:171.64.65.71 - vspg11a gbowman standby Not Accept
Collection server is 108.26 which is still in Reject so the WU can't upload anywhere.
.
passed on to Pande Group
.
And what they are doing about it?
I have 2 gpu client which tries to send since yesterday 4pm the queue on both card already rolled over.
We were told to use passkey even in the gpu clients, but when I asked will it be counted in the 80% threshold, never got an answer only "not sure"
I am really reluctant to use the passkey on GPU clients when the server status is such horribly unreliable.
Prof. Pande called it the most serious major disaster.
I don't want to sound overly inpatient but this is going on for more than 3 months
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 6:09 pm
by Wrish
The 80% threshold has nothing to do with GPU work units.
If I see a lot of unsent work on a client, I'll copy the whole working folder + client to the desktop, then delete the wuresults.dat files of the affected queue slots in the original location, and script a command to -send all periodically. That way the active GPU client doesn't stall at the end of every work unit.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 6:46 pm
by noorman
ikerekes wrote:noorman wrote:bollix47 wrote:171.64.65.71 - vspg11a gbowman standby Not Accept
Collection server is 108.26 which is still in Reject so the WU can't upload anywhere.
.
passed on to Pande Group
.
And what they are doing about it?
I have 2 gpu client which tries to send since yesterday 4pm the queue on both card already rolled over.
We were told to use passkey even in the gpu clients, but when I asked will it be counted in the 80% threshold, never got an answer only "not sure"
I am really reluctant to use the passkey on GPU clients when the server status is such horribly unreliable.
Prof. Pande called it the most serious major disaster.
I don't want to sound overly inpatient but this is going on for more than 3 months
.
171.64.65.71 is no GPU WS ...
CS6 has been down a long time because it was malfunctioning with its new software; that is being looked at, as far as I know.
Normally, the WS would accept an upload or CS4 or CS5 should accept it.
If you would have included the IP or server name, there could be a search for a cause.
A copy of a piece of log also helps a lot, certainly if the start of the WU is in it together with some important info on the project, etc..
( it also displays the Client type and version used )
171.67.108.21 was last in REJECT the day before yesterday, other GPU servers (WS) are not in Reject or Down (I just checked them in the stats)
I 've had a dip in my production on the day 171.67.108.21 was in REJECT, but everything is OK ever since (and the days before that) ...
.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 6:48 pm
by noorman
Cajun_Don wrote:noorman wrote:bollix47 wrote:171.64.65.71 - vspg11a gbowman standby Not Accept
Collection server is 108.26 which is still in Reject so the WU can't upload anywhere.
.
passed on to Pande Group
.
Same issue here, tried 25 times now with no uploaded WU.
.
Please check your log(s), and report if you wish, to see to which server the Client wants to upload the Results ...
It won't (normally) be CS6 (108.26) because it has been out of action for weeks now.
.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 6:52 pm
by VijayPande
noorman wrote:bollix47 wrote:171.64.65.71 - vspg11a gbowman standby Not Accept
Collection server is 108.26 which is still in Reject so the WU can't upload anywhere.
.
passed on to Pande Group
.
VSPG11a is accepting right now. The CS has been turned off until the code can be fixed. You do not want us to turn it on before then, because with the current code, the clients go to the CS, spend time uploading, but never get points that way. Joe has been working on a fix for this one.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 7:38 pm
by noorman
.
.
Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26
Posted: Thu Mar 11, 2010 10:20 pm
by Cajun_Don
Thanks for the information Prof. Vijay. I hope all the bugs will be resolved soon, for all clients and servers. Good luck.