Page 1 of 1
Completed WUs not accepted by FAH Servers
Posted: Thu Jul 22, 2010 5:38 pm
by art_l_j_PlanetAMD64
I have a problem, that completed WUs from my 20 CPU clients and 22 GPU clients are not being accepted by the Folding@Home Servers.
I was having some electrical work done at my home yesterday, so all of my systems were powered off for a couple of hours. Please note that as a result of this, my IP Address will have changed, as the dynamic IP address assignment from my ISP will be different, when compared to the IP address that existed when the WUs were sent to me. I mention this only because it might possibly have something to do with this problem, if the IP address is different when the results are returned, compared to when the WUs were sent out.
I have seen, that recently the stats shown on kakaostats.com for the team I belong to (PlanetAMD64), has enormous numbers for 2 team members (Viper666 and ArigornStrider), 28,800 points and 25,894 points respectively, for the most recent 'update' (ie the points accumulated for the last 3 hours).
Now, the 'normal' average for each of these 2 team members is (plu = 'points last update' (the last 3 hours)):
Viper666 = 1430 plu (based on about 80,000 points-per-week (ppw))
ArigornStrider = 1160 plu (based on about 65,000 points-per-week (ppw))
So, there appears to be something very strange going on with the FAH Servers that accept completed WUs, based on the above observations.
I currently have, by my own estimate, about 50K to 60K 'tied up' in completed WUs that are not being accepted by the FAH Servers.
I just checked the FAH 'News' link from the Home page, but I did not see anything about this problem.
Could someone please look into this 'problem' (for lack of a better word) and let us all know what is going on?
If you need any further information from me, please do not hesitate to ask me for it. I am at my Mother's place now, but I will be back at home tonight, to answer any questions that you may have, or to send you some additional information.
Thank You, and Best Regards,
Art
Re: Completed WUs not accepted by FAH Servers
Posted: Thu Jul 22, 2010 5:46 pm
by PantherX
Re: Completed WUs not accepted by FAH Servers
Posted: Thu Jul 22, 2010 6:07 pm
by art_l_j_PlanetAMD64
Thank you, for this information!
So, are you saying that there definitely are FAH server problems, or that this is just a possibility?
Thanks again, for your fast response to my message,
Art
Re: Completed WUs not accepted by FAH Servers
Posted: Thu Jul 22, 2010 6:57 pm
by uncle fuzzy
The first link from PantherX is done (I think), and the second is being worked on as I write.
Re: Completed WUs not accepted by FAH Servers
Posted: Thu Jul 22, 2010 7:12 pm
by 7im
And for reference, your IP address is not used for WU upload and download checks.
Re: Completed WUs not accepted by FAH Servers
Posted: Thu Jul 22, 2010 10:20 pm
by bruce
With regard to your teammates' jump in points, the stats for server 171.64.65.56 were off-line for about a day and the WUs uploaded during that day should have been credited but they were delayed and all appeared at the same time. You can't judge your own progress relative to that of others except over the long term. If you did not upload any WUs to that server during that particular day, it simply does not apply to you.
PantherX's second link refers to a separate problem which has been delaying the upload of WUs to two other servers. The server problem has also been fixed but it still may take some time for the WUs to upload, depending on how big the backlog has become. The credits should appear after the normal delays from the time the WUs are actually uploaded.
Whether this applies to you or not depends on which server your WUs need to be returned to. For more information, please read the
Do this first post in the
Issues with a specific server forum.
Re: Completed WUs not accepted by FAH Servers
Posted: Fri Jul 23, 2010 1:47 am
by art_l_j_PlanetAMD64
Thanks, everyone, and I promise to be patient, now that I know the details of what is going on!
Best regards,
Art
Re: Completed WUs not accepted by FAH Servers
Posted: Fri Jul 23, 2010 9:14 am
by Russ_64
I have also noticed that some clients fail to send results back automatically. For some like GPU3 it will continue to recieve new work and queue the completed work, for others like SMP it will just sit and wait for many hours until a manual restart or successful send.
I usually shutdown my clients in the evening and do a manual send using the -send xx flag and monitor network traffic - many times it will connect to server but stop sending within a few seconds, multiple tries will usually result in a successful send.
I know that this issue has already been raised in other threads as an area to improve in new clients.
Re: Completed WUs not accepted by FAH Servers
Posted: Mon Jul 26, 2010 12:45 pm
by art_l_j_PlanetAMD64
Here is a block from one of my CPU client's FAHlog.txt file, just this morning:
Code: Select all
[09:08:06] + Attempting to send results [July 26 09:08:06 UTC]
[09:11:12] - Couldn't send HTTP request to server
[09:11:12] + Could not connect to Work Server (results)
[09:11:12] (171.64.65.60:8080)
[09:11:12] + Retrying using alternative port
[09:11:33] - Couldn't send HTTP request to server
[09:11:33] + Could not connect to Work Server (results)
[09:11:33] (171.64.65.60:80)
[09:11:33] - Error: Could not transmit unit 01 (completed July 26) to work server.
[09:11:33] - Read packet limit of 540015616... Set to 524286976.
[09:11:33] + Attempting to send results [July 26 09:11:33 UTC]
[09:11:46] - Server does not have record of this unit. Will try again later.
[09:11:46] Could not transmit unit 01 to Collection server; keeping in queue.
[09:41:19] + Attempting to get work packet
[09:41:19] - Connecting to assignment server
[09:41:19] - Successful: assigned to (171.64.65.60).
[09:41:19] + News From Folding@Home: Welcome to Folding@Home
[09:41:19] Loaded queue successfully.
[09:41:41] - Couldn't send HTTP request to server
[09:41:41] + Could not connect to Work Server
[09:41:41] - Attempt #16 to get work failed, and no other work to do.
Waiting before retry.
[10:29:54] + Attempting to get work packet
[10:29:54] - Connecting to assignment server
[10:29:54] - Successful: assigned to (171.64.65.60).
[10:29:54] + News From Folding@Home: Welcome to Folding@Home
[10:29:55] Loaded queue successfully.
[10:30:16] - Couldn't send HTTP request to server
[10:30:16] + Could not connect to Work Server
[10:30:16] - Attempt #17 to get work failed, and no other work to do.
Waiting before retry.
[11:18:22] + Attempting to get work packet
[11:18:22] - Connecting to assignment server
[11:18:22] - Successful: assigned to (171.64.65.60).
[11:18:22] + News From Folding@Home: Welcome to Folding@Home
[11:18:23] Loaded queue successfully.
[11:21:29] + Could not connect to Work Server
[11:21:29] - Attempt #18 to get work failed, and no other work to do.
Waiting before retry.
[12:09:43] + Attempting to get work packet
[12:09:43] - Connecting to assignment server
[12:09:43] - Successful: assigned to (171.64.65.60).
[12:09:43] + News From Folding@Home: Welcome to Folding@Home
[12:09:43] Loaded queue successfully.
[12:10:04] - Couldn't send HTTP request to server
[12:10:04] + Could not connect to Work Server
[12:10:04] - Attempt #19 to get work failed, and no other work to do.
Waiting before retry.
You can see that it says:
Server does not have record of this unit. Will try again later.
This is the problem that I described in my first message in this Topic.
Now, I do understand that on the weekend, especially on Sunday, that systems are commonly taken offline for routine maintenance, repairs, upgrades, etc. So, I expect that I will frequently see these sorts of messages in my FAHlog.txt files on the weekend.
But this time, the 'outage' does seem to be much longer than what has been 'usual'.
I just checked the 'News' link from the FAH Home page, and there is no news about this 'problem' (if that is what it is, and it's not just a 'routine' weekend outage).
Just a suggestion, and I
am not making any kind of a criticism here, but it might save 'you' (the experts here) some time answering questions in this Forum, if someone Posted a short news article in the 'News' area, whenever there is a (relatively) minor problem like this.
Thanks again, and Best regards,
Art