Page 1 of 1

uploads failing

Posted: Thu Nov 06, 2014 2:34 pm
by D.G.Lang
I have ben getting the following message on THREE different machines this morning:
********************* Log Started 2014-11-06T02:54:11Z ***********************

08:42:53:WARNING:WU03:FS00:WorkServer connection failed on port 8080 trying 80
08:43:14:ERROR:WU03:FS00:Exception: Failed to connect to 171.64.65.124:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
08:43:35:WARNING:WU03:FS00:WorkServer connection failed on port 8080 trying 80
08:43:57:ERROR:WU03:FS00:Exception: Failed to connect to 171.64.65.124:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
08:45:15:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
08:45:36:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.124:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
******************************* Date: 2014-11-06 *******************************
12:16:43:WARNING:WU03:FS00:WorkServer connection failed on port 8080 trying 80

Re: uploads failing

Posted: Sat Nov 08, 2014 7:30 am
by wuffy68
Seeing this too over the last several days. Maybe server is too busy, or has some intermittency :

Code: Select all

******************************* Date: 2014-11-02 *******************************
08:23:55:WARNING:WU00:FS00:Exception: Failed to send results to work server: 10002: Received short response, expected 512 bytes, got 0
******************************* Date: 2014-11-02 *******************************
08:23:55:WARNING:WU00:FS00:Exception: Failed to send results to work server: 10002: Received short response, expected 512 bytes, got 0
******************************* Date: 2014-11-04 *******************************
17:09:11:WARNING:WU01:FS00:Exception: Failed to send results to work server: 10002: Received short response, expected 512 bytes, got 0
******************************* Date: 2014-11-06 *******************************
14:00:11:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
Heres a typical sequence (from the first job)....

Code: Select all

...
08:10:33:WU00:FS00:0x17:Completed 4900000 out of 5000000 steps (98%)
08:16:03:WU00:FS00:0x17:Completed 4950000 out of 5000000 steps (99%)
08:21:34:WU00:FS00:0x17:Completed 5000000 out of 5000000 steps (100%)
08:21:40:WU00:FS00:0x17:Saving result file logfile_01.txt
08:21:40:WU00:FS00:0x17:Saving result file checkpointState.xml
08:21:42:WU00:FS00:0x17:Saving result file checkpt.crc
08:21:42:WU00:FS00:0x17:Saving result file log.txt
08:21:42:WU00:FS00:0x17:Saving result file positions.xtc
08:21:46:WU00:FS00:0x17:Folding@home Core Shutdown: FINISHED_UNIT
08:21:46:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
08:21:46:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:9201 run:413 clone:4 gen:2 core:0x17 unit:0x000000076652edc45399e64e1ee04bb1
08:21:46:WU00:FS00:Uploading 8.44MiB to 171.67.108.52
08:21:46:WU00:FS00:Connecting to 171.67.108.52:8080
08:21:52:WU00:FS00:Upload 43.71%
08:21:58:WU00:FS00:Upload 68.89%
08:22:24:WU00:FS00:Upload 78.52%
08:23:55:WARNING:WU00:FS00:Exception: Failed to send results to work server: 10002: Received short response, expected 512 bytes, got 0
08:23:55:WU00:FS00:Trying to send results to collection server
08:23:55:WU00:FS00:Uploading 8.44MiB to 171.65.103.160
08:23:55:WU00:FS00:Connecting to 171.65.103.160:8080
08:24:01:WU00:FS00:Upload 51.11%
08:24:07:WU00:FS00:Upload complete
08:24:07:WU00:FS00:Server responded WORK_ACK (400)
08:24:07:WU00:FS00:Final credit estimate, 20547.00 points
08:24:07:WU00:FS00:Cleaning up

Re: uploads failing

Posted: Tue Nov 11, 2014 1:24 am
by bruce
At some point, that WU was already credited but apparently the confirmation message never reached the client so the client is repeatedly retrying.

Hi wuffy68 (team 224497),
Your WU (P9201 R413 C4 G2) was added to the stats database on 2014-11-02 01:06:23 for 22598.4 points of credit.

I must admit that the server message should give a clearer indication of what's wrong, but actually evaluating the various possibilities is somewhat complicated for the software to figure out what to tell you.

Re: uploads failing

Posted: Tue Nov 11, 2014 8:57 am
by wuffy68
bruce wrote:Your WU (P9201 R413 C4 G2) was added to the stats database on 2014-11-02 01:06:23 for 22598.4 points of credit.
Thank you - sounds like a network mis-communication ... I haven't seen it in the last 7 days. Looks good now.

Re: uploads failing

Posted: Tue Nov 11, 2014 6:19 pm
by bruce
If there's any way to figure out what happened plus how to avoid it in the future, it might help others, but it seems to happen rarely so don't worry about it.