Page 1 of 2

171.65.103.160, 171.64.65.98

Posted: Tue Nov 26, 2013 11:15 am
by widsss
11:12:09:WARNING:WU01:FS00:Failed to send results, will try again later
11:12:41:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
11:13:02:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.98:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

Re: 171.65.103.160, 171.64.65.98

Posted: Tue Nov 26, 2013 11:45 am
by PantherX
Server 171.65.103.160 appears to be only accepting WUs and not assigning them.

Server 171.64.65.98 appears to be down. I have informed the server owner about this.

Re: 171.65.103.160, 171.64.65.98

Posted: Tue Nov 26, 2013 2:05 pm
by widsss
Thank you! Seems like I've been having a lot of sending problems lately.

Re: 171.65.103.160, 171.64.65.98

Posted: Tue Nov 26, 2013 11:52 pm
by BWG
Same problem here with several folks on OCN. Mine is specific to the first server in the subject line. Do the server owners report back once they resolve the issues?

Re: 171.65.103.160, 171.64.65.98

Posted: Wed Nov 27, 2013 12:06 am
by bruce
Sometimes, but rarely.

Look at the data presented on http://fah-web.stanford.edu/pybeta/serverstat.html.
My guess is that 171.65.103.160 may be being drained in preparation to using it for new project(s).

What WU(s) have you completed that need to be uploaded?

Re: 171.65.103.160, 171.64.65.98

Posted: Wed Nov 27, 2013 3:22 am
by PantherX
Please note that Server 171.65.103.160 should be accepting completed WUs according to the Server page. If it isn't can you please post the log file showing this?

Server 171.64.65.98 is now fully functional.

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 12:01 am
by bison88
Currently down again, not accepting finished WU.

Code: Select all

23:42:35:WU00:FS00:0x17:Completed 1800000 out of 2000000 steps (90%)
23:44:09:WU00:FS00:0x17:Completed 1820000 out of 2000000 steps (91%)
23:45:33:WU00:FS00:0x17:Completed 1840000 out of 2000000 steps (92%)
23:47:07:WU00:FS00:0x17:Completed 1860000 out of 2000000 steps (93%)
23:48:31:WU00:FS00:0x17:Completed 1880000 out of 2000000 steps (94%)
23:49:55:WU00:FS00:0x17:Completed 1900000 out of 2000000 steps (95%)
23:51:29:WU00:FS00:0x17:Completed 1920000 out of 2000000 steps (96%)
23:52:53:WU00:FS00:0x17:Completed 1940000 out of 2000000 steps (97%)
23:54:28:WU00:FS00:0x17:Completed 1960000 out of 2000000 steps (98%)
23:55:52:WU00:FS00:0x17:Completed 1980000 out of 2000000 steps (99%)
23:57:15:WU00:FS00:0x17:Completed 2000000 out of 2000000 steps (100%)
23:57:26:WU00:FS00:0x17:Saving result file logfile_01.txt
23:57:26:WU00:FS00:0x17:Saving result file checkpointState.xml
23:57:27:WU00:FS00:0x17:Saving result file checkpt.crc
23:57:27:WU00:FS00:0x17:Saving result file log.txt
23:57:27:WU00:FS00:0x17:Saving result file positions.xtc
23:57:28:WU00:FS00:0x17:Folding@home Core Shutdown: FINISHED_UNIT
23:57:28:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
23:57:28:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:7810 run:0 clone:429 gen:461 core:0x17 unit:0x000001e40a3b1e8651d34a80a9345572
23:57:28:WU00:FS00:Uploading 5.75MiB to 171.64.65.98
23:57:28:WU00:FS00:Connecting to 171.64.65.98:8080
23:57:49:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
23:57:49:WU00:FS00:Connecting to 171.64.65.98:80
23:58:11:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.98:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
23:58:11:WU00:FS00:Trying to send results to collection server
23:58:11:WU00:FS00:Uploading 5.75MiB to 171.65.103.160
23:58:11:WU00:FS00:Connecting to 171.65.103.160:8080
23:58:17:WU00:FS00:Upload 27.15%
23:58:23:WU00:FS00:Upload 53.22%
23:58:29:WU00:FS00:Upload 79.29%
23:58:33:WU00:FS00:Upload complete
23:58:33:WU00:FS00:Server responded PLEASE_WAIT (464)
23:58:33:WARNING:WU00:FS00:Failed to send results, will try again later
23:58:34:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:7810 run:0 clone:429 gen:461 core:0x17 unit:0x000001e40a3b1e8651d34a80a9345572
23:58:34:WU00:FS00:Uploading 5.75MiB to 171.64.65.98
23:58:34:WU00:FS00:Connecting to 171.64.65.98:8080
23:58:55:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
23:58:55:WU00:FS00:Connecting to 171.64.65.98:80
23:59:16:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.98:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
23:59:16:WU00:FS00:Trying to send results to collection server
23:59:16:WU00:FS00:Uploading 5.75MiB to 171.65.103.160
23:59:16:WU00:FS00:Connecting to 171.65.103.160:8080
23:59:22:WU00:FS00:Upload 24.98%
23:59:28:WU00:FS00:Upload 51.05%
23:59:34:WU00:FS00:Upload 78.20%
23:59:39:WU00:FS00:Upload complete
23:59:39:WU00:FS00:Server responded PLEASE_WAIT (464)
23:59:39:WARNING:WU00:FS00:Failed to send results, will try again later
23:59:42:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:7810 run:0 clone:429 gen:461 core:0x17 unit:0x000001e40a3b1e8651d34a80a9345572
23:59:42:WU00:FS00:Uploading 5.75MiB to 171.64.65.98
23:59:42:WU00:FS00:Connecting to 171.64.65.98:8080
00:00:03:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
00:00:03:WU00:FS00:Connecting to 171.64.65.98:80
00:00:24:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.98:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
00:00:24:WU00:FS00:Trying to send results to collection server
00:00:24:WU00:FS00:Uploading 5.75MiB to 171.65.103.160
00:00:24:WU00:FS00:Connecting to 171.65.103.160:8080

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 1:08 am
by bruce
171.64.65.98 is being rebooted. The cause of the server failures has not been determined (yet). I'm also trying to find out what's going on with 171.65.103.160.

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 1:52 am
by ALUCARDVPR
Thanks Bruce

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 3:05 am
by Joe_H
I had a WU successfully upload to 171.65.103.160 within the last few hours, so it is accepting some WU's. I don't know if this means the server is fixed. It does appear to be busy.

Code: Select all

01:05:59:WU01:FS01:Uploading 1.73MiB to 171.65.103.160
01:05:59:WU01:FS01:Connecting to 171.65.103.160:8080
01:06:05:WU01:FS01:Upload 28.97%
01:06:11:WU01:FS01:Upload 57.93%
01:06:17:WU01:FS01:Upload 83.28%
01:06:23:WU01:FS01:Upload complete
01:06:23:WU01:FS01:Server responded WORK_ACK (400)

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 3:17 am
by sco01
03:14:11:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:7811 run:0 clone:258 gen:544 core:0x17 unit:0x0000023b0a3b1e8651db487cae97e94e
03:14:11:WU00:FS00:Uploading 4.27MiB to 171.64.65.98
03:14:11:WU00:FS00:Connecting to 171.64.65.98:8080
03:14:32:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
03:14:32:WU00:FS00:Connecting to 171.64.65.98:80
03:15:14:WU00:FS00:Upload complete
03:15:14:WU00:FS00:Server responded PLEASE_WAIT (464)
03:15:14:WARNING:WU00:FS00:Failed to send results, will try again later

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 3:37 am
by widsss
Still failing here. 12 and 13 attempts for each GPU.

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 5:27 am
by beer
I do not kniw if this is related but I cannot send this WU to server 171.64.65.99

05:25:18:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:7808 run:2 clone:211 gen:77 core:0xa4 unit:0x000000670a3b1e874e30f4420009dca5
05:25:18:WU01:FS01:Uploading 4.05MiB to 171.64.65.99
05:25:18:WU01:FS01:Connecting to 171.64.65.99:8080
05:25:34:WU02:FS01:0xa3:Completed 25000 out of 500000 steps (5%)
05:25:39:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
05:25:39:WU01:FS01:Connecting to 171.64.65.99:80
05:26:00:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 171.64.65.99:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 7:16 am
by Gust4f
Failing here too.

Re: 171.65.103.160, 171.64.65.98

Posted: Thu Nov 28, 2013 7:35 am
by billford
beer wrote:I do not kniw if this is related but I cannot send this WU to server 171.64.65.99
Same here:

Code: Select all

04:01:48:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:7809 run:7 clone:139 gen:56 core:0xa4 unit:0x0000004d0a3b1e874e31145327206b2b
04:01:48:WU00:FS00:Uploading 4.13MiB to 171.64.65.99
04:01:48:WU00:FS00:Connecting to 171.64.65.99:8080
04:03:55:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
04:03:55:WU00:FS00:Connecting to 171.64.65.99:80
04:06:02:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.99:80: Connection timed out
First attempt was at about 02:40Z.