The SMP servers are showing that they are in good shape, but I'm getting a lot of 503s. I have about 20 SMP WUs waiting to be uploaded, my machines connect, begin to upload, and then they just fade out and stop. I can and do connect to the servers in my browser. There isn't any problem with security software. This farm has been running for a year and a half and, other than the occasional hitch, has been fine. I have checked my ISP and can't seem to find anything there. Here is a sample:
Code: Select all
[02:19:06] Timered checkpoint triggered.
[02:19:06] - Autosending finished units... [December 17 02:19:06 UTC]
[02:19:06] Trying to send all finished work units
[02:19:06] Project: 3064 (Run 2, Clone 138, Gen 33)
[02:19:06] + Attempting to send results [December 17 02:19:06 UTC]
[02:19:06] - Reading file work/wuresults_00.dat from core
[02:19:06]   (Read 1846876 bytes from disk)
[02:19:06] Connecting to http://171.64.65.63:8080/
[02:19:28] Writing local files
[02:19:28] Completed 12500 out of 250000 steps  (5 percent)
[02:34:30] Timered checkpoint triggered.
[02:49:31] Timered checkpoint triggered.
[03:00:29] - Couldn't send HTTP request to server
[03:00:29] + Could not connect to Work Server (results)
[03:00:29]     (171.64.65.63:8080)
[03:00:29] + Retrying using alternative port
[03:00:29] Connecting to http://171.64.65.63:80/
[03:04:31] Timered checkpoint triggered.
[03:04:41] Writing local files
[03:04:41] Completed 15000 out of 250000 steps  (6 percent)
[03:19:42] Timered checkpoint triggered.
[03:34:43] Timered checkpoint triggered.
[03:49:43] Timered checkpoint triggered.
[03:49:52] Writing local files
[03:49:53] Completed 17500 out of 250000 steps  (7 percent)
[04:04:53] Timered checkpoint triggered.
[04:19:54] Timered checkpoint triggered.
[04:26:31] - Couldn't send HTTP request to server
[04:26:31] + Could not connect to Work Server (results)
[04:26:31]     (171.64.65.63:80)
[04:26:31] - Error: Could not transmit unit 00 (completed December 16) to work server.
[04:26:31] - 4 failed uploads of this unit.
[04:26:31] + Attempting to send results [December 17 04:26:31 UTC]
[04:26:31] - Reading file work/wuresults_00.dat from core
[04:26:31]   (Read 1846876 bytes from disk)
[04:26:31] Connecting to http://171.67.108.17:8080/
[04:29:01] - Couldn't send HTTP request to server
[04:29:01] + Could not connect to Work Server (results)
[04:29:01]     (171.67.108.17:8080)
[04:29:01] + Retrying using alternative port
[04:29:01] Connecting to http://171.67.108.17:80/
[04:29:22] - Couldn't send HTTP request to server
[04:29:22] + Could not connect to Work Server (results)
[04:29:22]     (171.67.108.17:80)
[04:29:22]   Could not transmit unit 00 to Collection server; keeping in queue.
[04:29:22] Project: 3064 (Run 2, Clone 138, Gen 33)
[04:29:22] + Attempting to send results [December 17 04:29:22 UTC]
[04:29:22] - Reading file work/wuresults_00.dat from core
[04:29:22]   (Read 1846876 bytes from disk)
[04:29:22] Connecting to http://171.64.65.63:8080/
[04:29:43] - Couldn't send HTTP request to server
[04:29:43] + Could not connect to Work Server (results)
[04:29:43]     (171.64.65.63:8080)
[04:29:43] + Retrying using alternative port
[04:29:43] Connecting to http://171.64.65.63:80/
[04:30:04] - Couldn't send HTTP request to server
[04:30:04] + Could not connect to Work Server (results)
[04:30:04]     (171.64.65.63:80)
[04:30:04] - Error: Could not transmit unit 00 (completed December 16) to work server.
[04:30:04] - 5 failed uploads of this unit.
[04:30:04] + Attempting to send results [December 17 04:30:04 UTC]
[04:30:04] - Reading file work/wuresults_00.dat from core
[04:30:04]   (Read 1846876 bytes from disk)
[04:30:04] Connecting to http://171.67.108.17:8080/
[04:34:54] Timered checkpoint triggered.
[04:35:02] Writing local files
[04:35:03] Completed 20000 out of 250000 steps  (8 percent)
[04:50:03] Timered checkpoint triggered.
[04:51:35] - Couldn't send HTTP request to server
[04:51:35] + Could not connect to Work Server (results)
[04:51:35]     (171.67.108.17:8080)
[04:51:35] + Retrying using alternative port
[04:51:35] Connecting to http://171.67.108.17:80/
[04:51:43] - Couldn't send HTTP request to server
[04:51:43] + Could not connect to Work Server (results)
[04:51:43]     (171.67.108.17:80)
[04:51:43]   Could not transmit unit 00 to Collection server; keeping in queue.
[04:51:43] + Sent 0 of 2 completed units to the server
[04:51:43] - Autosend completed
[05:05:04] Timered checkpoint triggered.
[05:20:05] Timered checkpoint triggered.