Page 1 of 1

Upload failed: WS 171.64.65.106 CS 171.67.108.25

Posted: Wed Dec 28, 2011 7:47 am
by daikerjohn
Currently dealing with 3 separate work queue items (from 3 different GPUs) that aren't uploading. Each of them are attempting to upload to work server 171.64.65.106, which makes me think that my data is good.

I see from the server stats page that 171.64.65.106 is currently accepting, but it's not reporting any CPU load or disk usage. Maybe it's hung?

Code: Select all

06:46:16:Unit 02:Completed 98%
06:49:13:Unit 02:Completed 99%
06:52:12:Unit 02:Completed 100%
06:52:13:Unit 02:Successful run
06:52:13:Unit 02:DynamicWrapper: Finished Work Unit: sleep=10000
06:52:23:Unit 02:Reserved 149632 bytes for xtc file; Cosm status=0
06:52:23:Unit 02:Allocated 149632 bytes for xtc file
06:52:23:Unit 02:- Reading up to 149632 from "02/wudata_01.xtc": Read 149632
06:52:23:Unit 02:Read 149632 bytes from xtc file; available packet space=786280832
06:52:23:Unit 02:xtc file hash check passed.
06:52:23:Unit 02:Reserved 22704 22704 786280832 bytes for arc file=<02/wudata_01.trr> Cosm status=0
06:52:23:Unit 02:Allocated 22704 bytes for arc file
06:52:23:Unit 02:- Reading up to 22704 from "02/wudata_01.trr": Read 22704
06:52:23:Unit 02:Read 22704 bytes from arc file; available packet space=786258128
06:52:23:Unit 02:trr file hash check passed.
06:52:23:Unit 02:Allocated 560 bytes for edr file
06:52:23:Unit 02:Read bedfile
06:52:23:Unit 02:edr file hash check passed.
06:52:23:Unit 02:Allocated 0 bytes for logfile
06:52:23:Unit 02:Could not open/read logfile=<02/wudata_01.log>; Cosm status=-1
06:52:23:Unit 02:GuardedRun: success in DynamicWrapper
06:52:23:Unit 02:GuardedRun: done
06:52:23:Unit 02:Run: GuardedRun completed.
06:52:24:Unit 02:+ Opened results file
06:52:24:Unit 02:- Writing 173408 bytes of core data to disk...
06:52:24:Unit 02:Done: 172896 -> 171470 (compressed to 99.1 percent)
06:52:24:Unit 02:  ... Done.
06:52:24:Unit 02:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
06:52:24:FahCore, running Unit 02, returned: FINISHED_UNIT (100 = 0x64)
06:52:25:Sending unit results: id:02 state:SEND error:OK project:5792 run:7 clone:678 gen:3 core:0x11 unit:0x57e343464efa776a000302a6000716a0
06:52:25:Unit 02: Uploading 167.95KiB to 171.64.65.106
06:52:25:Connecting to 171.64.65.106:8080
06:52:25:WARNING: Exception: Failed to send results to work server: Upload failed
06:52:25:Trying to send results to collection server
06:52:25:Unit 02: Uploading 167.95KiB to 171.67.108.25
06:52:25:Connecting to 171.67.108.25:8080
06:52:43:WARNING: WorkServer connection failed on port 8080 trying 80
06:52:43:Connecting to 171.67.108.25:80
06:53:02:ERROR: Exception: Failed to connect to 171.67.108.25:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
06:53:02:Sending unit results: id:02 state:SEND error:OK project:5792 run:7 clone:678 gen:3 core:0x11 unit:0x57e343464efa776a000302a6000716a0
06:53:02:Unit 02: Uploading 167.95KiB to 171.64.65.106
06:53:02:Connecting to 171.64.65.106:8080
06:53:02:WARNING: Exception: Failed to send results to work server: Upload failed
06:53:02:Trying to send results to collection server
06:53:02:Unit 02: Uploading 167.95KiB to 171.67.108.25
06:53:02:Connecting to 171.67.108.25:8080
06:53:04:WARNING: WorkServer connection failed on port 8080 trying 80
06:53:04:Connecting to 171.67.108.25:80
06:53:05:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
06:54:02:Sending unit results: id:02 state:SEND error:OK project:5792 run:7 clone:678 gen:3 core:0x11 unit:0x57e343464efa776a000302a6000716a0
06:54:02:Unit 02: Uploading 167.95KiB to 171.64.65.106
06:54:02:Connecting to 171.64.65.106:8080
06:54:02:WARNING: Exception: Failed to send results to work server: Upload failed
06:54:02:Trying to send results to collection server
06:54:02:Unit 02: Uploading 167.95KiB to 171.67.108.25
06:54:02:Connecting to 171.67.108.25:8080
06:54:04:WARNING: WorkServer connection failed on port 8080 trying 80
06:54:04:Connecting to 171.67.108.25:80
06:54:05:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
06:55:39:Sending unit results: id:02 state:SEND error:OK project:5792 run:7 clone:678 gen:3 core:0x11 unit:0x57e343464efa776a000302a6000716a0
06:55:39:Unit 02: Uploading 167.95KiB to 171.64.65.106
06:55:39:Connecting to 171.64.65.106:8080
06:55:39:WARNING: Exception: Failed to send results to work server: Upload failed
06:55:39:Trying to send results to collection server
06:55:39:Unit 02: Uploading 167.95KiB to 171.67.108.25
06:55:39:Connecting to 171.67.108.25:8080
06:55:41:WARNING: WorkServer connection failed on port 8080 trying 80
06:55:41:Connecting to 171.67.108.25:80
06:55:42:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.

Re: Upload failed: WS 171.64.65.106 CS 171.67.108.25

Posted: Wed Dec 28, 2011 5:54 pm
by bruce
The status of NO RESPONSE has been showing since before you posted. The server can't be contacted so something is wrong.

I'll try to find somebody who can fix it.