
143.89.243.111 is down

Posted: Wed Nov 04, 2020 8:39 pm
by Teddy
It seems 143.89.243.111 is reported as down, and I have not been able to return a completed work unit to this server for a couple of days now.
Is anybody looking at fixing this, please? I restarted folding after a reboot, but there was no change.
See the log below.

Code:

20:34:36:WU00:FS00:0xa7:Completed 480000 out of 500000 steps (96%)
20:34:40:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 143.89.243.111:80: A connection attempt failed because the connected party did not properly respond after a period of time or established connection failed because connected host has failed to respond.
20:35:39:WU03:FS01:0x22:Completed 1440000 out of 2000000 steps (72%)
20:36:35:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:16515 run:0 clone:1153 gen:5 core:0xa7 unit:0x000000088f59f36f5f7e98b9b8d044b5
20:36:35:WU01:FS00:Uploading 7.33MiB to 143.89.243.111
20:36:35:WU01:FS00:Connecting to 143.89.243.111:8080
20:36:56:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
20:36:56:WU01:FS00:Connecting to 143.89.243.111:80
20:37:17:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 143.89.243.111:80: A connection attempt failed because the connected party did not properly respond after a period of time or established connection failed because connected host has failed to respond.
20:37:35:WU03:FS01:0x22:Completed 1460000 out of 2000000 steps (73%)
20:37:48:WU00:FS00:0xa7:Completed 485000 out of 500000 steps (97%)
Mod Edit: Changed Quote Tags To Code Tags - PantherX

Re: 143.89.243.111 is down

Posted: Thu Nov 05, 2020 6:45 am
by PantherX
Thanks for that. I have informed the correct people :)

Re: 143.89.243.111 is down

Posted: Thu Nov 05, 2020 2:44 pm
by Hopfgeist
Thanks for reporting. Just wanted to add that I'm also having problems.

It is shown as "Down" in the server stats page, but something else puzzles me:

It is listed as handling "OPENMM_22" projects, which I understand are GPU projects. I only fold on CPUs, so why would it try to upload to 143.89.243.111?

Cheers,
HG.

Re: 143.89.243.111 is down

Posted: Thu Nov 05, 2020 2:52 pm
by Joe_H
Hopfgeist wrote:It is listed as handling "OPENMM_22" projects, which I understand are GPU projects. I only fold on CPUs, so why would it try to upload to 143.89.243.111?
When the server is down, that field may not be filled in with the proper information. I also have CPU projects waiting to upload to this server. Looking through the project numbers from this lab, I do know that some are GPU-based and the rest are CPU-based.

Re: 143.89.243.111 is down

Posted: Sun Nov 08, 2020 5:40 am
by bruce
Hopfgeist wrote:Thanks for reporting. Just wanted to add that I'm also having problems.

It is shown as "Down" in the server stats page, but something else puzzles me:

It is listed as handling "OPENMM_22" projects, which I understand are GPU projects. I only fold on CPUs, so why would it try to upload to 143.89.243.111?

Cheers,
HG.
Every WU must (eventually) be returned to the server that issued it. If you scroll back far enough in the log to find where the WU was originally downloaded, you can find the Work Server (WS) that it came from.
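
For anyone who would rather not scroll by hand, here is a rough Python sketch that collects the server addresses mentioned on each WU's log lines. It only assumes the line layout visible in the log posted at the top of this thread (time stamp, optional WARNING, WUxx, FSxx, message). The wording of the original download line is not shown in this thread, so the sketch does not look for it and simply lists every IP address seen per WU. The function name `servers_by_wu` is made up for this example.

```python
import re

# Layout assumed from the log in this thread, e.g.
#   20:36:35:WU01:FS00:Connecting to 143.89.243.111:8080
#   20:36:56:WARNING:WU01:FS00:WorkServer connection failed ...
LINE_RE = re.compile(r"^\d{2}:\d{2}:\d{2}:(?:WARNING:)?(WU\d+):FS\d+:(.*)$")
IP_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")


def servers_by_wu(log_lines):
    """Map each WU tag to the IP addresses seen on its lines, in first-seen order."""
    seen = {}
    for line in log_lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # not a per-WU line in the assumed layout
        wu, message = match.groups()
        for ip in IP_RE.findall(message):
            ips = seen.setdefault(wu, [])
            if ip not in ips:
                ips.append(ip)
    return seen
```

Feed it the lines of your log file (for example `servers_by_wu(open("log.txt"))`, where `log.txt` is whatever your log is called) and look up the WU you are interested in.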

Now suppose that WS is unavailable. In most cases, there will be a backup path called a Collection Server (CS), which is permitted to accept a WU that is (temporarily) unable to upload to the WS. Any server can act as a CS for any WS, so the fact that the CS reportedly handles Core_22 is not a problem. The CS will forward the WU to the WS whenever it can, but in the meantime your WU has safely been recorded as accepted by FAH. This also stops the timer used by the bonus points calculator.
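
To make the order of attempts concrete, here is a small Python sketch of the idea: try the WS first, and if that fails, try the CS. It is not the FAH client's actual code. The port order (8080, then 80) is taken from the log at the top of this thread; that the CS is tried on the same ports is an assumption, and `upload_with_fallback` and the `send` callable are hypothetical names invented for the illustration.

```python
def upload_with_fallback(send, work_server, collection_server=None, ports=(8080, 80)):
    """Try the WS on each port, then the CS (if any).

    `send(host, port)` is a hypothetical callable that raises ConnectionError
    on failure. Returns the (host, port) that accepted the upload, or None if
    every attempt failed and the upload has to be retried later.
    """
    targets = [work_server]
    if collection_server:
        targets.append(collection_server)
    for host in targets:
        for port in ports:
            try:
                send(host, port)
                return host, port
            except ConnectionError:
                continue  # move on to the next port, then the next server
    return None
```

If the function returns the CS address, the WU counts as handed in, and the CS takes care of passing it on to the WS later.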

Re: 143.89.243.111 is down

Posted: Sun Nov 08, 2020 6:14 am
by Teddy
PantherX wrote:Thanks for that. I have informed the correct people :)
Noticed the unit has finally been sent.
Thanks.