Page 1 of 2
WU 11758 & 11778 [Fixed]
Posted: Tue Mar 24, 2020 11:00 am
by uro666
Hey guys, I seem to having send issues with the following two units, both have been trying to send for a number hours now:
PRCG 11758 (0, 53, 0):
Code: Select all
10:24:05:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11758 run:0 clone:53 gen:0 core:0x22 unit:0x0000000d9bf7a4d55e6d770ece7dfad8
10:24:05:WU01:FS01:Uploading 55.24MiB to 155.247.164.213
10:24:05:WU01:FS01:Connecting to 155.247.164.213:8080
10:24:06:WARNING:WU01:FS01:Exception: Failed to send results to work server: Transfer failed
10:24:06:WU01:FS01:Trying to send results to collection server
10:24:06:WU01:FS01:Uploading 55.24MiB to 155.247.164.214
10:24:06:WU01:FS01:Connecting to 155.247.164.214:8080
10:24:06:ERROR:WU01:FS01:Exception: Transfer failed
PCRG 11778 (0, 20518, 0):
Code: Select all
10:24:05:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:11778 run:0 clone:20518 gen:0 core:0x22 unit:0x00000001287234c95e77490314d62814
10:24:05:WU02:FS01:Uploading 23.05MiB to 40.114.52.201
10:24:05:WU02:FS01:Connecting to 40.114.52.201:8080
10:24:26:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
10:24:26:WU02:FS01:Connecting to 40.114.52.201:80
10:24:48:WARNING:WU02:FS01:Exception: Failed to send results to work server: Failed to connect to 40.114.52.201:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
10:24:48:WU02:FS01:Trying to send results to collection server
10:24:48:WU02:FS01:Uploading 23.05MiB to 13.90.152.57
10:24:48:WU02:FS01:Connecting to 13.90.152.57:8080
10:25:09:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
10:25:09:WU02:FS01:Connecting to 13.90.152.57:80
10:25:30:ERROR:WU02:FS01:Exception: Failed to connect to 13.90.152.57:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
I realise your all hyper busy and I dont expect an instant answer but thought I'd let you know.
If there's any more log data you require I can fish it out, cheers!
Re: WU 11758 & 11778
Posted: Tue Mar 24, 2020 6:00 pm
by uro666
Just an update to this WU 11778 (0, 20518, 0) eventually uploaded to 40.114.52.201 & I see it was restarted ~1hour ago.
WU 11758 is still Transfer Failing
Re: WU 11758 & 11778
Posted: Tue Mar 24, 2020 7:41 pm
by nastasache
Lot of Transfer failed on 11758 for me too. And a lot of waiting time for new Wu's.
Code: Select all
19:24:03:WU02:FS02:Sending unit results: id:02 state:SEND error:NO_ERROR project:11758 run:0 clone:362 gen:0 core:0x22 unit:0x0000000b9bf7a4d55e6d770f028d9463
19:24:03:WU02:FS02:Uploading 55.24MiB to 155.247.164.213
19:24:03:WU02:FS02:Connecting to 155.247.164.213:8080
19:24:04:WARNING:WU02:FS02:Exception: Failed to send results to work server: Transfer failed
19:24:04:WU02:FS02:Trying to send results to collection server
19:24:04:WU02:FS02:Uploading 55.24MiB to 155.247.164.214
19:24:04:WU02:FS02:Connecting to 155.247.164.214:8080
19:24:04:WU00:FS00:Connecting to 65.254.110.245:8080
19:24:04:ERROR:WU02:FS02:Exception: Transfer failed
19:24:04:WU01:FS01:Connecting to 65.254.110.245:8080
19:24:04:WU03:FS02:Connecting to 65.254.110.245:8080
19:24:04:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
19:24:04:WU00:FS00:Connecting to 18.218.241.186:80
19:24:04:WU01:FS01:Assigned to work server 40.114.52.201
19:24:04:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce GTX 1080 Ti] 11380 from 40.114.52.201
19:24:04:WU01:FS01:Connecting to 40.114.52.201:8080
19:24:05:WU03:FS02:Assigned to work server 40.114.52.201
19:24:05:WU03:FS02:Requesting new work unit for slot 02: READY gpu:1:GP102 [GeForce GTX 1080 Ti] 11380 from 40.114.52.201
19:24:05:WU03:FS02:Connecting to 40.114.52.201:8080
19:24:05:WARNING:WU00:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
19:24:05:ERROR:WU00:FS00:Exception: Could not get an assignment
19:24:25:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
19:24:25:WU01:FS01:Connecting to 40.114.52.201:80
19:24:26:WARNING:WU03:FS02:WorkServer connection failed on port 8080 trying 80
19:24:26:WU03:FS02:Connecting to 40.114.52.201:80
19:24:47:ERROR:WU01:FS01:Exception: Failed to connect to 40.114.52.201:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
19:24:47:ERROR:WU03:FS02:Exception: Failed to connect to 40.114.52.201:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
I think they have no enough hardware and human resources to handle such a huge request in helping to run Covid projects. Maybe they need a government help to run more servers or a cloud solution.
Re: WU 11758 & 11778
Posted: Tue Mar 24, 2020 8:17 pm
by Kebast
I'm having issues uploading 11778 as well:
20:01:54:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
Code: Select all
19:40:49:WU00:FS00:0x22:Completed 2000000 out of 2000000 steps (100%)
19:40:52:WU00:FS00:0x22:Saving result file ../logfile_01.txt
19:40:52:WU00:FS00:0x22:Saving result file checkpointState.xml
19:40:52:WU00:FS00:0x22:Saving result file checkpt.crc
19:40:52:WU00:FS00:0x22:Saving result file positions.xtc
19:40:52:WU00:FS00:0x22:Saving result file science.log
19:40:52:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
19:40:52:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11778 run:0 clone:4969 gen:7 core:0x22 unit:0x0000000d287234c95e73c3ff9fe05b59
19:40:52:WU00:FS00:Uploading 23.10MiB to 40.114.52.201
19:40:52:WU00:FS00:Connecting to 40.114.52.201:8080
19:41:23:WU00:FS00:Upload 0.27%
19:41:49:WU00:FS00:Upload 2.16%
19:42:08:WU00:FS00:Upload 5.68%
19:42:30:WU00:FS00:Upload 7.31%
19:42:44:WU00:FS00:Upload 8.93%
19:43:08:WU00:FS00:Upload 10.55%
19:43:30:WU00:FS00:Upload 12.18%
19:43:45:WU00:FS00:Upload 13.53%
19:44:14:WU00:FS00:Upload 15.15%
19:44:29:WU00:FS00:Upload 16.78%
19:44:50:WU00:FS00:Upload 18.40%
19:45:07:WU00:FS00:Upload 19.76%
19:45:31:WU00:FS00:Upload 21.38%
19:45:46:WU00:FS00:Upload 23.00%
19:46:01:WU00:FS00:Upload 24.63%
19:46:21:WU00:FS00:Upload 26.25%
19:46:35:WU00:FS00:Upload 27.87%
19:46:51:WU00:FS00:Upload 29.50%
19:47:06:WU00:FS00:Upload 31.12%
19:47:26:WU00:FS00:Upload 32.47%
19:47:40:WU00:FS00:Upload 34.10%
19:48:00:WU00:FS00:Upload 35.72%
19:48:13:WU00:FS00:Upload 37.35%
19:48:34:WU00:FS00:Upload 38.70%
19:49:08:WU00:FS00:Upload 40.32%
19:49:43:WU00:FS00:Upload 41.95%
19:50:18:WU00:FS00:Upload 43.57%
19:50:55:WU00:FS00:Upload 44.92%
19:51:18:WU00:FS00:Upload 46.55%
19:51:34:WU00:FS00:Upload 48.17%
19:51:47:WU00:FS00:Upload 49.79%
19:51:58:WU00:FS00:Upload 51.15%
19:52:11:WU00:FS00:Upload 52.77%
19:52:29:WU00:FS00:Upload 54.39%
19:52:46:WU00:FS00:Upload 56.02%
19:53:07:WU00:FS00:Upload 57.37%
19:53:24:WU00:FS00:Upload 58.99%
19:53:36:WU00:FS00:Upload 60.62%
19:53:56:WU00:FS00:Upload 62.24%
19:54:16:WU00:FS00:Upload 63.87%
19:54:45:WU00:FS00:Upload 65.22%
19:55:04:WU00:FS00:Upload 66.84%
19:55:32:WU00:FS00:Upload 68.47%
19:55:54:WU00:FS00:Upload 70.09%
19:56:20:WU00:FS00:Upload 71.44%
19:56:38:WU00:FS00:Upload 73.07%
19:56:54:WU00:FS00:Upload 74.69%
19:57:07:WU00:FS00:Upload 76.31%
19:57:20:WU00:FS00:Upload 77.67%
19:57:33:WU00:FS00:Upload 79.29%
19:57:50:WU00:FS00:Upload 80.92%
19:58:05:WU00:FS00:Upload 82.54%
19:58:18:WU00:FS00:Upload 83.89%
19:58:38:WU00:FS00:Upload 85.52%
19:58:52:WU00:FS00:Upload 87.14%
19:59:14:WU00:FS00:Upload 88.76%
19:59:32:WU00:FS00:Upload 90.12%
19:59:55:WU00:FS00:Upload 91.74%
20:00:22:WU00:FS00:Upload 93.36%
20:00:42:WU00:FS00:Upload 94.99%
20:01:05:WU00:FS00:Upload 96.34%
20:01:23:WU00:FS00:Upload 97.96%
20:01:53:WU00:FS00:Upload 99.59%
20:01:54:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
20:01:54:WU00:FS00:Trying to send results to collection server
20:01:54:WU00:FS00:Uploading 23.10MiB to 155.247.164.214
20:01:54:WU00:FS00:Connecting to 155.247.164.214:8080
20:01:54:ERROR:WU00:FS00:Exception: Transfer failed
20:01:54:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11778 run:0 clone:4969 gen:7 core:0x22 unit:0x0000000d287234c95e73c3ff9fe05b59
20:01:54:WU00:FS00:Uploading 23.10MiB to 40.114.52.201
20:01:54:WU00:FS00:Connecting to 40.114.52.201:8080
20:04:04:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
20:04:04:WU00:FS00:Connecting to 40.114.52.201:80
20:05:09:WU00:FS00:Upload 0.27%
20:05:16:WU00:FS00:Upload 4.87%
20:05:26:WU00:FS00:Upload 7.31%
20:05:44:WU00:FS00:Upload 8.39%
20:06:12:WU00:FS00:Upload 9.47%
20:06:42:WU00:FS00:Upload 10.55%
20:07:01:WU00:FS00:Upload 11.91%
20:07:15:WU00:FS00:Upload 12.99%
20:07:35:WU00:FS00:Upload 14.07%
20:07:56:WU00:FS00:Upload 15.43%
20:08:05:WU00:FS00:Upload 16.51%
20:08:12:WU00:FS00:Upload 17.59%
20:08:21:WU00:FS00:Upload 20.03%
20:08:30:WU00:FS00:Upload 22.46%
20:09:13:WU00:FS00:Upload 23.54%
20:09:54:WU00:FS00:Upload 24.63%
20:10:34:WU00:FS00:Upload 25.98%
20:11:29:WU00:FS00:Upload 27.06%
20:12:03:WU00:FS00:Upload 28.14%
20:12:40:WU00:FS00:Upload 29.23%
20:12:56:WU00:FS00:Upload 30.58%
20:13:06:WU00:FS00:Upload 31.66%
20:13:12:WU00:FS00:Upload 32.74%
20:13:23:WU00:FS00:Upload 35.18%
20:13:54:WU00:FS00:Upload 37.62%
Re: WU 11758 & 11778
Posted: Tue Mar 24, 2020 8:24 pm
by alxbelu
There's a known issue with WU 11758 and servers 213/214 and project/server managers have been notified: viewtopic.php?f=18&t=32492&start=90
As for 11778; I've seen WU 11777 struggle to upload to server .201 this morning (europe), but eventually got through. In this case I believe it's a simple case of overloaded servers. (edit: and for the record, .201 is actually an Azure/cloud server)
Re: WU 11758 & 11778
Posted: Tue Mar 24, 2020 8:32 pm
by bruce
Right.
Known, notifed, and (to me) it looks like a capacity issue with Azure.
All this volunteer help seems to be able to overload whatever new resources we manage to bring on-line.
Re: WU 11758 & 11778
Posted: Thu Mar 26, 2020 4:09 pm
by uro666
Update.
WU 11758 eventually uploaded this morning:
Code: Select all
07:08:16:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11758 run:0 clone:53 gen:0 core:0x22 unit:0x0000000d9bf7a4d55e6d770ece7dfad8
07:08:16:WU01:FS01:Uploading 55.24MiB to 155.247.164.213
07:08:16:WU01:FS01:Connecting to 155.247.164.213:8080
07:08:37:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
07:08:37:WU01:FS01:Connecting to 155.247.164.213:80
07:08:58:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 155.247.164.213:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
07:08:58:WU01:FS01:Trying to send results to collection server
07:08:58:WU01:FS01:Uploading 55.24MiB to 155.247.164.214
07:08:58:WU01:FS01:Connecting to 155.247.164.214:8080
07:09:04:WU01:FS01:Upload 5.66%
07:09:10:WU01:FS01:Upload 11.99%
07:09:16:WU01:FS01:Upload 18.33%
07:09:22:WU01:FS01:Upload 24.78%
07:09:28:WU01:FS01:Upload 30.89%
07:09:34:WU01:FS01:Upload 37.00%
07:09:40:WU01:FS01:Upload 42.54%
07:09:46:WU01:FS01:Upload 48.88%
07:09:52:WU01:FS01:Upload 55.22%
07:09:58:WU01:FS01:Upload 60.42%
07:10:04:WU01:FS01:Upload 65.97%
07:10:10:WU01:FS01:Upload 71.96%
07:10:16:WU01:FS01:Upload 78.07%
07:10:22:WU01:FS01:Upload 84.30%
07:10:28:WU01:FS01:Upload 90.18%
07:10:34:WU01:FS01:Upload 96.63%
07:10:38:WU01:FS01:Upload complete
07:10:38:WU01:FS01:Server responded WORK_ACK (400)
07:10:38:WU01:FS01:Final credit estimate, 16615.00 points
07:10:38:WU01:FS01:Cleaning up
I note the developer stated in
another thread relating to this WU that the issue should be fixed now, which was not long after my WU uploaded.
As my WU uploaded this morning my issues with this WU are now resolved.
Thank you to those involved for reporting & fixing this.
Re: WU 11758 & 11778 [Fixed]
Posted: Sat Mar 28, 2020 11:46 am
by KjartanD
Are you guys getting crazy lolw points for this 11778 WU also? My 1070 worked on one for some times and when finally returned, it got 9405points for it. Took just minutes to upload.
Re: WU 11758 & 11778 [Fixed]
Posted: Sat Mar 28, 2020 11:55 am
by Neil-B
It has a one day timeout for the QRB if not returned within that then base points which are 9405
https://apps.foldingathome.org/psummary
Re: WU 11758 & 11778 [Fixed]
Posted: Sat Mar 28, 2020 12:23 pm
by KjartanD
Yes but it was returned within 3hours...
Re: WU 11758 & 11778 [Fixed]
Posted: Sat Mar 28, 2020 12:27 pm
by Neil-B
Can't see how many WUs you have done but until you completed 10 WUs (with other caveats see link) you won't see QRBs …
https://foldingathome.org/support/faq/points/ … there have been a few (relatively rare I believe) issues with specific servers no doing points correctly that have been stood up quickly - this might be one of the cases - these issues are being looked into and should be resolved over time … these links may have information that clarifies …
viewtopic.php?f=18&t=33072&start=15 and
viewtopic.php?f=74&t=33302
Re: WU 11758 & 11778 [Fixed]
Posted: Sat Mar 28, 2020 4:43 pm
by KjartanD
Neil-B wrote:Can't see how many WUs you have done but until you completed 10 WUs (with other caveats see link) you won't see QRBs …
https://foldingathome.org/support/faq/points/ … there have been a few (relatively rare I believe) issues with specific servers no doing points correctly that have been stood up quickly - this might be one of the cases - these issues are being looked into and should be resolved over time … these links may have information that clarifies …
viewtopic.php?f=18&t=33072&start=15 and
viewtopic.php?f=74&t=33302
I have completed more than 2400 WU
I will look at those links, thanks.
Re: WU 11758 & 11778 [Fixed]
Posted: Sat Mar 28, 2020 6:13 pm
by Neil-B
Fair chance you would have done more than 10 but have seen so many 1st WU not credited post in the last week or so I thought I would mention it as a quick check on the stats for you just gave me the current Bad Gateway result … Hope the links help - hopefully one of the known issues (which are being worked on any hopefully resolved soon).
Re: WU 11758 & 11778 [Fixed]
Posted: Sun Mar 29, 2020 2:52 pm
by uro666
KjartanD wrote:Are you guys getting crazy lolw points for this 11778 WU also? My 1070 worked on one for some times and when finally returned, it got 9405points for it. Took just minutes to upload.
Yep I got another 11778 WU last night and returned it in around 2h 35m (also a 1070 GPU) with:
Final credit estimate, 9405.00 points - which is the base credit for the WU.
Log:
Code: Select all
00:46:40:WU00:FS01:Connecting to 18.218.241.186:80
00:46:40:WU00:FS01:Assigned to work server 40.114.52.201
00:46:40:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP104 [GeForce GTX 1070] 6463 from 40.114.52.201
00:46:40:WU00:FS01:Connecting to 40.114.52.201:8080
00:47:10:WU00:FS01:Downloading 29.59MiB
00:47:16:WU00:FS01:Download 13.10%
00:47:22:WU00:FS01:Download 46.05%
00:47:36:WU00:FS01:Download 80.69%
00:47:41:WU00:FS01:Download complete
00:47:41:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:11778 run:0 clone:15499 gen:3 core:0x22 unit:0x00000009287234c95e77496fad6276f7
00:47:41:WU00:FS01:Starting
00:47:41:WU00:FS01:Running FahCore: "d:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\<user>\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 00 -suffix 01 -version 705 -lifeline 40092 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
00:47:41:WU00:FS01:Started FahCore on PID 55776
00:47:41:WU00:FS01:Core PID:56788
00:47:41:WU00:FS01:FahCore 0x22 started
00:47:41:WU00:FS01:0x22:*********************** Log Started 2020-03-29T00:47:41Z ***********************
00:47:41:WU00:FS01:0x22:*************************** Core22 Folding@home Core ***************************
00:47:41:WU00:FS01:0x22: Type: 0x22
00:47:41:WU00:FS01:0x22: Core: Core22
00:47:41:WU00:FS01:0x22: Website: https://foldingathome.org/
00:47:41:WU00:FS01:0x22: Copyright: (c) 2009-2018 foldingathome.org
00:47:41:WU00:FS01:0x22: Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
00:47:41:WU00:FS01:0x22: <rafal.wiewiora@choderalab.org>
00:47:41:WU00:FS01:0x22: Args: -dir 00 -suffix 01 -version 705 -lifeline 55776 -checkpoint 15
00:47:41:WU00:FS01:0x22: -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
00:47:41:WU00:FS01:0x22: 0 -gpu 0
00:47:41:WU00:FS01:0x22: Config: <none>
00:47:41:WU00:FS01:0x22:************************************ Build *************************************
00:47:41:WU00:FS01:0x22: Version: 0.0.2
00:47:41:WU00:FS01:0x22: Date: Dec 6 2019
00:47:41:WU00:FS01:0x22: Time: 21:30:31
00:47:41:WU00:FS01:0x22: Repository: Git
00:47:41:WU00:FS01:0x22: Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
00:47:41:WU00:FS01:0x22: Branch: HEAD
00:47:41:WU00:FS01:0x22: Compiler: Visual C++ 2008
00:47:41:WU00:FS01:0x22: Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
00:47:41:WU00:FS01:0x22: Platform: win32 10
00:47:41:WU00:FS01:0x22: Bits: 64
00:47:41:WU00:FS01:0x22: Mode: Release
00:47:41:WU00:FS01:0x22:************************************ System ************************************
00:47:41:WU00:FS01:0x22: CPU: AMD Ryzen 9 3900X 12-Core Processor
00:47:41:WU00:FS01:0x22: CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
00:47:41:WU00:FS01:0x22: CPUs: 24
00:47:41:WU00:FS01:0x22: Memory: 31.92GiB
00:47:41:WU00:FS01:0x22:Free Memory: 21.56GiB
00:47:41:WU00:FS01:0x22: Threads: WINDOWS_THREADS
00:47:41:WU00:FS01:0x22: OS Version: 6.2
00:47:41:WU00:FS01:0x22:Has Battery: false
00:47:41:WU00:FS01:0x22: On Battery: false
00:47:41:WU00:FS01:0x22: UTC Offset: 0
00:47:41:WU00:FS01:0x22: PID: 56788
00:47:41:WU00:FS01:0x22: CWD: C:\Users\<user>\AppData\Roaming\FAHClient\work
00:47:41:WU00:FS01:0x22: OS: Windows 10 Pro
00:47:41:WU00:FS01:0x22: OS Arch: AMD64
00:47:41:WU00:FS01:0x22:********************************************************************************
00:47:41:WU00:FS01:0x22:Project: 11778 (Run 0, Clone 15499, Gen 3)
00:47:41:WU00:FS01:0x22:Unit: 0x00000009287234c95e77496fad6276f7
00:47:41:WU00:FS01:0x22:Reading tar file core.xml
00:47:41:WU00:FS01:0x22:Reading tar file integrator.xml
00:47:41:WU00:FS01:0x22:Reading tar file state.xml
00:47:41:WU00:FS01:0x22:Reading tar file system.xml
00:47:41:WU00:FS01:0x22:Digital signatures verified
00:47:41:WU00:FS01:0x22:Folding@home GPU Core22 Folding@home Core
00:47:41:WU00:FS01:0x22:Version 0.0.2
00:47:46:WU00:FS01:0x22:Completed 0 out of 2000000 steps (0%)
00:47:46:WU00:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
00:49:16:WU00:FS01:0x22:Completed 20000 out of 2000000 steps (1%)
00:50:48:WU00:FS01:0x22:Completed 40000 out of 2000000 steps (2%)
00:52:25:WU00:FS01:0x22:Completed 60000 out of 2000000 steps (3%)
00:53:58:WU00:FS01:0x22:Completed 80000 out of 2000000 steps (4%)
00:55:32:WU00:FS01:0x22:Completed 100000 out of 2000000 steps (5%)
00:57:08:WU00:FS01:0x22:Completed 120000 out of 2000000 steps (6%)
00:58:41:WU00:FS01:0x22:Completed 140000 out of 2000000 steps (7%)
01:00:17:WU00:FS01:0x22:Completed 160000 out of 2000000 steps (8%)
01:01:51:WU00:FS01:0x22:Completed 180000 out of 2000000 steps (9%)
01:03:23:WU00:FS01:0x22:Completed 200000 out of 2000000 steps (10%)
01:04:57:WU00:FS01:0x22:Completed 220000 out of 2000000 steps (11%)
01:06:29:WU00:FS01:0x22:Completed 240000 out of 2000000 steps (12%)
01:08:03:WU00:FS01:0x22:Completed 260000 out of 2000000 steps (13%)
01:09:35:WU00:FS01:0x22:Completed 280000 out of 2000000 steps (14%)
01:11:07:WU00:FS01:0x22:Completed 300000 out of 2000000 steps (15%)
01:12:41:WU00:FS01:0x22:Completed 320000 out of 2000000 steps (16%)
01:14:13:WU00:FS01:0x22:Completed 340000 out of 2000000 steps (17%)
01:15:44:WU00:FS01:0x22:Completed 360000 out of 2000000 steps (18%)
01:17:12:WU00:FS01:0x22:Completed 380000 out of 2000000 steps (19%)
01:18:41:WU00:FS01:0x22:Completed 400000 out of 2000000 steps (20%)
01:20:11:WU00:FS01:0x22:Completed 420000 out of 2000000 steps (21%)
01:21:40:WU00:FS01:0x22:Completed 440000 out of 2000000 steps (22%)
01:23:10:WU00:FS01:0x22:Completed 460000 out of 2000000 steps (23%)
01:24:39:WU00:FS01:0x22:Completed 480000 out of 2000000 steps (24%)
01:26:08:WU00:FS01:0x22:Completed 500000 out of 2000000 steps (25%)
01:27:38:WU00:FS01:0x22:Completed 520000 out of 2000000 steps (26%)
01:29:07:WU00:FS01:0x22:Completed 540000 out of 2000000 steps (27%)
01:30:37:WU00:FS01:0x22:Completed 560000 out of 2000000 steps (28%)
01:32:06:WU00:FS01:0x22:Completed 580000 out of 2000000 steps (29%)
01:33:34:WU00:FS01:0x22:Completed 600000 out of 2000000 steps (30%)
01:35:05:WU00:FS01:0x22:Completed 620000 out of 2000000 steps (31%)
01:36:33:WU00:FS01:0x22:Completed 640000 out of 2000000 steps (32%)
01:38:04:WU00:FS01:0x22:Completed 660000 out of 2000000 steps (33%)
01:39:32:WU00:FS01:0x22:Completed 680000 out of 2000000 steps (34%)
01:41:01:WU00:FS01:0x22:Completed 700000 out of 2000000 steps (35%)
01:42:33:WU00:FS01:0x22:Completed 720000 out of 2000000 steps (36%)
01:44:04:WU00:FS01:0x22:Completed 740000 out of 2000000 steps (37%)
01:45:38:WU00:FS01:0x22:Completed 760000 out of 2000000 steps (38%)
01:47:09:WU00:FS01:0x22:Completed 780000 out of 2000000 steps (39%)
01:48:41:WU00:FS01:0x22:Completed 800000 out of 2000000 steps (40%)
01:50:15:WU00:FS01:0x22:Completed 820000 out of 2000000 steps (41%)
01:51:46:WU00:FS01:0x22:Completed 840000 out of 2000000 steps (42%)
01:53:20:WU00:FS01:0x22:Completed 860000 out of 2000000 steps (43%)
01:54:52:WU00:FS01:0x22:Completed 880000 out of 2000000 steps (44%)
01:56:23:WU00:FS01:0x22:Completed 900000 out of 2000000 steps (45%)
01:57:57:WU00:FS01:0x22:Completed 920000 out of 2000000 steps (46%)
01:59:28:WU00:FS01:0x22:Completed 940000 out of 2000000 steps (47%)
02:01:02:WU00:FS01:0x22:Completed 960000 out of 2000000 steps (48%)
02:02:33:WU00:FS01:0x22:Completed 980000 out of 2000000 steps (49%)
02:04:05:WU00:FS01:0x22:Completed 1000000 out of 2000000 steps (50%)
02:05:38:WU00:FS01:0x22:Completed 1020000 out of 2000000 steps (51%)
02:07:10:WU00:FS01:0x22:Completed 1040000 out of 2000000 steps (52%)
02:08:44:WU00:FS01:0x22:Completed 1060000 out of 2000000 steps (53%)
02:10:15:WU00:FS01:0x22:Completed 1080000 out of 2000000 steps (54%)
02:11:46:WU00:FS01:0x22:Completed 1100000 out of 2000000 steps (55%)
02:13:20:WU00:FS01:0x22:Completed 1120000 out of 2000000 steps (56%)
02:14:51:WU00:FS01:0x22:Completed 1140000 out of 2000000 steps (57%)
02:16:25:WU00:FS01:0x22:Completed 1160000 out of 2000000 steps (58%)
02:17:57:WU00:FS01:0x22:Completed 1180000 out of 2000000 steps (59%)
02:19:28:WU00:FS01:0x22:Completed 1200000 out of 2000000 steps (60%)
02:21:02:WU00:FS01:0x22:Completed 1220000 out of 2000000 steps (61%)
02:22:33:WU00:FS01:0x22:Completed 1240000 out of 2000000 steps (62%)
02:24:07:WU00:FS01:0x22:Completed 1260000 out of 2000000 steps (63%)
02:25:39:WU00:FS01:0x22:Completed 1280000 out of 2000000 steps (64%)
02:27:10:WU00:FS01:0x22:Completed 1300000 out of 2000000 steps (65%)
02:28:44:WU00:FS01:0x22:Completed 1320000 out of 2000000 steps (66%)
02:30:15:WU00:FS01:0x22:Completed 1340000 out of 2000000 steps (67%)
02:31:49:WU00:FS01:0x22:Completed 1360000 out of 2000000 steps (68%)
02:33:20:WU00:FS01:0x22:Completed 1380000 out of 2000000 steps (69%)
02:34:51:WU00:FS01:0x22:Completed 1400000 out of 2000000 steps (70%)
02:36:25:WU00:FS01:0x22:Completed 1420000 out of 2000000 steps (71%)
02:37:56:WU00:FS01:0x22:Completed 1440000 out of 2000000 steps (72%)
02:39:30:WU00:FS01:0x22:Completed 1460000 out of 2000000 steps (73%)
02:41:00:WU00:FS01:0x22:Completed 1480000 out of 2000000 steps (74%)
02:42:29:WU00:FS01:0x22:Completed 1500000 out of 2000000 steps (75%)
02:43:59:WU00:FS01:0x22:Completed 1520000 out of 2000000 steps (76%)
02:45:27:WU00:FS01:0x22:Completed 1540000 out of 2000000 steps (77%)
02:46:58:WU00:FS01:0x22:Completed 1560000 out of 2000000 steps (78%)
02:48:26:WU00:FS01:0x22:Completed 1580000 out of 2000000 steps (79%)
02:49:55:WU00:FS01:0x22:Completed 1600000 out of 2000000 steps (80%)
02:51:26:WU00:FS01:0x22:Completed 1620000 out of 2000000 steps (81%)
02:52:54:WU00:FS01:0x22:Completed 1640000 out of 2000000 steps (82%)
02:54:24:WU00:FS01:0x22:Completed 1660000 out of 2000000 steps (83%)
02:55:53:WU00:FS01:0x22:Completed 1680000 out of 2000000 steps (84%)
02:57:23:WU00:FS01:0x22:Completed 1700000 out of 2000000 steps (85%)
02:58:54:WU00:FS01:0x22:Completed 1720000 out of 2000000 steps (86%)
03:00:25:WU00:FS01:0x22:Completed 1740000 out of 2000000 steps (87%)
03:01:57:WU00:FS01:0x22:Completed 1760000 out of 2000000 steps (88%)
03:03:27:WU00:FS01:0x22:Completed 1780000 out of 2000000 steps (89%)
03:04:57:WU00:FS01:0x22:Completed 1800000 out of 2000000 steps (90%)
03:06:29:WU00:FS01:0x22:Completed 1820000 out of 2000000 steps (91%)
03:07:59:WU00:FS01:0x22:Completed 1840000 out of 2000000 steps (92%)
03:09:31:WU00:FS01:0x22:Completed 1860000 out of 2000000 steps (93%)
03:11:01:WU00:FS01:0x22:Completed 1880000 out of 2000000 steps (94%)
03:12:31:WU00:FS01:0x22:Completed 1900000 out of 2000000 steps (95%)
03:14:03:WU00:FS01:0x22:Completed 1920000 out of 2000000 steps (96%)
03:15:33:WU00:FS01:0x22:Completed 1940000 out of 2000000 steps (97%)
03:17:05:WU00:FS01:0x22:Completed 1960000 out of 2000000 steps (98%)
03:18:35:WU00:FS01:0x22:Completed 1980000 out of 2000000 steps (99%)
03:19:38:WU02:FS01:Connecting to 18.218.241.186:80
03:19:38:WU02:FS01:Assigned to work server 40.114.52.201
03:19:38:WU02:FS01:Requesting new work unit for slot 01: RUNNING gpu:0:GP104 [GeForce GTX 1070] 6463 from 40.114.52.201
03:19:38:WU02:FS01:Connecting to 40.114.52.201:8080
03:20:05:WU00:FS01:0x22:Completed 2000000 out of 2000000 steps (100%)
03:20:07:WU00:FS01:0x22:Saving result file ..\logfile_01.txt
03:20:07:WU00:FS01:0x22:Saving result file checkpointState.xml
03:20:07:WU00:FS01:0x22:Saving result file checkpt.crc
03:20:07:WU00:FS01:0x22:Saving result file positions.xtc
03:20:07:WU00:FS01:0x22:Saving result file science.log
03:20:07:WU00:FS01:0x22:Folding@home Core Shutdown: FINISHED_UNIT
03:20:08:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
03:20:08:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:11778 run:0 clone:15499 gen:3 core:0x22 unit:0x00000009287234c95e77496fad6276f7
03:20:08:WU00:FS01:Uploading 23.05MiB to 40.114.52.201
03:20:08:WU00:FS01:Connecting to 40.114.52.201:8080
03:20:59:WU00:FS01:Upload 0.54%
03:20:59:WARNING:WU00:FS01:Exception: Failed to send results to work server: Transfer failed
03:20:59:WU00:FS01:Trying to send results to collection server
03:20:59:WU00:FS01:Uploading 23.05MiB to 155.247.164.214
03:20:59:WU00:FS01:Connecting to 155.247.164.214:8080
03:21:05:WU00:FS01:Upload 13.56%
03:21:11:WU00:FS01:Upload 29.01%
03:21:17:WU00:FS01:Upload 44.47%
03:21:23:WU00:FS01:Upload 59.92%
03:21:29:WU00:FS01:Upload 75.65%
03:21:35:WU00:FS01:Upload 90.84%
03:21:38:WU00:FS01:Upload complete
03:21:39:WU00:FS01:Server responded WORK_ACK (400)
03:21:39:WU00:FS01:Final credit estimate, 9405.00 points
03:21:39:WU00:FS01:Cleaning up
The only thing weird in the log file is this part:
Code: Select all
03:20:17:ERROR:WU02:FS01:Exception: 10002: Received short response, expected 512 bytes, got 0
03:20:59:WU00:FS01:Upload 0.54%
03:20:59:WARNING:WU00:FS01:Exception: Failed to send results to work server: Transfer failed
03:20:59:WU00:FS01:Trying to send results to collection server
03:20:59:WU00:FS01:Uploading 23.05MiB to 155.247.164.214
03:20:59:WU00:FS01:Connecting to 155.247.164.214:8080
FAH tries to send the WU back to the Work Server, fails and then correctly sends it to the collection server.
I don't know the inner workings of the FAH backend or if a
"Transfer failed" affects anything points-wise, but it does seem odd that it first tries to send the WU back to the WS instead of straight to a CS, thankfully it fails-over to a Collection Server and the GPU work isn't wasted, albeit with a base points reward only.
With that said it's better than not uploading at all like it was doing prior to Thursday, still the correct points estimate for a quick turnaround on a WU would be the better situation.
Maybe an Admin/Mod can enlighten us further and/or highlight this issue to the relevant people.
Re: WU 11758 & 11778 [Fixed]
Posted: Sun Mar 29, 2020 2:57 pm
by Neil-B
As I understand it WUs are normally returned to the WS they are received from so the first attempts will be what would normally happen (and be successful) - CS are only a fall back if WS is unreachable (or overloaded) they then return the WU to the WS as/when comms capacity allow it to.