66.170.111.50 Connection refused error
Posted: Thu Nov 05, 2020 4:07 pm
by comixgoddess
I was checking on the status of this server as I have a WU that uploaded successfully but has not been credited (project:17407 run:0 clone:109 gen:136 shows as "Not found" on the WU status page).
Clicking on the hostname on the Server Status page (fah-w1.vmware.com) gets me the following response - ERR_CONNECTION_REFUSED. I've tried Chrome, Vivaldi, and Opera browsers with the same result.
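For anyone wanting to check this outside a browser, here is a minimal sketch that tells "connection refused" apart from a timeout. The function name `probe_tcp` and the port in the usage line are illustrative assumptions; the thread does not say which port the Server Status link uses.

```python
import socket


def probe_tcp(host: str, port: int, timeout: float = 5.0) -> str:
    """Attempt one TCP connection and classify the outcome.

    Hypothetical helper for illustration; not part of any FAH tool.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # The host answered, but nothing is listening on that port.
        return "refused"
    except socket.timeout:
        # No answer at all within the timeout (host down or filtered).
        return "timeout"
    except OSError as exc:
        # DNS failure, unreachable network, and similar errors.
        return f"error: {exc}"


if __name__ == "__main__":
    # Port 80 is an assumption; substitute whatever port the link points at.
    print(probe_tcp("fah-w1.vmware.com", 80))
```

A "refused" result would suggest the machine is up but the web service is not accepting connections, which fits the browser error above.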
Re: 66.170.111.50 Connection refused error
Posted: Thu Nov 05, 2020 4:24 pm
by Joe_H
66.170.111.50 has already been reported for this problem and for being low on space.
Re: 66.170.111.50 Connection refused error
Posted: Sun Nov 15, 2020 10:42 pm
by robinson
Is it okay to keep accepting work from this server?
I would expect the server's issue to be resolved before the WU times out, at least at my completion rate.
Re: 66.170.111.50 Connection refused error
Posted: Sun Nov 15, 2020 10:51 pm
by Joe_H
The previous report was over a week ago. In the meantime, about 12 TB of space has been freed up on the server. The server appears to be fine now; if you run into a problem, report it.
Re: 66.170.111.50 Connection refused error
Posted: Mon Nov 16, 2020 3:15 am
by robinson
I get errors and have a backlog of finished WUs.
I've poked around with my anti-virus and found no issues. I've folded with this PC for a few weeks without major issues, except that the folding slots reset after a reboot.
13:35:06:WU04:FS00:0xa8:*********************** Log Started 2020-11-15T13:35:05Z ***********************
13:35:06:WU04:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
13:35:06:WU04:FS00:0xa8: Core: Gromacs
13:35:06:WU04:FS00:0xa8: Type: 0xa8
13:35:06:WU04:FS00:0xa8: Version: 0.0.9
13:35:06:WU04:FS00:0xa8: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
13:35:06:WU04:FS00:0xa8: Copyright: 2020 foldingathome.org
13:35:06:WU04:FS00:0xa8: Homepage: https://foldingathome.org/
13:35:06:WU04:FS00:0xa8: Date: Oct 28 2020
13:35:06:WU04:FS00:0xa8: Time: 15:43:30
13:35:06:WU04:FS00:0xa8: Revision: 15f9b5e1edd6089f1a45553d2709a6aba62f735c
13:35:06:WU04:FS00:0xa8: Branch: master
13:35:06:WU04:FS00:0xa8: Compiler: Visual C++ 2019 16.7
13:35:06:WU04:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
13:35:06:WU04:FS00:0xa8: Platform: win32 10
13:35:06:WU04:FS00:0xa8: Bits: 64
13:35:06:WU04:FS00:0xa8: Mode: Release
13:35:06:WU04:FS00:0xa8: SIMD: avx_256
13:35:06:WU04:FS00:0xa8: OpenMP: ON
13:35:06:WU04:FS00:0xa8: CUDA: OFF
13:35:06:WU04:FS00:0xa8: Args: -dir 04 -suffix 01 -version 706 -lifeline 10828 -checkpoint 15 -np
13:35:06:WU04:FS00:0xa8: 32
13:35:06:WU04:FS00:0xa8:************************************ libFAH ************************************
13:35:06:WU04:FS00:0xa8: Date: Oct 28 2020
13:35:06:WU04:FS00:0xa8: Time: 15:11:42
13:35:06:WU04:FS00:0xa8: Revision: 15f9b5e1edd6089f1a45553d2709a6aba62f735c
13:35:06:WU04:FS00:0xa8: Branch: master
13:35:06:WU04:FS00:0xa8: Compiler: Visual C++ 2019 16.7
13:35:06:WU04:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
13:35:06:WU04:FS00:0xa8: Platform: win32 10
13:35:06:WU04:FS00:0xa8: Bits: 64
13:35:06:WU04:FS00:0xa8: Mode: Release
13:35:06:WU04:FS00:0xa8:************************************ CBang *************************************
13:35:06:WU04:FS00:0xa8: Date: Oct 28 2020
13:35:06:WU04:FS00:0xa8: Time: 15:11:22
13:35:06:WU04:FS00:0xa8: Revision: 15f9b5e1edd6089f1a45553d2709a6aba62f735c
13:35:06:WU04:FS00:0xa8: Branch: master
13:35:06:WU04:FS00:0xa8: Compiler: Visual C++ 2019 16.7
13:35:06:WU04:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
13:35:06:WU04:FS00:0xa8: Platform: win32 10
13:35:06:WU04:FS00:0xa8: Bits: 64
13:35:06:WU04:FS00:0xa8: Mode: Release
13:35:06:WU04:FS00:0xa8:************************************ System ************************************
13:35:06:WU04:FS00:0xa8: CPU: Intel(R) Xeon(R) CPU E5-2660 0 @ 2.20GHz
13:35:06:WU04:FS00:0xa8: CPU ID: GenuineIntel Family 6 Model 45 Stepping 7
13:35:06:WU04:FS00:0xa8: CPUs: 32
13:35:06:WU04:FS00:0xa8: Memory: 31.93GiB
13:35:06:WU04:FS00:0xa8:Free Memory: 27.92GiB
13:35:06:WU04:FS00:0xa8: Threads: WINDOWS_THREADS
13:35:06:WU04:FS00:0xa8: OS Version: 6.2
13:35:06:WU04:FS00:0xa8:Has Battery: false
13:35:06:WU04:FS00:0xa8: On Battery: false
13:35:06:WU04:FS00:0xa8: UTC Offset: -6
13:35:06:WU04:FS00:0xa8: PID: 8964
13:35:06:WU04:FS00:0xa8: CWD: C:\ProgramData\FAHClient\work
13:35:06:WU04:FS00:0xa8:********************************************************************************
13:35:06:WU04:FS00:0xa8:Project: 16812 (Run 1, Clone 1284, Gen 18)
13:35:06:WU04:FS00:0xa8:Unit: 0x00000017b2aec48a5f8ddfba9920098e
13:35:06:WU04:FS00:0xa8:Reading tar file core.xml
13:35:06:WU04:FS00:0xa8:Reading tar file frame18.tpr
13:35:06:WU04:FS00:0xa8:Digital signatures verified
13:35:06:WU04:FS00:0xa8:Calling: mdrun -c frame18.gro -s frame18.tpr -x frame18.xtc -cpt 15 -nt 32 -ntmpi 1
13:35:06:WU04:FS00:0xa8:Steps: first=9000000 total=9500000
13:35:09:WU04:FS00:0xa8:Completed 1 out of 500000 steps (0%)
13:35:12:WU03:FS00:Upload 33.36%
13:35:18:WU03:FS00:Upload 50.60%
13:35:24:WU03:FS00:Upload 67.28%
13:35:30:WU03:FS00:Upload 84.51%
13:35:36:WU03:FS00:Upload complete
13:35:36:WU03:FS00:Server responded WORK_ACK (400)
13:35:36:WU03:FS00:Final credit estimate, 7950.00 points
13:35:36:WU03:FS00:Cleaning up
13:36:10:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:17407 run:0 clone:554 gen:153 core:0xa7 unit:0x000000b242aa6f325f61845452a574e0
13:36:10:WU01:FS00:Uploading 9.94MiB to 66.170.111.50
13:36:10:WU01:FS00:Connecting to 66.170.111.50:8080
13:36:23:WU04:FS00:0xa8:Completed 5000 out of 500000 steps (1%)
13:36:40:WARNING:WU01:FS00:Exception: Failed to send results to work server: 10002: Received short response, expected 512 bytes, got 0
13:37:37:WU04:FS00:0xa8:Completed 10000 out of 500000 steps (2%)
13:38:51:WU04:FS00:0xa8:Completed 15000 out of 500000 steps (3%)
******************************* Date: 2020-11-15 *******************************
13:40:05:WU04:FS00:0xa8:Completed 20000 out of 500000 steps (4%)
13:41:19:WU04:FS00:0xa8:Completed 25000 out of 500000 steps (5%)
13:42:33:WU04:FS00:0xa8:Completed 30000 out of 500000 steps (6%)
13:43:57:WU04:FS00:0xa8:Completed 35000 out of 500000 steps (7%)
13:45:18:WU04:FS00:0xa8:Completed 40000 out of 500000 steps (8%)
13:46:32:WU04:FS00:0xa8:Completed 45000 out of 500000 steps (9%)
13:47:44:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:17410 run:0 clone:285 gen:183 core:0xa7 unit:0x000000d242aa6f325f618419c77530bd
13:47:44:WU02:FS00:Uploading 9.93MiB to 66.170.111.50
13:47:44:WU02:FS00:Connecting to 66.170.111.50:8080
13:47:46:WU04:FS00:0xa8:Completed 50000 out of 500000 steps (10%)
13:48:14:WARNING:WU02:FS00:Exception: Failed to send results to work server: 10002: Received short response, expected 512 bytes, got 0
13:49:00:WU04:FS00:0xa8:Completed 55000 out of 500000 steps (11%)
13:50:15:WU04:FS00:0xa8:Completed 60000 out of 500000 steps (12%)
13:51:28:WU04:FS00:0xa8:Completed 65000 out of 500000 steps (13%)
13:52:41:WU04:FS00:0xa8:Completed 70000 out of 500000 steps (14%)
13:53:55:WU04:FS00:0xa8:Completed 75000 out of 500000 steps (15%)
13:55:09:WU04:FS00:0xa8:Completed 80000 out of 500000 steps (16%)
13:56:23:WU04:FS00:0xa8:Completed 85000 out of 500000 steps (17%)
13:57:37:WU04:FS00:0xa8:Completed 90000 out of 500000 steps (18%)
13:58:50:WU04:FS00:0xa8:Completed 95000 out of 500000 steps (19%)
14:00:04:WU04:FS00:0xa8:Completed 100000 out of 500000 steps (20%)
14:01:18:WU04:FS00:0xa8:Completed 105000 out of 500000 steps (21%)
14:02:32:WU04:FS00:0xa8:Completed 110000 out of 500000 steps (22%)
14:03:46:WU04:FS00:0xa8:Completed 115000 out of 500000 steps (23%)
14:05:00:WU04:FS00:0xa8:Completed 120000 out of 500000 steps (24%)
14:06:15:WU04:FS00:0xa8:Completed 125000 out of 500000 steps (25%)
14:07:29:WU04:FS00:0xa8:Completed 130000 out of 500000 steps (26%)
14:08:43:WU04:FS00:0xa8:Completed 135000 out of 500000 steps (27%)
14:09:57:WU04:FS00:0xa8:Completed 140000 out of 500000 steps (28%)
14:11:11:WU04:FS00:0xa8:Completed 145000 out of 500000 steps (29%)
14:12:24:WU04:FS00:0xa8:Completed 150000 out of 500000 steps (30%)
14:13:38:WU04:FS00:0xa8:Completed 155000 out of 500000 steps (31%)
14:14:52:WU04:FS00:0xa8:Completed 160000 out of 500000 steps (32%)
14:15:31:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:17408 run:0 clone:817 gen:185 core:0xa7 unit:0x000000d042aa6f325f6184d73d5fd1ad
14:15:31:WU00:FS00:Uploading 9.94MiB to 66.170.111.50
14:15:31:WU00:FS00:Connecting to 66.170.111.50:8080
14:16:02:WARNING:WU00:FS00:Exception: Failed to send results to work server: 10002: Received short response, expected 512 bytes, got 0
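The failed uploads are easy to miss among the progress lines. Below is a rough sketch that pulls them out of a client log like the one above; it assumes the line layout matches this excerpt exactly, and `failed_uploads` is a hypothetical helper name, not part of FAHClient.

```python
import re

# Patterns written against the log lines shown above (an assumption about
# the general format, based only on this excerpt).
SEND_RE = re.compile(
    r"^(\d\d:\d\d:\d\d):(WU\d+):FS\d+:Sending unit results:.*?"
    r"project:(\d+) run:(\d+) clone:(\d+) gen:(\d+)"
)
SERVER_RE = re.compile(r"^\d\d:\d\d:\d\d:(WU\d+):FS\d+:Uploading \S+ to (\S+)")
FAIL_RE = re.compile(
    r"^(\d\d:\d\d:\d\d):WARNING:(WU\d+):FS\d+:Exception: Failed to send results"
)


def failed_uploads(lines):
    """Return (time, wu, server, (project, run, clone, gen)) per failed send."""
    pending = {}   # WU slot -> details of the most recent send attempt
    failures = []
    for line in lines:
        line = line.strip()
        if m := SEND_RE.match(line):
            prcg = tuple(int(g) for g in m.groups()[2:])
            pending[m.group(2)] = {"prcg": prcg, "server": None}
        elif m := SERVER_RE.match(line):
            if m.group(1) in pending:
                pending[m.group(1)]["server"] = m.group(2)
        elif m := FAIL_RE.match(line):
            info = pending.get(m.group(2))
            if info:
                failures.append(
                    (m.group(1), m.group(2), info["server"], info["prcg"])
                )
    return failures
```

Feeding it the lines of the log file (for example `failed_uploads(open("log.txt"))`, where the file name is a placeholder) should list each unit that hit the "Received short response" exception, together with its project, run, clone, and gen numbers for reporting.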
Re: 66.170.111.50 Connection refused error
Posted: Wed Dec 02, 2020 11:08 pm
by comixgoddess
Since I initially posted, I have had several units upload successfully to this server, all of which have appeared on the WU Status page in a relatively short time. However, the original work unit completed on November 5 (project:17407 run:0 clone:109 gen:136) is still coming up as "not found".
Re: 66.170.111.50 Connection refused error
Posted: Thu Dec 10, 2020 8:02 pm
by bruce
Recovering the stats that were "lost" is a manual operation: Joseph and/or the owner of that project needs to grep the server logs and reprocess the missing items. I believe Joseph has a script that makes this a lot easier than doing it by hand.
One of them needs to be notified that this needs to be done.