WU stuck, won't send.
Posted: Sat Nov 11, 2017 10:30 pm
Mod - merged with existing topic on this problem - j
Had a work unit finish yesterday, and i just noticed that it for some reason never sent, and keeps giving me an error. Here is the log. 2 other units continue as this one is stuck.
Had a work unit finish yesterday, and i just noticed that it for some reason never sent, and keeps giving me an error. Here is the log. 2 other units continue as this one is stuck.
Code: Select all
*********************** Log Started 2017-11-11T22:09:23Z ***********************
22:09:23:************************* Folding@home Client *************************
22:09:23: Website: http://folding.stanford.edu/
22:09:23: Copyright: (c) 2009-2014 Stanford University
22:09:23: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
22:09:23: Args:
22:09:23: Config: C:/Users/Aaron Klotz/AppData/Roaming/FAHClient/config.xml
22:09:23:******************************** Build ********************************
22:09:23: Version: 7.4.4
22:09:23: Date: Mar 4 2014
22:09:23: Time: 20:26:54
22:09:23: SVN Rev: 4130
22:09:23: Branch: fah/trunk/client
22:09:23: Compiler: Intel(R) C++ MSVC 1500 mode 1200
22:09:23: Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
22:09:23: /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
22:09:23: Platform: win32 XP
22:09:23: Bits: 32
22:09:23: Mode: Release
22:09:23:******************************* System ********************************
22:09:23: CPU: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
22:09:23: CPU ID: GenuineIntel Family 6 Model 94 Stepping 3
22:09:23: CPUs: 8
22:09:23: Memory: 15.96GiB
22:09:23: Free Memory: 13.41GiB
22:09:23: Threads: WINDOWS_THREADS
22:09:23: OS Version: 6.2
22:09:23: Has Battery: false
22:09:23: On Battery: false
22:09:23: UTC Offset: -5
22:09:23: PID: 13212
22:09:23: CWD: C:/Users/Aaron Klotz/AppData/Roaming/FAHClient
22:09:23: OS: Windows 10 Home
22:09:23: OS Arch: AMD64
22:09:23: GPUs: 1
22:09:23: GPU 0: NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372
22:09:23: CUDA: 6.1
22:09:23: CUDA Driver: 9010
22:09:23:Win32 Service: false
22:09:23:***********************************************************************
22:09:23:<config>
22:09:23: <!-- User Information -->
22:09:23: <passkey v='********************************'/>
22:09:23: <team v='223518'/>
22:09:23: <user v='klotza1'/>
22:09:23:
22:09:23: <!-- Folding Slots -->
22:09:23: <slot id='0' type='CPU'/>
22:09:23: <slot id='1' type='GPU'/>
22:09:23:</config>
22:09:23:Trying to access database...
22:09:23:Successfully acquired database lock
22:09:23:Enabled folding slot 00: READY cpu:6
22:09:23:Enabled folding slot 01: READY gpu:0:GP106 [GeForce GTX 1060 6GB] 4372
22:09:23:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8209 run:8 clone:60 gen:82 core:0xa7 unit:0x00000064868b340258ed356ea67678dd
22:09:23:WU00:FS00:Uploading 6.18MiB to 134.139.52.2
22:09:23:WU00:FS00:Connecting to 134.139.52.2:8080
22:09:23:WU02:FS00:Starting
22:09:23:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/Aaron Klotz/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/AVX/Core_a7.fah/FahCore_a7.exe" -dir 02 -suffix 01 -version 704 -lifeline 13212 -checkpoint 15 -np 6
22:09:23:WU02:FS00:Started FahCore on PID 1040
22:09:23:WU02:FS00:Core PID:2520
22:09:23:WU02:FS00:FahCore 0xa7 started
22:09:24:WU01:FS01:Starting
22:09:24:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/Aaron Klotz/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe" -dir 01 -suffix 01 -version 704 -lifeline 13212 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
22:09:24:WU01:FS01:Started FahCore on PID 13324
22:09:24:WU01:FS01:Core PID:13352
22:09:24:WU01:FS01:FahCore 0x21 started
22:09:24:WU02:FS00:0xa7:*********************** Log Started 2017-11-11T22:09:24Z ***********************
22:09:24:WU02:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
22:09:24:WU02:FS00:0xa7: Type: 0xa7
22:09:24:WU02:FS00:0xa7: Core: Gromacs
22:09:24:WU02:FS00:0xa7: Website: http://folding.stanford.edu/
22:09:24:WU02:FS00:0xa7: Copyright: (c) 2009-2016 Stanford University
22:09:24:WU02:FS00:0xa7: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
22:09:24:WU02:FS00:0xa7: Args: -dir 02 -suffix 01 -version 704 -lifeline 1040 -checkpoint 15 -np 6
22:09:24:WU02:FS00:0xa7: Config: <none>
22:09:24:WU02:FS00:0xa7:************************************ Build *************************************
22:09:24:WU02:FS00:0xa7: Version: 0.0.11
22:09:24:WU02:FS00:0xa7: Date: Sep 21 2016
22:09:24:WU02:FS00:0xa7: Time: 01:43:48
22:09:24:WU02:FS00:0xa7: Repository: Git
22:09:24:WU02:FS00:0xa7: Revision: 957bd90e68d95ddcf1594dc15ff6c64cc4555146
22:09:24:WU02:FS00:0xa7: Branch: master
22:09:24:WU02:FS00:0xa7: Compiler: GNU 4.2.1 Compatible Clang 3.9.0 (trunk 274080)
22:09:24:WU02:FS00:0xa7: Options: -std=gnu++98 -O3 -funroll-loops -ffast-math -mfpmath=sse
22:09:24:WU02:FS00:0xa7: -fno-unsafe-math-optimizations -msse2 -I/mingw64/include
22:09:24:WU02:FS00:0xa7: -Wno-inconsistent-dllimport -Wno-parentheses-equality
22:09:24:WU02:FS00:0xa7: -Wno-deprecated-register -Wno-unused-local-typedef
22:09:24:WU02:FS00:0xa7: Platform: linux2 4.6.0-1-amd64
22:09:24:WU02:FS00:0xa7: Bits: 64
22:09:24:WU02:FS00:0xa7: Mode: Release
22:09:24:WU02:FS00:0xa7: SIMD: avx_256
22:09:24:WU02:FS00:0xa7:************************************ System ************************************
22:09:24:WU02:FS00:0xa7: CPU: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
22:09:24:WU02:FS00:0xa7: CPU ID: GenuineIntel Family 6 Model 94 Stepping 3
22:09:24:WU02:FS00:0xa7: CPUs: 8
22:09:24:WU02:FS00:0xa7: Memory: 15.96GiB
22:09:24:WU02:FS00:0xa7:Free Memory: 13.35GiB
22:09:24:WU02:FS00:0xa7: Threads: WINDOWS_THREADS
22:09:24:WU02:FS00:0xa7: OS Version: 6.2
22:09:24:WU02:FS00:0xa7:Has Battery: false
22:09:24:WU02:FS00:0xa7: On Battery: false
22:09:24:WU02:FS00:0xa7: UTC Offset: -5
22:09:24:WU02:FS00:0xa7: PID: 2520
22:09:24:WU02:FS00:0xa7: CWD: C:\Users\Aaron Klotz\AppData\Roaming\FAHClient\work
22:09:24:WU02:FS00:0xa7: OS: Windows 10 Home
22:09:24:WU02:FS00:0xa7: OS Arch: AMD64
22:09:24:WU02:FS00:0xa7:********************************************************************************
22:09:24:WU02:FS00:0xa7:Project: 13708 (Run 7, Clone 64, Gen 1)
22:09:24:WU02:FS00:0xa7:Unit: 0x000000010002894b59bff4e8d7fdd5f8
22:09:24:WU02:FS00:0xa7:Digital signatures verified
22:09:24:WU02:FS00:0xa7:Calling: mdrun -s frame1.tpr -o frame1.trr -cpi state.cpt -cpt 15 -nt 6
22:09:24:WU02:FS00:0xa7:ERROR:Guru Meditation #9d91a3a626fe1335.cdde5e4f9d8913c2 (5417.5531) '02/01/pullx.xvg'
22:09:24:WU02:FS00:0xa7:WARNING:Unexpected exit() call
22:09:24:WU02:FS00:0xa7:WARNING:Unexpected exit from science code
22:09:24:WU02:FS00:0xa7:Saving result file ..\logfile_01.txt
22:09:24:WU02:FS00:0xa7:Saving result file frame1.trr
22:09:24:WU01:FS01:0x21:*********************** Log Started 2017-11-11T22:09:24Z ***********************
22:09:24:WU01:FS01:0x21:Project: 13147 (Run 71, Clone 4, Gen 36)
22:09:24:WU01:FS01:0x21:Unit: 0x00000027ab436c6559e67f79978a733d
22:09:24:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
22:09:24:WU01:FS01:0x21:Machine: 1
22:09:24:WU01:FS01:0x21:Digital signatures verified
22:09:24:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
22:09:24:WU01:FS01:0x21:Version 0.0.18
22:09:24:WU01:FS01:0x21: Found a checkpoint file
22:09:24:WU02:FS00:0xa7:Saving result file md.log
22:09:24:WU02:FS00:0xa7:Saving result file pullf.xvg
22:09:24:WU02:FS00:0xa7:ERROR:Guru Meditation #fd354946e0c669d7.cab850ae0f8e7a03 (2118.2160) '02/01/pullf.xvg'
22:09:25:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:09:25:WU00:FS00:Connecting to 134.139.52.2:80
22:09:32:WARNING:WU02:FS00:FahCore returned: BAD_FRAME_CHECKSUM (112 = 0x70)
22:09:32:WARNING:WU02:FS00:Fatal error, dumping
22:09:32:WU02:FS00:Sending unit results: id:02 state:SEND error:DUMPED project:13708 run:7 clone:64 gen:1 core:0xa7 unit:0x000000010002894b59bff4e8d7fdd5f8
22:09:32:WU02:FS00:Uploading 4.87MiB to 155.247.166.219
22:09:32:WU02:FS00:Connecting to 155.247.166.219:8080
22:09:33:WU03:FS00:Connecting to 171.67.108.45:8080
22:09:34:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 134.139.52.2:80: No connection could be made because the target machine actively refused it.
22:09:34:WU00:FS00:Trying to send results to collection server
22:09:34:WU00:FS00:Uploading 6.18MiB to 134.139.52.3
22:09:34:WU00:FS00:Connecting to 134.139.52.3:8080
22:09:34:WU03:FS00:Assigned to work server 155.247.166.220
22:09:34:WU03:FS00:Requesting new work unit for slot 00: READY cpu:6 from 155.247.166.220
22:09:34:WU03:FS00:Connecting to 155.247.166.220:8080
22:09:35:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:09:35:WU00:FS00:Connecting to 134.139.52.3:80
22:09:35:WU03:FS00:Downloading 649.24KiB
22:09:35:WU03:FS00:Download complete
22:09:36:WU03:FS00:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:14010 run:0 clone:163 gen:0 core:0xa4 unit:0x000000000002894c59e4d1bb33b59518
22:09:36:WU03:FS00:Starting
22:09:36:WU03:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/Aaron Klotz/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe" -dir 03 -suffix 01 -version 704 -lifeline 13212 -checkpoint 15 -np 6
22:09:36:WU03:FS00:Started FahCore on PID 12380
22:09:36:WU03:FS00:Core PID:13556
22:09:36:WU03:FS00:FahCore 0xa4 started
22:09:36:WU03:FS00:0xa4:
22:09:36:WU03:FS00:0xa4:*------------------------------*
22:09:36:WU03:FS00:0xa4:Folding@Home Gromacs GB Core
22:09:36:WU03:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
22:09:36:WU03:FS00:0xa4:
22:09:36:WU03:FS00:0xa4:Preparing to commence simulation
22:09:36:WU03:FS00:0xa4:- Looking at optimizations...
22:09:36:WU03:FS00:0xa4:- Created dyn
22:09:36:WU03:FS00:0xa4:- Files status OK
22:09:36:WU03:FS00:0xa4:- Expanded 664309 -> 1931708 (decompressed 290.7 percent)
22:09:36:WU03:FS00:0xa4:Called DecompressByteArray: compressed_data_size=664309 data_size=1931708, decompressed_data_size=1931708 diff=0
22:09:36:WU03:FS00:0xa4:- Digital signature verified
22:09:36:WU03:FS00:0xa4:
22:09:36:WU03:FS00:0xa4:Project: 14010 (Run 0, Clone 163, Gen 0)
22:09:36:WU03:FS00:0xa4:
22:09:36:WU03:FS00:0xa4:Assembly optimizations on if available.
22:09:36:WU03:FS00:0xa4:Entering M.D.
22:09:36:ERROR:WU00:FS00:Exception: Failed to connect to 134.139.52.3:80: No connection could be made because the target machine actively refused it.
22:09:37:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8209 run:8 clone:60 gen:82 core:0xa7 unit:0x00000064868b340258ed356ea67678dd
22:09:37:WU00:FS00:Uploading 6.18MiB to 134.139.52.2
22:09:37:WU00:FS00:Connecting to 134.139.52.2:8080
22:09:38:WU02:FS00:Upload 52.64%
22:09:38:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:09:38:WU00:FS00:Connecting to 134.139.52.2:80
22:09:39:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 134.139.52.2:80: No connection could be made because the target machine actively refused it.
22:09:39:WU00:FS00:Trying to send results to collection server
22:09:39:WU00:FS00:Uploading 6.18MiB to 134.139.52.3
22:09:39:WU00:FS00:Connecting to 134.139.52.3:8080
22:09:40:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:09:40:WU00:FS00:Connecting to 134.139.52.3:80
22:09:42:ERROR:WU00:FS00:Exception: Failed to connect to 134.139.52.3:80: No connection could be made because the target machine actively refused it.
22:09:42:WU03:FS00:0xa4:Mapping NT from 6 to 6
22:09:42:WU03:FS00:0xa4:Completed 0 out of 2500000 steps (0%)
22:09:42:WU02:FS00:Upload complete
22:09:43:WU02:FS00:Server responded WORK_QUIT (404)
22:09:43:WARNING:WU02:FS00:Server did not like results, dumping
22:09:43:WU02:FS00:Cleaning up
22:09:44:WU01:FS01:0x21:Completed 280000 out of 520000 steps (53%)
22:09:44:WU01:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
22:09:54:WU01:FS01:0x21:Completed 280800 out of 520000 steps (54%)
22:10:37:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8209 run:8 clone:60 gen:82 core:0xa7 unit:0x00000064868b340258ed356ea67678dd
22:10:37:WU00:FS00:Uploading 6.18MiB to 134.139.52.2
22:10:37:WU00:FS00:Connecting to 134.139.52.2:8080
22:10:38:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:10:38:WU00:FS00:Connecting to 134.139.52.2:80
22:10:39:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 134.139.52.2:80: No connection could be made because the target machine actively refused it.
22:10:39:WU00:FS00:Trying to send results to collection server
22:10:39:WU00:FS00:Uploading 6.18MiB to 134.139.52.3
22:10:39:WU00:FS00:Connecting to 134.139.52.3:8080
22:10:53:WU01:FS01:0x21:Completed 286000 out of 520000 steps (55%)
22:11:01:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:11:01:WU00:FS00:Connecting to 134.139.52.3:80
22:11:02:ERROR:WU00:FS00:Exception: Failed to connect to 134.139.52.3:80: No connection could be made because the target machine actively refused it.
22:11:52:WU01:FS01:0x21:Completed 291200 out of 520000 steps (56%)
22:12:14:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8209 run:8 clone:60 gen:82 core:0xa7 unit:0x00000064868b340258ed356ea67678dd
22:12:14:WU00:FS00:Uploading 6.18MiB to 134.139.52.2
22:12:14:WU00:FS00:Connecting to 134.139.52.2:8080
22:12:15:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:12:15:WU00:FS00:Connecting to 134.139.52.2:80
22:12:17:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 134.139.52.2:80: No connection could be made because the target machine actively refused it.
22:12:17:WU00:FS00:Trying to send results to collection server
22:12:17:WU00:FS00:Uploading 6.18MiB to 134.139.52.3
22:12:17:WU00:FS00:Connecting to 134.139.52.3:8080
22:12:18:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:12:18:WU00:FS00:Connecting to 134.139.52.3:80
22:12:20:ERROR:WU00:FS00:Exception: Failed to connect to 134.139.52.3:80: No connection could be made because the target machine actively refused it.
22:12:52:WU01:FS01:0x21:Completed 296400 out of 520000 steps (57%)
22:13:51:WU01:FS01:0x21:Completed 301600 out of 520000 steps (58%)
22:14:50:WU01:FS01:0x21:Completed 306800 out of 520000 steps (59%)
22:14:51:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8209 run:8 clone:60 gen:82 core:0xa7 unit:0x00000064868b340258ed356ea67678dd
22:14:51:WU00:FS00:Uploading 6.18MiB to 134.139.52.2
22:14:51:WU00:FS00:Connecting to 134.139.52.2:8080
22:14:52:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:14:52:WU00:FS00:Connecting to 134.139.52.2:80
22:14:54:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 134.139.52.2:80: No connection could be made because the target machine actively refused it.
22:14:54:WU00:FS00:Trying to send results to collection server
22:14:54:WU00:FS00:Uploading 6.18MiB to 134.139.52.3
22:14:54:WU00:FS00:Connecting to 134.139.52.3:8080
22:14:55:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:14:55:WU00:FS00:Connecting to 134.139.52.3:80
22:14:57:ERROR:WU00:FS00:Exception: Failed to connect to 134.139.52.3:80: No connection could be made because the target machine actively refused it.
22:15:02:WU03:FS00:0xa4:Completed 25000 out of 2500000 steps (1%)
22:15:49:WU01:FS01:0x21:Completed 312000 out of 520000 steps (60%)
22:16:48:WU01:FS01:0x21:Completed 317200 out of 520000 steps (61%)
22:17:52:WU01:FS01:0x21:Completed 322400 out of 520000 steps (62%)
22:18:51:WU01:FS01:0x21:Completed 327600 out of 520000 steps (63%)
22:19:05:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8209 run:8 clone:60 gen:82 core:0xa7 unit:0x00000064868b340258ed356ea67678dd
22:19:05:WU00:FS00:Uploading 6.18MiB to 134.139.52.2
22:19:05:WU00:FS00:Connecting to 134.139.52.2:8080
22:19:07:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:19:07:WU00:FS00:Connecting to 134.139.52.2:80
22:19:08:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 134.139.52.2:80: No connection could be made because the target machine actively refused it.
22:19:08:WU00:FS00:Trying to send results to collection server
22:19:08:WU00:FS00:Uploading 6.18MiB to 134.139.52.3
22:19:08:WU00:FS00:Connecting to 134.139.52.3:8080
22:19:09:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
22:19:09:WU00:FS00:Connecting to 134.139.52.3:80
22:19:11:ERROR:WU00:FS00:Exception: Failed to connect to 134.139.52.3:80: No connection could be made because the target machine actively refused it.
22:19:51:WU01:FS01:0x21:Completed 332800 out of 520000 steps (64%)
22:20:21:WU03:FS00:0xa4:Completed 50000 out of 2500000 steps (2%)