171.67.108.11, .25, assign3.stanford.edu

Moderators: Site Moderators, FAHC Science Team

t-fh
Posts: 15
Joined: Tue Jul 12, 2011 9:27 am

Post by t-fh »

Here we go again.

15:15:13:Unit 02:Completed 90%
15:17:45:Sending unit results: id:01 state:SEND project:10514 run:8 clone:889 gen:480 core:0x11 unit:0x76490b014e21600b01e0037900082912
15:17:45:Unit 01: Uploading 4.85KiB
15:17:45:Connecting to 171.64.65.61:8080
15:17:45:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:17:45:Trying to send results to collection server
15:17:45:Unit 01: Uploading 4.85KiB
15:17:45:Connecting to 171.67.108.25:8080
15:17:47:WARNING: WorkServer connection failed on port 8080 trying 80
15:17:47:Connecting to 171.67.108.25:80
15:17:48:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:22:56:Unit 02:Completed 91%
15:24:36:Sending unit results: id:01 state:SEND project:10514 run:8 clone:889 gen:480 core:0x11 unit:0x76490b014e21600b01e0037900082912
15:24:36:Unit 01: Uploading 4.85KiB
15:24:36:Connecting to 171.64.65.61:8080
15:24:37:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:24:37:Trying to send results to collection server
15:24:37:Unit 01: Uploading 4.85KiB
15:24:37:Connecting to 171.67.108.25:8080
15:24:38:WARNING: WorkServer connection failed on port 8080 trying 80
15:24:38:Connecting to 171.67.108.25:80
15:24:40:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:30:42:Unit 02:Run: exception thrown during GuardedRun
15:30:42:Unit 02:Run: exception thrown in GuardedRun -- Gromacs cannot continue further.
15:30:42:Unit 02:Going to send back what have done -- stepsTotalG=15000000
15:30:42:Unit 02:Work fraction=0.9198 steps=15000000.
15:30:46:Unit 02:logfile size=28739 infoLength=28739 edr=0 trr=23
15:30:46:Unit 02:+ Opened results file
15:30:46:Unit 02:- Writing 29275 bytes of core data to disk...
15:30:47:Unit 02:Done: 28763 -> 6028 (compressed to 20.9 percent)
15:30:47:Unit 02:  ... Done.
15:30:47:Unit 02:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
15:30:47:Unit 02:
15:30:47:Unit 02:Folding@home Core Shutdown: UNSTABLE_MACHINE
15:30:47:FahCore, running Unit 02, returned: UNSTABLE_MACHINE (122)
15:30:47:Starting Unit 02
15:30:47:Running core: "C:/Program Files/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/G80/Core_11.fah/FahCore_11.exe" -dir 02 -suffix 01 -lifeline 2436 -version 701 -checkpoint 15 -gpu 0
15:30:47:Started core on PID 3696
15:30:47:FahCore 0x11 started
15:30:48:FahCore, running Unit 02, returned: MISSING_WORK_FILES (116)
15:30:48:WARNING: Unit 02 Fatal error, dumping
15:30:48:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:30:48:Unit 02: Uploading 6.39KiB
15:30:48:Connecting to 171.67.108.11:8080
15:30:48:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:30:48:Trying to send results to collection server
15:30:48:Unit 02: Uploading 6.39KiB
15:30:48:Connecting to 171.67.108.25:8080
15:30:48:Connecting to assign-GPU.stanford.edu:80
15:30:49:News: Welcome to Folding@Home
15:30:49:Assigned to work server 171.64.65.61
15:30:49:Requesting new work unit for slot 01: READY gpu:0:"G84 [GeForce 8600M GT]" from 171.64.65.61
15:30:49:Connecting to 171.64.65.61:8080
15:30:49:WARNING: WorkServer connection failed on port 8080 trying 80
15:30:49:Connecting to 171.67.108.25:80
15:30:50:Slot 01: Downloading 72.68KiB
15:30:51:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:30:51:Slot 01: Download complete
15:30:51:Received Unit: id:00 state:DOWNLOAD project:6602 run:3 clone:107 gen:847 core:0x11 unit:0x164a14824e2996eb034f006b000319ca
15:30:51:Starting Unit 00
15:30:51:Running core: "C:/Program Files/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/G80/Core_11.fah/FahCore_11.exe" -dir 00 -suffix 01 -lifeline 2436 -version 701 -checkpoint 15 -gpu 0
15:30:51:Started core on PID 1836
15:30:51:FahCore 0x11 started
15:30:51:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:30:51:Unit 02: Uploading 6.39KiB
15:30:51:Connecting to 171.67.108.11:8080
15:30:51:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:30:52:Unit 00:
15:30:53:Trying to send results to collection server
15:30:53:Unit 00:*------------------------------*
15:30:53:Unit 00:Folding@Home GPU Core
15:30:53:Unit 00:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
15:30:53:Unit 00:
15:30:53:Unit 00:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
15:30:53:Unit 00:Build host: amoeba
15:30:53:Unit 00:Board Type: Nvidia
15:30:53:Unit 00:Core      : 
15:30:53:Unit 00:Preparing to commence simulation
15:30:53:Unit 00:- Looking at optimizations...
15:30:53:Unit 00:DeleteFrameFiles: successfully deleted file=00/wudata_01.ckp
15:30:53:Unit 00:- Created dyn
15:30:53:Unit 00:- Files status OK
15:30:53:Unit 00:- Expanded 73917 -> 383588 (decompressed 518.9 percent)
15:30:53:Unit 00:Called DecompressByteArray: compressed_data_size=73917 data_size=383588, decompressed_data_size=383588 diff=0
15:30:53:Unit 00:- Digital signature verified
15:30:53:Unit 00:
15:30:53:Unit 00:Project: 6602 (Run 3, Clone 107, Gen 847)
15:30:53:Unit 00:
15:30:53:Unit 00:Assembly optimizations on if available.
15:30:53:Unit 00:Entering M.D.
15:30:53:Unit 02: Uploading 6.39KiB
15:30:53:Connecting to 171.67.108.25:8080
15:30:55:WARNING: WorkServer connection failed on port 8080 trying 80
15:30:55:Connecting to 171.67.108.25:80
15:30:56:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:30:57:Unit 00:Tpr hash 00/wudata_01.tpr:  263549288 4011315235 3917081767 2130351881 63853135
15:30:57:Unit 00:
15:30:57:Unit 00:Calling fah_main args: 14 usage=100
15:30:57:Unit 00:
15:30:58:Unit 00:Working on Protein
15:31:01:Unit 00:Client config unavailable.
15:31:01:Unit 00:Starting GUI Server
15:31:51:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:31:51:Unit 02: Uploading 6.39KiB
15:31:51:Connecting to 171.67.108.11:8080
15:31:52:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:31:52:Trying to send results to collection server
15:31:52:Unit 02: Uploading 6.39KiB
15:31:52:Connecting to 171.67.108.25:8080
15:31:53:WARNING: WorkServer connection failed on port 8080 trying 80
15:31:53:Connecting to 171.67.108.25:80
15:31:55:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:33:28:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:33:28:Unit 02: Uploading 6.39KiB
15:33:28:Connecting to 171.67.108.11:8080
15:33:29:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:33:29:Trying to send results to collection server
15:33:29:Unit 02: Uploading 6.39KiB
15:33:29:Connecting to 171.67.108.25:8080
15:33:30:WARNING: WorkServer connection failed on port 8080 trying 80
15:33:30:Connecting to 171.67.108.25:80
15:33:32:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:35:42:Sending unit results: id:01 state:SEND project:10514 run:8 clone:889 gen:480 core:0x11 unit:0x76490b014e21600b01e0037900082912
15:35:42:Unit 01: Uploading 4.85KiB
15:35:42:Connecting to 171.64.65.61:8080
15:35:42:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:35:42:Trying to send results to collection server
15:35:42:Unit 01: Uploading 4.85KiB
15:35:42:Connecting to 171.67.108.25:8080
15:35:44:WARNING: WorkServer connection failed on port 8080 trying 80
15:35:44:Connecting to 171.67.108.25:80
15:35:45:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:36:06:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:36:06:Unit 02: Uploading 6.39KiB
15:36:06:Connecting to 171.67.108.11:8080
15:36:06:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:36:06:Trying to send results to collection server
15:36:06:Unit 02: Uploading 6.39KiB
15:36:06:Connecting to 171.67.108.25:8080
15:36:08:WARNING: WorkServer connection failed on port 8080 trying 80
15:36:08:Connecting to 171.67.108.25:80
15:36:09:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:40:20:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:40:20:Unit 02: Uploading 6.39KiB
15:40:20:Connecting to 171.67.108.11:8080
15:40:20:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:40:20:Trying to send results to collection server
15:40:21:Unit 02: Uploading 6.39KiB
15:40:21:Connecting to 171.67.108.25:8080
15:40:22:WARNING: WorkServer connection failed on port 8080 trying 80
15:40:22:Connecting to 171.67.108.25:80
15:40:23:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:43:15:Unit 00:Completed 1%
15:47:11:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:47:11:Unit 02: Uploading 6.39KiB
15:47:11:Connecting to 171.67.108.11:8080
15:47:12:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:47:12:Trying to send results to collection server
15:47:12:Unit 02: Uploading 6.39KiB
15:47:12:Connecting to 171.67.108.25:8080
15:47:13:WARNING: WorkServer connection failed on port 8080 trying 80
15:47:13:Connecting to 171.67.108.25:80
15:47:15:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:53:39:Sending unit results: id:01 state:SEND project:10514 run:8 clone:889 gen:480 core:0x11 unit:0x76490b014e21600b01e0037900082912
15:53:39:Unit 01: Uploading 4.85KiB
15:53:39:Connecting to 171.64.65.61:8080
15:53:39:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:53:39:Trying to send results to collection server
15:53:39:Unit 01: Uploading 4.85KiB
15:53:39:Connecting to 171.67.108.25:8080
15:53:41:WARNING: WorkServer connection failed on port 8080 trying 80
15:53:41:Connecting to 171.67.108.25:80
15:53:42:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
15:55:31:Unit 00:Completed 2%
15:58:17:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
15:58:17:Unit 02: Uploading 6.39KiB
15:58:17:Connecting to 171.67.108.11:8080
15:58:17:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
15:58:17:Trying to send results to collection server
15:58:17:Unit 02: Uploading 6.39KiB
15:58:17:Connecting to 171.67.108.25:8080
15:58:19:WARNING: WorkServer connection failed on port 8080 trying 80
15:58:19:Connecting to 171.67.108.25:80
15:58:20:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
16:07:50:Unit 00:Completed 3%
16:16:14:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
16:16:14:Unit 02: Uploading 6.39KiB
16:16:14:Connecting to 171.67.108.11:8080
16:16:14:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
16:16:14:Trying to send results to collection server
16:16:14:Unit 02: Uploading 6.39KiB
16:16:14:Connecting to 171.67.108.25:8080
16:16:16:WARNING: WorkServer connection failed on port 8080 trying 80
16:16:16:Connecting to 171.67.108.25:80
16:16:17:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
16:20:11:Unit 00:Completed 4%
16:22:41:Sending unit results: id:01 state:SEND project:10514 run:8 clone:889 gen:480 core:0x11 unit:0x76490b014e21600b01e0037900082912
16:22:41:Unit 01: Uploading 4.85KiB
16:22:41:Connecting to 171.64.65.61:8080
16:22:41:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
16:22:41:Trying to send results to collection server
16:22:41:Unit 01: Uploading 4.85KiB
16:22:41:Connecting to 171.67.108.25:8080
16:22:43:WARNING: WorkServer connection failed on port 8080 trying 80
16:22:43:Connecting to 171.67.108.25:80
16:22:44:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
16:32:30:Unit 00:Completed 5%
16:44:50:Unit 00:Completed 6%
16:45:16:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
16:45:16:Unit 02: Uploading 6.39KiB
16:45:16:Connecting to 171.67.108.11:8080
16:45:16:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
16:45:17:Trying to send results to collection server
16:45:17:Unit 02: Uploading 6.39KiB
16:45:17:Connecting to 171.67.108.25:8080
16:45:19:WARNING: WorkServer connection failed on port 8080 trying 80
16:45:19:Connecting to 171.67.108.25:80
16:45:20:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
Mod Edit: Changed Quote Tags To Code Tags - PantherX
Last edited by t-fh on Fri Jul 29, 2011 8:46 pm, edited 2 times in total.
t-fh
Posts: 15
Joined: Tue Jul 12, 2011 9:27 am

Re: CS: 171.67.108.25 & WS: 171.67.108.11

Post by t-fh »

As an aside, the client lost the SMP slot. Re-adding it doesn't work, either.
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: CS: 171.67.108.25 & WS: 171.67.108.11

Post by bruce »

t-fh wrote:Here we go again.

15:30:42:Unit 02:Run: exception thrown during GuardedRun
15:30:42:Unit 02:Run: exception thrown in GuardedRun -- Gromacs cannot continue further.
15:30:42:Unit 02:Going to send back what have done -- stepsTotalG=15000000
15:30:42:Unit 02:Work fraction=0.9198 steps=15000000.
15:30:46:Unit 02:logfile size=28739 infoLength=28739 edr=0 trr=23
15:30:46:Unit 02:+ Opened results file
15:30:46:Unit 02:- Writing 29275 bytes of core data to disk...
15:30:47:Unit 02:Done: 28763 -> 6028 (compressed to 20.9 percent)
15:30:47:Unit 02:  ... Done.
15:30:47:Unit 02:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
15:30:47:Unit 02:
15:30:47:Unit 02:Folding@home Core Shutdown: UNSTABLE_MACHINE
15:30:47:FahCore, running Unit 02, returned: UNSTABLE_MACHINE (122)
Following errors like UNSTABLE_MACHINE, V7 creates an error report. At the present time, many of the servers (including 171.67.108.11) need to have a later version of the server code to accept those error reports. That's one of the reasons V7 is in open beta -- to work out issues like that. You will continue to see failed upload reports associated with this error.

You need to focus on whatever is causing those errors, not on the fact that the error report is not being uploaded.
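To concentrate on the core errors rather than the failed uploads, one option is to filter the client log for the lines where a FahCore reports its return status. The following is only a minimal sketch: it assumes the log lines look exactly like the ones posted above, and the function name `core_failures` is made up for illustration, not part of the client.

```python
import re

# Matches lines such as:
# 15:30:47:FahCore, running Unit 02, returned: UNSTABLE_MACHINE (122)
CORE_RETURN = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):FahCore, running Unit (?P<unit>\d+), "
    r"returned: (?P<status>\w+) \((?P<code>\d+)\)"
)

def core_failures(lines):
    """Return (time, unit, status, code) for every FahCore return line."""
    results = []
    for line in lines:
        m = CORE_RETURN.match(line.strip())
        if m:
            results.append((m["time"], m["unit"], m["status"], int(m["code"])))
    return results

# Lines copied from the log posted earlier in this topic.
sample = [
    "15:30:47:Unit 02:Folding@home Core Shutdown: UNSTABLE_MACHINE",
    "15:30:47:FahCore, running Unit 02, returned: UNSTABLE_MACHINE (122)",
    "15:30:48:FahCore, running Unit 02, returned: MISSING_WORK_FILES (116)",
    "15:30:48:WARNING: Unit 02 Fatal error, dumping",
]

for entry in core_failures(sample):
    print(entry)
```

Running this over a full log gives a short list of the moments the core gave up, which is a better starting point for troubleshooting than the repeated upload warnings.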
t-fh
Posts: 15
Joined: Tue Jul 12, 2011 9:27 am

Re: CS: 171.67.108.25 & WS: 171.67.108.11

Post by t-fh »

I don't know what's causing the errors. :-)
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud

Re: collection server refusing connection

Post by PantherX »

t-fh wrote:...One thing I haven't mentioned but which should be obvious from the logs I posted is that I use -verbosity 9...
The logs you posted are from the V7 beta client, which doesn't use that flag.
t-fh wrote:I don't know what's causing the errors. :-)
In that case, start here (viewtopic.php?f=19&t=16526) :D
t-fh
Posts: 15
Joined: Tue Jul 12, 2011 9:27 am

171.67.108.25, assign3.stanford.edu

Post by t-fh »

Most recent problems now are:

20:44:57:News: Welcome to Folding@Home
20:44:57:WARNING: Failed to get assignment from 'assign3.stanford.edu:8080': Empty work server assignment
20:44:57:Connecting to assign4.stanford.edu:80
20:44:57:News: Welcome to Folding@Home
20:44:57:WARNING: Failed to get assignment from 'assign4.stanford.edu:80': Empty work server assignment
20:44:57:ERROR: Exception: Could not get an assignment
20:44:57:Sending unit results: id:01 state:SEND project:5768 run:7 clone:92 gen:1927 core:0x11 unit:0x28bc49fc4e2e79130787005c00071688
20:44:57:Unit 01: Uploading 5.38KiB
20:44:57:Connecting to 171.67.108.11:8080
20:44:58:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
20:44:58:Trying to send results to collection server
20:44:58:Unit 01: Uploading 5.38KiB
20:44:58:Connecting to 171.67.108.25:8080
20:44:59:WARNING: WorkServer connection failed on port 8080 trying 80
20:44:59:Connecting to 171.67.108.25:80
20:45:01:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
20:46:33:Connecting to assign3.stanford.edu:8080
20:46:33:News: Welcome to Folding@Home
20:46:33:WARNING: Failed to get assignment from 'assign3.stanford.edu:8080': Empty work server assignment
20:46:33:Connecting to assign4.stanford.edu:80
20:46:34:News: Welcome to Folding@Home
20:46:34:WARNING: Failed to get assignment from 'assign4.stanford.edu:80': Empty work server assignment
20:46:34:ERROR: Exception: Could not get an assignment
20:46:34:Sending unit results: id:01 state:SEND project:5768 run:7 clone:92 gen:1927 core:0x11 unit:0x28bc49fc4e2e79130787005c00071688
20:46:34:Unit 01: Uploading 5.38KiB
20:46:34:Connecting to 171.67.108.11:8080
20:46:35:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
20:46:35:Trying to send results to collection server
20:46:35:Unit 01: Uploading 5.38KiB
20:46:35:Connecting to 171.67.108.25:8080
20:46:36:WARNING: WorkServer connection failed on port 8080 trying 80
20:46:36:Connecting to 171.67.108.25:80
20:46:38:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.
t-fh
Posts: 15
Joined: Tue Jul 12, 2011 9:27 am

Re: 171.67.108.11, .25, assign3.stanford.edu

Post by t-fh »

In the meantime I've checked through the various FAQs and help pages. I now have 4 WUs waiting to be sent. It seems .25 shouldn't be reported either (just wait instead?). Also, http://fah-web.stanford.edu/logs/171.67.108.25.log.html says it's been at "Not accept" for quite some time.
t-fh
Posts: 15
Joined: Tue Jul 12, 2011 9:27 am

Re: 171.67.108.11, .25, assign3.stanford.edu

Post by t-fh »

Also, some WUs are expiring soon.
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 171.67.108.11, .25, assign3.stanford.edu

Post by bruce »

Yes, there is no point in reporting anything about 171.67.108.25.

Look carefully at the log you posted. The work server to report is 171.67.108.11, and that's reflected in the title of this topic. I do not see any problems in http://fah-web.stanford.edu/logs/171.67.108.11.log.html and, in every case, your client tried that server before trying .25.
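The order in which the client tries servers can be read straight from the log. Here is a rough sketch that lists, for each "Sending unit results" entry, the addresses contacted afterwards. It assumes the upload lines are not interleaved with other activity (the first log in this topic does interleave them, so treat the output as a guide only), and `upload_attempts` is a hypothetical helper name.

```python
import re

SEND = re.compile(r"^\d{2}:\d{2}:\d{2}:Sending unit results: id:(\d+) ")
CONNECT = re.compile(r"^\d{2}:\d{2}:\d{2}:Connecting to (\S+)$")
ERROR = re.compile(r"^\d{2}:\d{2}:\d{2}:ERROR:")

def upload_attempts(lines):
    """Return a list of (unit_id, [addresses tried in order]) per upload attempt."""
    attempts = []
    current = None
    for line in lines:
        line = line.strip()
        m = SEND.match(line)
        if m:
            current = (m.group(1), [])
            attempts.append(current)
            continue
        if current is None:
            continue  # ignore connections that are not part of an upload
        m = CONNECT.match(line)
        if m:
            current[1].append(m.group(1))
        elif ERROR.match(line):
            current = None  # the attempt ended with an error

    return attempts

# Lines copied from the second log posted in this topic.
sample = [
    "20:44:57:ERROR: Exception: Could not get an assignment",
    "20:44:57:Sending unit results: id:01 state:SEND project:5768 run:7 clone:92 gen:1927 core:0x11 unit:0x28bc49fc4e2e79130787005c00071688",
    "20:44:57:Unit 01: Uploading 5.38KiB",
    "20:44:57:Connecting to 171.67.108.11:8080",
    "20:44:58:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK",
    "20:44:58:Trying to send results to collection server",
    "20:44:58:Unit 01: Uploading 5.38KiB",
    "20:44:58:Connecting to 171.67.108.25:8080",
    "20:44:59:WARNING: WorkServer connection failed on port 8080 trying 80",
    "20:44:59:Connecting to 171.67.108.25:80",
    "20:45:01:ERROR: Exception: Failed to connect to 171.67.108.25:80: No connection could be made because the target machine actively refused it.",
]

for unit, servers in upload_attempts(sample):
    print(unit, servers)
```

In this excerpt the first address in the list is the work server (171.67.108.11) and .25 only appears as the fallback, which is why .11 is the one worth reporting.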
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 171.67.108.11, .25, assign3.stanford.edu

Post by bruce »

As far as WUs expiring soon, look carefully at the first log that you posted. Here are some of the important lines:

    15:15:13:Unit 02:Completed 90%
    15:30:42:Unit 02:Run: exception thrown in GuardedRun -- Gromacs cannot continue further.
    15:30:42:Unit 02:Going to send back what have done -- stepsTotalG=15000000
    15:30:42:Unit 02:Work fraction=0.9198 steps=15000000.
    15:30:47:Unit 02:Folding@home Core Shutdown: UNSTABLE_MACHINE
    15:30:47:FahCore, running Unit 02, returned: UNSTABLE_MACHINE (122)
    15:30:48:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685
    15:30:48:Unit 02: Uploading 6.39KiB
    15:30:48:Connecting to 171.67.108.11:8080
    15:30:48:WARNING: Exception: Failed to send results to work server: Failed to read response packet: HTTP_OK
This WU failed with an UNSTABLE_MACHINE error. The client created an error report showing that the WU had an error and attempted to upload the error report. This server is running an older version of the server code and it doesn't know how to process this error report. At some point, this will be fixed in either the V7 client or the server code. In the meantime, your client cannot upload that error report. Whether it expires or not should be of no concern to you. You're not going to get credit for completing the WU because you didn't complete it.

Had that error report been uploaded (say, to a server with newer server code), that server might have used it to decide whether the WU was corrupt or your hardware was failing. Without that server-based feature, the Mods have to wait and see if others have been able to complete the same WU before deciding if it should be reported as a bad WU. In this case, the WU was reassigned every time your machine dumped it and it was successfully completed by quite a number of others.

That log shows that at least one of the uploads that is failing is a result of an earlier hardware error. You have not posted enough information to conclude that everything you're trying to upload is the result of a hardware error, but it's a pretty good guess.
t-fh wrote:I don't know what's causing the errors. :-)
In that case, start here (viewtopic.php?f=19&t=16526) :D
It's up to you to fix your hardware so it's reliable or to stop folding with it.
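If you want to check how many of the stuck uploads trace back to a core error, something along these lines may help. It is a sketch under two assumptions that are mine rather than anything documented: that `id:NN` in a "Sending unit results" line refers to the same unit as `Unit NN` in the FahCore lines, and that unit numbers are not reused within the stretch of log being examined. The name `pending_upload_causes` is hypothetical.

```python
import re

CORE_RETURN = re.compile(
    r"^\d{2}:\d{2}:\d{2}:FahCore, running Unit (\d+), returned: (\w+) \((\d+)\)"
)
SEND = re.compile(r"^\d{2}:\d{2}:\d{2}:Sending unit results: id:(\d+) ")

def pending_upload_causes(lines):
    """Map each unit id being uploaded to the first FahCore status logged
    for that unit earlier in the same log, or None if the log shows none."""
    first_status = {}
    causes = {}
    for line in lines:
        line = line.strip()
        m = CORE_RETURN.match(line)
        if m:
            first_status.setdefault(m.group(1), m.group(2))
            continue
        m = SEND.match(line)
        if m:
            causes.setdefault(m.group(1), first_status.get(m.group(1)))
    return causes

# Lines copied from the first log posted in this topic.
sample = [
    "15:24:36:Sending unit results: id:01 state:SEND project:10514 run:8 clone:889 gen:480 core:0x11 unit:0x76490b014e21600b01e0037900082912",
    "15:30:47:FahCore, running Unit 02, returned: UNSTABLE_MACHINE (122)",
    "15:30:48:FahCore, running Unit 02, returned: MISSING_WORK_FILES (116)",
    "15:30:48:Sending unit results: id:02 state:SEND project:5765 run:14 clone:287 gen:2528 core:0x11 unit:0x1a922d0b4e21992f09e0011f000e1685",
]

print(pending_upload_causes(sample))
```

A value of None only means the posted log does not show why that unit is waiting; the failure may have happened before the excerpt starts, which is the case for unit 01 here.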
t-fh
Posts: 15
Joined: Tue Jul 12, 2011 9:27 am

Re: 171.67.108.11, .25, assign3.stanford.edu

Post by t-fh »

OK, thanks. I'll check my machine's logs for any crashes etc.