Page 1 of 1

171.67.108.35 Down

Posted: Mon Dec 17, 2012 6:11 pm
by carrotworks
Hi,

Just attempted to return a p8069 (r0, c438, g83) and it appears that 171.67.108.35 is down. Connection failed on both ports 80 and 8080 due to timeout by the server, and the WU was sucessfully uploaded instead to the collection server, 171.65.103.160.

Looking at the server status page reports that 171.67.108.34, 171.67.108.35 and 171.67.108.36 are all DOWN, and checking the individual server logs shows that they have all been so since 06:40 PST. Unless there's some maintenance I'm not aware of, this doesn't seem normal.

Log is below, error's when uploading the second WU. Apologies about verbosity 5!

Code: Select all

*********************** Log Started 2012-12-16T21:17:06Z ***********************
21:17:06:************************* Folding@home Client *************************
21:17:06:      Website: http://folding.stanford.edu/
21:17:06:    Copyright: (c) 2009-2012 Stanford University
21:17:06:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:17:06:         Args: --lifeline 900 --command-port=36330
21:17:06:       Config: C:/Users/Ellen/AppData/Roaming/FAHClient/config.xml
21:17:06:******************************** Build ********************************
21:17:06:      Version: 7.2.9
21:17:06:         Date: Oct 3 2012
21:17:06:         Time: 18:02:40
21:17:06:      SVN Rev: 3578
21:17:06:       Branch: fah/trunk/client
21:17:06:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
21:17:06:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE2
21:17:06:               /QaxSSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
21:17:06:     Platform: win32 Vista
21:17:06:         Bits: 32
21:17:06:         Mode: Release
21:17:06:******************************* System ********************************
21:17:06:          CPU: Intel(R) Core(TM)2 Duo CPU T9400 @ 2.53GHz
21:17:06:       CPU ID: GenuineIntel Family 6 Model 23 Stepping 6
21:17:06:         CPUs: 2
21:17:06:       Memory: 3.96GiB
21:17:06:  Free Memory: 2.85GiB
21:17:06:      Threads: WINDOWS_THREADS
21:17:06:   On Battery: false
21:17:06:   UTC offset: 0
21:17:06:          PID: 1612
21:17:06:          CWD: C:/Users/Ellen/AppData/Roaming/FAHClient
21:17:06:           OS: Windows 7 Home Premium
21:17:06:      OS Arch: AMD64
21:17:06:         GPUs: 0
21:17:06:         CUDA: Not detected
21:17:06:Win32 Service: false
21:17:06:***********************************************************************
21:17:06:<config>
21:17:06:  <service-description v='Folding@home Client'/>
21:17:06:  <service-restart v='true'/>
21:17:06:  <service-restart-delay v='5000'/>
21:17:06:
21:17:06:  <!-- Client Control -->
21:17:06:  <cycle-rate v='4'/>
21:17:06:  <cycles v='-1'/>
21:17:06:  <data-directory v='.'/>
21:17:06:  <disable-project-lookup v='false'/>
21:17:06:  <exec-directory v='C:\Program Files (x86)\FAHClient'/>
21:17:06:  <exit-when-done v='false'/>
21:17:06:  <threads v='4'/>
21:17:06:
21:17:06:  <!-- Configuration -->
21:17:06:  <config-rotate v='true'/>
21:17:06:  <config-rotate-dir v='configs'/>
21:17:06:  <config-rotate-max v='16'/>
21:17:06:
21:17:06:  <!-- Debugging -->
21:17:06:  <assignment-servers>
21:17:06:    assign3.stanford.edu:8080 assign4.stanford.edu:80
21:17:06:  </assignment-servers>
21:17:06:  <capture-directory v='capture'/>
21:17:06:  <capture-sockets v='false'/>
21:17:06:  <debug-sockets v='false'/>
21:17:06:  <exception-locations v='true'/>
21:17:06:  <gpu-assignment-servers>
21:17:06:    assign-GPU.stanford.edu:80 assign-GPU.stanford.edu:8080
21:17:06:  </gpu-assignment-servers>
21:17:06:  <stack-traces v='false'/>
21:17:06:
21:17:06:  <!-- Error Handling -->
21:17:06:  <max-slot-errors v='5'/>
21:17:06:  <max-unit-errors v='5'/>
21:17:06:
21:17:06:  <!-- FahCore Control -->
21:17:06:  <checkpoint v='15'/>
21:17:06:  <core-dir v='cores'/>
21:17:06:  <core-priority v='idle'/>
21:17:06:  <cpu-affinity v='false'/>
21:17:06:  <cpu-usage v='100'/>
21:17:06:  <no-assembly v='false'/>
21:17:06:
21:17:06:  <!-- Folding Slot Configuration -->
21:17:06:  <cause-pref v='ANY'/>
21:17:06:  <client-subtype v='STDCLI'/>
21:17:06:  <client-type v='normal'/>
21:17:06:  <cpu-species v='X86_PENTIUM_II'/>
21:17:06:  <cpu-type v='AMD64'/>
21:17:06:  <cpus v='-1'/>
21:17:06:  <cuda-index v='0'/>
21:17:06:  <extra-core-args v='-forceasm'/>
21:17:06:  <gpu v='false'/>
21:17:06:  <gpu-usage v='100'/>
21:17:06:  <max-packet-size v='normal'/>
21:17:06:  <opencl-index v='0'/>
21:17:06:  <os-species v='UNKNOWN'/>
21:17:06:  <os-type v='WIN32'/>
21:17:06:  <project-key v='0'/>
21:17:06:  <smp v='true'/>
21:17:06:
21:17:06:  <!-- Logging -->
21:17:06:  <log v='log.txt'/>
21:17:06:  <log-color v='false'/>
21:17:06:  <log-crlf v='true'/>
21:17:06:  <log-date v='false'/>
21:17:06:  <log-date-periodically v='21600'/>
21:17:06:  <log-debug v='true'/>
21:17:06:  <log-domain v='false'/>
21:17:06:  <log-header v='true'/>
21:17:06:  <log-level v='true'/>
21:17:06:  <log-no-info-header v='true'/>
21:17:06:  <log-redirect v='false'/>
21:17:06:  <log-rotate v='true'/>
21:17:06:  <log-rotate-dir v='logs'/>
21:17:06:  <log-rotate-max v='16'/>
21:17:06:  <log-short-level v='false'/>
21:17:06:  <log-simple-domains v='true'/>
21:17:06:  <log-thread-id v='false'/>
21:17:06:  <log-thread-prefix v='true'/>
21:17:06:  <log-time v='true'/>
21:17:06:  <log-to-screen v='true'/>
21:17:06:  <log-truncate v='false'/>
21:17:06:  <verbosity v='5'/>
21:17:06:
21:17:06:  <!-- Network -->
21:17:06:  <proxy v=':8080'/>
21:17:06:  <proxy-enable v='false'/>
21:17:06:  <proxy-pass v=''/>
21:17:06:  <proxy-user v=''/>
21:17:06:
21:17:06:  <!-- Process Control -->
21:17:06:  <child v='false'/>
21:17:06:  <daemon v='false'/>
21:17:06:  <pid v='false'/>
21:17:06:  <pid-file v='Folding@home Client.pid'/>
21:17:06:  <respawn v='false'/>
21:17:06:  <service v='false'/>
21:17:06:
21:17:06:  <!-- Remote Command Server -->
21:17:06:  <command-address v='0.0.0.0'/>
21:17:06:  <command-allow v='127.0.0.1'/>
21:17:06:  <command-allow-no-pass v='127.0.0.1'/>
21:17:06:  <command-deny v='0.0.0.0/0'/>
21:17:06:  <command-deny-no-pass v='0.0.0.0/0'/>
21:17:06:  <command-port v='36330'/>
21:17:06:  <password v=''/>
21:17:06:
21:17:06:  <!-- Slot Control -->
21:17:06:  <max-shutdown-wait v='60'/>
21:17:06:  <pause-on-battery v='false'/>
21:17:06:  <pause-on-start v='false'/>
21:17:06:
21:17:06:  <!-- User Information -->
21:17:06:  <machine-id v='0'/>
21:17:06:  <passkey v='********************************'/>
21:17:06:  <team v='0'/>
21:17:06:  <user v='eshhvn.WvhzX%40FvnxsaE.ev.Sz'/>
21:17:06:
21:17:06:  <!-- Work Unit Control -->
21:17:06:  <dump-after-deadline v='true'/>
21:17:06:  <max-queue v='16'/>
21:17:06:  <max-units v='0'/>
21:17:06:  <next-unit-percentage v='99'/>
21:17:06:
21:17:06:  <!-- Folding Slots -->
21:17:06:  <slot id='0' type='SMP'>
21:17:06:    <client-type v='advanced'/>
21:17:06:    <max-packet-size v='big'/>
21:17:06:  </slot>
21:17:06:</config>
21:17:06:Trying to access database...
21:17:07:Successfully acquired database lock
21:17:07:Enabled folding slot 00: READY smp:2
21:17:07:Started thread 1 on PID 1612
21:17:07:Started thread 4 on PID 1612
21:17:07:Started thread 6 on PID 1612
21:17:07:Started thread 5 on PID 1612
21:17:07:Started thread 3 on PID 1612
21:17:07:WU00:FS00:Starting
21:17:07:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Ellen/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 702 -lifeline 1612 -checkpoint 15 -np 2 -forceasm
21:17:07:WU00:FS00:Started FahCore on PID 1740
21:17:07:Started thread 7 on PID 1612
21:17:07:WU00:FS00:Core PID:1840
21:17:07:WU00:FS00:FahCore 0xa4 started
21:17:07:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
21:17:07:Started thread 8 on PID 1612
21:17:08:WU00:FS00:0xa4:
21:17:08:WU00:FS00:0xa4:*------------------------------*
21:17:08:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
21:17:08:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:17:08:WU00:FS00:0xa4:
21:17:08:WU00:FS00:0xa4:Preparing to commence simulation
21:17:08:WU00:FS00:0xa4:- Ensuring status. Please wait.
21:17:17:WU00:FS00:0xa4:- Assembly optimizations manually forced on.
21:17:17:WU00:FS00:0xa4:- Not checking prior termination.
21:17:18:WU00:FS00:0xa4:- Expanded 1023352 -> 2527872 (decompressed 247.0 percent)
21:17:18:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=1023352 data_size=2527872, decompressed_data_size=2527872 diff=0
21:17:18:WU00:FS00:0xa4:- Digital signature verified
21:17:18:WU00:FS00:0xa4:
21:17:18:WU00:FS00:0xa4:Project: 8069 (Run 3, Clone 762, Gen 53)
21:17:18:WU00:FS00:0xa4:
21:17:18:WU00:FS00:0xa4:Assembly optimizations on if available.
21:17:18:WU00:FS00:0xa4:Entering M.D.
21:17:19:FS00:Finishing
21:17:24:WU00:FS00:0xa4:Using Gromacs checkpoints
21:17:24:WU00:FS00:0xa4:Mapping NT from 2 to 2 
21:17:25:WU00:FS00:0xa4:Resuming from checkpoint
21:17:25:WU00:FS00:0xa4:Verified 00/wudata_01.log
21:17:25:WU00:FS00:0xa4:Verified 00/wudata_01.trr
21:17:25:WU00:FS00:0xa4:Verified 00/wudata_01.xtc
21:17:25:WU00:FS00:0xa4:Verified 00/wudata_01.edr
21:17:25:WU00:FS00:0xa4:Completed 58970 out of 250000 steps  (23%)
21:17:27:FS00:Finishing
21:19:45:WU00:FS00:0xa4:Completed 60000 out of 250000 steps  (24%)
21:25:26:WU00:FS00:0xa4:Completed 62500 out of 250000 steps  (25%)
21:31:07:WU00:FS00:0xa4:Completed 65000 out of 250000 steps  (26%)
21:36:48:WU00:FS00:0xa4:Completed 67500 out of 250000 steps  (27%)

---SNIP---

04:14:12:WU00:FS00:0xa4:Completed 242500 out of 250000 steps  (97%)
04:19:50:WU00:FS00:0xa4:Completed 245000 out of 250000 steps  (98%)
04:25:31:WU00:FS00:0xa4:Completed 247500 out of 250000 steps  (99%)
04:31:11:WU00:FS00:0xa4:Completed 250000 out of 250000 steps  (100%)
04:31:12:WU00:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
04:31:22:WU00:FS00:0xa4:
04:31:22:WU00:FS00:0xa4:Finished Work Unit:
04:31:22:WU00:FS00:0xa4:- Reading up to 1567212 from "00/wudata_01.trr": Read 1567212
04:31:22:WU00:FS00:0xa4:trr file hash check passed.
04:31:22:WU00:FS00:0xa4:- Reading up to 1757840 from "00/wudata_01.xtc": Read 1757840
04:31:22:WU00:FS00:0xa4:xtc file hash check passed.
04:31:22:WU00:FS00:0xa4:edr file hash check passed.
04:31:22:WU00:FS00:0xa4:logfile size: 28871
04:31:22:WU00:FS00:0xa4:Leaving Run
04:31:23:WU00:FS00:0xa4:- Writing 3363195 bytes of core data to disk...
04:31:24:WU00:FS00:0xa4:Done: 3362683 -> 3243084 (compressed to 96.4 percent)
04:31:24:WU00:FS00:0xa4:  ... Done.
04:31:25:WU00:FS00:0xa4:- Shutting down core
04:31:25:WU00:FS00:0xa4:
04:31:25:WU00:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
04:31:26:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
04:31:26:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8069 run:3 clone:762 gen:53 core:0xa4 unit:0x000000376652edb350ad63e6a5223586
04:31:26:WU00:FS00:Uploading 3.09MiB to 171.67.108.35
04:31:26:WU00:FS00:Connecting to 171.67.108.35:8080
04:31:32:WU00:FS00:Upload 8.08%
04:31:38:WU00:FS00:Upload 18.18%
04:31:44:WU00:FS00:Upload 26.27%
04:31:51:WU00:FS00:Upload 36.37%
04:31:58:WU00:FS00:Upload 46.47%
04:32:04:WU00:FS00:Upload 56.57%
04:32:10:WU00:FS00:Upload 64.66%
04:32:17:WU00:FS00:Upload 74.76%
04:32:23:WU00:FS00:Upload 84.86%
04:32:29:WU00:FS00:Upload 92.94%
04:32:35:WU00:FS00:Upload complete
04:32:35:WU00:FS00:Server responded WORK_ACK (400)
04:32:35:WU00:FS00:Final credit estimate, 751.00 points
04:32:35:WU00:FS00:Cleaning up
08:17:45:WU00:FS00:Connecting to assign3.stanford.edu:8080
08:17:47:WU00:FS00:News: Welcome to Folding@Home
08:17:47:WU00:FS00:Assigned to work server 171.67.108.35
08:17:47:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 171.67.108.35
08:17:47:WU00:FS00:Connecting to 171.67.108.35:8080
08:17:48:WU00:FS00:Downloading 999.63KiB
08:17:52:WU00:FS00:Download complete
08:17:52:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:8069 run:0 clone:438 gen:83 core:0xa4 unit:0x000000566652edb350ad5a37ba7a67db
08:17:52:WU00:FS00:Starting
08:17:52:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Ellen/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 702 -lifeline 1612 -checkpoint 15 -np 2 -forceasm
08:17:52:WU00:FS00:Started FahCore on PID 1868
08:17:52:Started thread 10 on PID 1612
08:17:52:WU00:FS00:Core PID:1892
08:17:52:WU00:FS00:FahCore 0xa4 started
08:17:52:WU00:FS00:0xa4:
08:17:52:WU00:FS00:0xa4:*------------------------------*
08:17:52:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
08:17:52:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
08:17:52:WU00:FS00:0xa4:
08:17:52:WU00:FS00:0xa4:Preparing to commence simulation
08:17:52:WU00:FS00:0xa4:- Assembly optimizations manually forced on.
08:17:52:WU00:FS00:0xa4:- Not checking prior termination.
08:17:52:WU00:FS00:0xa4:- Expanded 1023107 -> 2527872 (decompressed 247.0 percent)
08:17:52:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=1023107 data_size=2527872, decompressed_data_size=2527872 diff=0
08:17:52:WU00:FS00:0xa4:- Digital signature verified
08:17:52:WU00:FS00:0xa4:
08:17:52:WU00:FS00:0xa4:Project: 8069 (Run 0, Clone 438, Gen 83)
08:17:52:WU00:FS00:0xa4:
08:17:52:WU00:FS00:0xa4:Assembly optimizations on if available.
08:17:52:WU00:FS00:0xa4:Entering M.D.
08:17:58:WU00:FS00:0xa4:Mapping NT from 2 to 2 
08:17:58:WU00:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
08:18:00:FS00:Finishing
08:23:38:WU00:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
08:29:19:WU00:FS00:0xa4:Completed 5000 out of 250000 steps  (2%)
08:35:01:WU00:FS00:0xa4:Completed 7500 out of 250000 steps  (3%)
08:40:44:WU00:FS00:0xa4:Completed 10000 out of 250000 steps  (4%)

---SNIP---

17:29:56:WU00:FS00:0xa4:Completed 242500 out of 250000 steps  (97%)
17:35:36:WU00:FS00:0xa4:Completed 245000 out of 250000 steps  (98%)
17:41:17:WU00:FS00:0xa4:Completed 247500 out of 250000 steps  (99%)
17:46:59:WU00:FS00:0xa4:Completed 250000 out of 250000 steps  (100%)
17:47:00:WU00:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
17:47:10:WU00:FS00:0xa4:
17:47:10:WU00:FS00:0xa4:Finished Work Unit:
17:47:10:WU00:FS00:0xa4:- Reading up to 1567212 from "00/wudata_01.trr": Read 1567212
17:47:10:WU00:FS00:0xa4:trr file hash check passed.
17:47:10:WU00:FS00:0xa4:- Reading up to 1758628 from "00/wudata_01.xtc": Read 1758628
17:47:10:WU00:FS00:0xa4:xtc file hash check passed.
17:47:10:WU00:FS00:0xa4:edr file hash check passed.
17:47:10:WU00:FS00:0xa4:logfile size: 28160
17:47:10:WU00:FS00:0xa4:Leaving Run
17:47:11:WU00:FS00:0xa4:- Writing 3363272 bytes of core data to disk...
17:47:12:WU00:FS00:0xa4:Done: 3362760 -> 3243877 (compressed to 96.4 percent)
17:47:12:WU00:FS00:0xa4:  ... Done.
17:47:13:WU00:FS00:0xa4:- Shutting down core
17:47:13:WU00:FS00:0xa4:
17:47:13:WU00:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
17:47:14:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
17:47:14:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8069 run:0 clone:438 gen:83 core:0xa4 unit:0x000000566652edb350ad5a37ba7a67db
17:47:14:WU00:FS00:Uploading 3.09MiB to 171.67.108.35
17:47:14:WU00:FS00:Connecting to 171.67.108.35:8080
17:47:35:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
17:47:35:WU00:FS00:Connecting to 171.67.108.35:80
17:47:56:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
17:47:56:WU00:FS00:Trying to send results to collection server
17:47:56:WU00:FS00:Uploading 3.09MiB to 171.65.103.160
17:47:56:WU00:FS00:Connecting to 171.65.103.160:8080
17:48:02:WU00:FS00:Upload 8.08%
17:48:08:WU00:FS00:Upload 18.18%
17:48:14:WU00:FS00:Upload 26.26%
17:48:21:WU00:FS00:Upload 36.36%
17:48:28:WU00:FS00:Upload 46.46%
17:48:35:WU00:FS00:Upload 56.56%
17:48:41:WU00:FS00:Upload 64.64%
17:48:47:WU00:FS00:Upload 72.72%
17:48:53:WU00:FS00:Upload 80.80%
17:49:00:WU00:FS00:Upload 88.88%
17:49:06:WU00:FS00:Upload 96.96%
17:49:09:WU00:FS00:Upload complete
17:49:09:WU00:FS00:Server responded WORK_ACK (400)
17:49:09:WU00:FS00:Final credit estimate, 761.00 points
17:49:09:WU00:FS00:Cleaning up

Re: 171.67.108.35 Down

Posted: Mon Dec 17, 2012 6:58 pm
by bruce
welcome to foldingforum.org, carrotworks

Thank you for the notification. According the history on the serverstat page, 171.67.108.35 was, in fact, down before 10:00 PST but was brought back on-line before 10:20 PST. The job of the collection server, 171.65.103.160, is to covering for the primary work servers during brief outages, and it performed its job as expected.

Re: 171.67.108.35 Down

Posted: Mon Dec 17, 2012 7:04 pm
by carrotworks
Hi Bruce,

Yeah, I've been monitoring the server status page and noticed 171.67.108.34-171.67.108.36 come back up over the last hour. Hope I didn't jump the gun with this thread! :P

Re: 171.67.108.35 Down

Posted: Mon Dec 17, 2012 7:12 pm
by bruce
NP.

The first concern is making sure that either the WS or the CS is working so that WUs have a path to be returned, and you answered that in your original post. In other words, it was not critical. The second concern is that some other WS has WUs that can be assigned to your specific type of client. That probably was okay, too, but since it's back up, it's no longer a concern.

I have no first-hand information other than serverstat. Perhaps there was a problem with the WS or perhaps they took it down briefly for maintenance.

Re: 171.67.108.35 Down

Posted: Sat Mar 09, 2013 5:48 am
by Jesse_V
Same error

Code: Select all

0:47:25:WU00:FS01:0xa4:Completed 480000 out of 500000 steps  (96%)
20:49:52:WU00:FS01:0xa4:Completed 485000 out of 500000 steps  (97%)
20:52:18:WU00:FS01:0xa4:Completed 490000 out of 500000 steps  (98%)
20:52:19:WU02:FS01:Connecting to assign3.stanford.edu:8080
20:52:19:WU02:FS01:News: Welcome to Folding@Home
20:52:19:WU02:FS01:Assigned to work server 171.67.108.35
20:52:19:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:7 from 171.67.108.35
20:52:19:WU02:FS01:Connecting to 171.67.108.35:8080
20:52:40:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
20:52:40:WU02:FS01:Connecting to 171.67.108.35:80
20:53:01:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
20:53:02:WU02:FS01:Connecting to assign3.stanford.edu:8080
20:53:02:WU02:FS01:News: Welcome to Folding@Home
20:53:02:WU02:FS01:Assigned to work server 171.67.108.35
20:53:02:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:7 from 171.67.108.35
20:53:02:WU02:FS01:Connecting to 171.67.108.35:8080
20:53:23:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
20:53:23:WU02:FS01:Connecting to 171.67.108.35:80
20:53:44:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
20:54:02:WU02:FS01:Connecting to assign3.stanford.edu:8080
20:54:02:WU02:FS01:News: Welcome to Folding@Home
20:54:02:WU02:FS01:Assigned to work server 171.67.108.35
20:54:02:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:7 from 171.67.108.35
20:54:02:WU02:FS01:Connecting to 171.67.108.35:8080
20:54:23:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
20:54:23:WU02:FS01:Connecting to 171.67.108.35:80
20:54:45:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
20:54:46:WU00:FS01:0xa4:Completed 495000 out of 500000 steps  (99%)
20:55:39:WU02:FS01:Connecting to assign3.stanford.edu:8080
20:55:39:WU02:FS01:News: Welcome to Folding@Home
20:55:39:WU02:FS01:Assigned to work server 171.67.108.35
20:55:39:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:7 from 171.67.108.35
20:55:39:WU02:FS01:Connecting to 171.67.108.35:8080
20:56:00:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
20:56:00:WU02:FS01:Connecting to 171.67.108.35:80
20:56:22:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
20:57:14:WU00:FS01:0xa4:Completed 500000 out of 500000 steps  (100%)
20:57:14:WU00:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
20:57:24:WU00:FS01:0xa4:
20:57:24:WU00:FS01:0xa4:Finished Work Unit:
20:57:24:WU00:FS01:0xa4:- Reading up to 1350492 from "00/wudata_01.trr": Read 1350492
20:57:24:WU00:FS01:0xa4:trr file hash check passed.
20:57:24:WU00:FS01:0xa4:- Reading up to 1505724 from "00/wudata_01.xtc": Read 1505724
20:57:24:WU00:FS01:0xa4:xtc file hash check passed.
20:57:24:WU00:FS01:0xa4:edr file hash check passed.
20:57:24:WU00:FS01:0xa4:logfile size: 26424
20:57:24:WU00:FS01:0xa4:Leaving Run
20:57:29:WU00:FS01:0xa4:- Writing 2891464 bytes of core data to disk...
20:57:29:WU00:FS01:0xa4:Done: 2890952 -> 2804248 (compressed to 97.0 percent)
20:57:29:WU00:FS01:0xa4:  ... Done.
20:57:30:WU00:FS01:0xa4:- Shutting down core
20:57:30:WU00:FS01:0xa4:
20:57:30:WU00:FS01:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
20:57:31:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
20:57:31:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:8082 run:38 clone:53 gen:16 core:0xa4 unit:0x000000106652edb3512a11338c90f625
20:57:31:WU00:FS01:Uploading 2.67MiB to 171.67.108.35
20:57:31:WU00:FS01:Connecting to 171.67.108.35:8080
20:57:52:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
20:57:52:WU00:FS01:Connecting to 171.67.108.35:80
20:58:13:WARNING:WU00:FS01:Exception: Failed to send results to work server: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
20:58:13:WU00:FS01:Trying to send results to collection server
20:58:13:WU00:FS01:Uploading 2.67MiB to 171.65.103.160
20:58:13:WU00:FS01:Connecting to 171.65.103.160:8080
20:58:16:WU02:FS01:Connecting to assign3.stanford.edu:8080
20:58:16:WU02:FS01:News: Welcome to Folding@Home
20:58:16:WU02:FS01:Assigned to work server 171.67.108.35
20:58:16:WU02:FS01:Requesting new work unit for slot 01: READY cpu:7 from 171.67.108.35
20:58:16:WU02:FS01:Connecting to 171.67.108.35:8080
20:58:19:WU00:FS01:Upload 63.09%
20:58:22:WU00:FS01:Upload complete
20:58:22:WU00:FS01:Server responded WORK_ACK (400)
20:58:22:WU00:FS01:Final credit estimate, 4266.00 points
20:58:22:WU00:FS01:Cleaning up
20:58:38:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
20:58:38:WU02:FS01:Connecting to 171.67.108.35:80
20:58:59:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
21:02:30:WU02:FS01:Connecting to assign3.stanford.edu:8080
21:02:31:WU02:FS01:News: Welcome to Folding@Home
21:02:31:WU02:FS01:Assigned to work server 171.67.108.35
21:02:31:WU02:FS01:Requesting new work unit for slot 01: READY cpu:7 from 171.67.108.35
21:02:31:WU02:FS01:Connecting to 171.67.108.35:8080
21:02:52:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
21:02:52:WU02:FS01:Connecting to 171.67.108.35:80
21:03:13:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
21:09:22:WU02:FS01:Connecting to assign3.stanford.edu:8080
21:09:22:WU02:FS01:News: Welcome to Folding@Home
21:09:22:WU02:FS01:Assigned to work server 171.67.108.35
21:09:22:WU02:FS01:Requesting new work unit for slot 01: READY cpu:7 from 171.67.108.35
21:09:22:WU02:FS01:Connecting to 171.67.108.35:8080
21:09:43:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
21:09:43:WU02:FS01:Connecting to 171.67.108.35:80
21:10:04:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
21:20:27:WU02:FS01:Connecting to assign3.stanford.edu:8080
21:20:27:WU02:FS01:News: Welcome to Folding@Home
21:20:27:WU02:FS01:Assigned to work server 171.67.108.35
21:20:27:WU02:FS01:Requesting new work unit for slot 01: READY cpu:7 from 171.67.108.35
21:20:27:WU02:FS01:Connecting to 171.67.108.35:8080
21:20:49:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
21:20:49:WU02:FS01:Connecting to 171.67.108.35:80
21:21:10:ERROR:WU02:FS01:Exception: Failed to connect to 171.67.108.35:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
21:38:24:WU02:FS01:Connecting to assign3.stanford.edu:8080
21:38:24:WU02:FS01:News: Welcome to Folding@Home
21:38:24:WU02:FS01:Assigned to work server 128.143.199.97
21:38:24:WU02:FS01:Requesting new work unit for slot 01: READY cpu:7 from 128.143.199.97
21:38:24:WU02:FS01:Connecting to 128.143.199.97:8080
21:38:25:WU02:FS01:Downloading 1.86MiB
21:38:27:WU02:FS01:Download complete
21:38:27:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:7513 run:0 clone:18 gen:237 core:0xa3 unit:0x000000f5fbcb017d4ff756c2ea6156f6
21:38:27:WU02:FS01:Starting
21:38:27:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Admin/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/beta/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 703 -lifeline 3536 -checkpoint 30 -np 7
21:38:27:WU02:FS01:Started FahCore on PID 3304
21:38:27:WU02:FS01:Core PID:3392
21:38:27:WU02:FS01:FahCore 0xa3 started
21:38:27:WU02:FS01:0xa3:
21:38:27:WU02:FS01:0xa3:*------------------------------*
21:38:27:WU02:FS01:0xa3:Folding@Home Gromacs SMP Core
21:38:27:WU02:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
21:38:27:WU02:FS01:0xa3:
21:38:27:WU02:FS01:0xa3:Preparing to commence simulation
21:38:27:WU02:FS01:0xa3:- Looking at optimizations...
21:38:27:WU02:FS01:0xa3:- Created dyn
21:38:27:WU02:FS01:0xa3:- Files status OK
21:38:27:WU02:FS01:0xa3:- Expanded 1948679 -> 2929924 (decompressed 150.3 percent)
21:38:27:WU02:FS01:0xa3:Called DecompressByteArray: compressed_data_size=1948679 data_size=2929924, decompressed_data_size=2929924 diff=0
21:38:27:WU02:FS01:0xa3:- Digital signature verified
21:38:27:WU02:FS01:0xa3:
21:38:27:WU02:FS01:0xa3:Project: 7513 (Run 0, Clone 18, Gen 237)
21:38:27:WU02:FS01:0xa3:
21:38:27:WU02:FS01:0xa3:Assembly optimizations on if available.
21:38:27:WU02:FS01:0xa3:Entering M.D.
21:38:33:WU02:FS01:0xa3:Mapping NT from 7 to 7 
21:38:34:WU02:FS01:0xa3:Completed 0 out of 500000 steps  (0%)
21:42:26:WU02:FS01:0xa3:Completed 5000 out of 500000 steps  (1%)
21:46:18:WU02:FS01:0xa3:Completed 10000 out of 500000 steps  (2%)
21:50:11:WU02:FS01:0xa3:Completed 15000 out of 500000 steps  (3%)
21:54:03:WU02:FS01:0xa3:Completed 20000 out of 500000 steps  (4%)
21:57:56:WU02:FS01:0xa3:Completed 25000 out of 500000 steps  (5%)
22:01:48:WU02:FS01:0xa3:Completed 30000 out of 500000 steps  (6%)
22:05:42:WU02:FS01:0xa3:Completed 35000 out of 500000 steps  (7%)
The WU is still waiting.

I cannot check the server stats page because fah-web.stanford.edu appears to be down, which also includes the stats pages.

Re: 171.67.108.35 Down

Posted: Sat Mar 09, 2013 8:52 am
by Jesse_V
Jesse_V wrote:The WU is still waiting.
I restarted FAH, and it managed to start uploading to 128.143.199.97 (a different server). However, it looks like its hanging at 92%:

Code: Select all

08:45:03:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:7513 run:0 clone:18 gen:237 core:0xa3 unit:0x000000f5fbcb017d4ff756c2ea6156f6
08:45:03:WU02:FS01:Uploading 10.88MiB to 128.143.199.97
08:45:03:WU02:FS01:Connecting to 128.143.199.97:8080
08:45:03:WU03:FS01:Starting
08:45:03:WU03:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Admin/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/beta/Core_a4.fah/FahCore_a4.exe -dir 03 -suffix 01 -version 703 -lifeline 3396 -checkpoint 30 -np 7
08:45:03:WU03:FS01:Started FahCore on PID 3596
08:45:03:WU03:FS01:Core PID:3612
08:45:03:WU03:FS01:FahCore 0xa4 started
08:45:04:WU03:FS01:0xa4:
08:45:04:WU03:FS01:0xa4:*------------------------------*
08:45:04:WU03:FS01:0xa4:Folding@Home Gromacs GB Core
08:45:04:WU03:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
08:45:04:WU03:FS01:0xa4:
08:45:04:WU03:FS01:0xa4:Preparing to commence simulation
08:45:04:WU03:FS01:0xa4:- Looking at optimizations...
08:45:04:WU03:FS01:0xa4:- Files status OK
08:45:04:WU03:FS01:0xa4:- Expanded 1082309 -> 3051032 (decompressed 281.9 percent)
08:45:04:WU03:FS01:0xa4:Called DecompressByteArray: compressed_data_size=1082309 data_size=3051032, decompressed_data_size=3051032 diff=0
08:45:04:WU03:FS01:0xa4:- Digital signature verified
08:45:04:WU03:FS01:0xa4:
08:45:04:WU03:FS01:0xa4:Project: 8082 (Run 19, Clone 36, Gen 12)
08:45:04:WU03:FS01:0xa4:
08:45:04:WU03:FS01:0xa4:Assembly optimizations on if available.
08:45:04:WU03:FS01:0xa4:Entering M.D.
08:45:09:WU02:FS01:Upload 9.77%
08:45:10:WU03:FS01:0xa4:Mapping NT from 7 to 7 
08:45:10:WU03:FS01:0xa4:Completed 0 out of 500000 steps  (0%)
08:45:15:WU02:FS01:Upload 20.11%
08:45:21:WU02:FS01:Upload 30.46%
08:45:27:WU02:FS01:Upload 40.80%
08:45:33:WU02:FS01:Upload 51.14%
08:45:39:WU02:FS01:Upload 61.49%
08:45:45:WU02:FS01:Upload 71.83%
08:45:51:WU02:FS01:Upload 82.18%
08:45:57:WU02:FS01:Upload 92.52%
08:47:41:WU03:FS01:0xa4:Completed 5000 out of 500000 steps  (1%)
08:50:19:WU03:FS01:0xa4:Completed 10000 out of 500000 steps  (2%)