129.74.85.15

Moderators: Site Moderators, FAHC Science Team

Post Reply
plext0r
Posts: 9
Joined: Fri Jun 11, 2010 12:55 pm

129.74.85.15

Post by plext0r »

I've been having problems with this server since yesterday and it's still taking place. I'm getting HTTP_GATEWAY_TIME_OUT across about 20 servers. I thought it might be an internal transparent proxy issue, but then I got assigned a different work server (171.67.108.60) for a couple of units and it was fine. Now I'm back to failing on 129.74.85.15. Here's the cleaned-up log from one of my hosts.

Code: Select all

*********************** Log Started 2012-07-30T20:59:10Z ***********************
20:59:10:************************* Folding@home Client *************************
20:59:10:    Website: http://folding.stanford.edu/
20:59:10:  Copyright: (c) 2009-2012 Stanford University
20:59:10:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
20:59:10:       Args: --child --lifeline 2719 /etc/fahclient/config.xml --run-as
20:59:10:             fahclient --pid-file=/var/run/fahclient.pid --daemon
20:59:10:     Config: /etc/fahclient/config.xml
20:59:10:******************************** Build ********************************
20:59:10:    Version: 7.1.52
20:59:10:       Date: Mar 20 2012
20:59:10:       Time: 13:32:16
20:59:10:    SVN Rev: 3515
20:59:10:     Branch: fah/trunk/client
20:59:10:   Compiler: GNU 4.1.2 20080704 (Red Hat 4.1.2-46)
20:59:10:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
20:59:10:             -fno-unsafe-math-optimizations -msse2
20:59:10:   Platform: linux2 2.6.18-164.11.1.el5
20:59:10:       Bits: 64
20:59:10:       Mode: Release
20:59:10:******************************* System ********************************
20:59:10:        CPU: Intel(R) Core(TM)2 Duo CPU E8400 @ 3.00GHz
20:59:10:     CPU ID: GenuineIntel Family 6 Model 23 Stepping 10
20:59:10:       CPUs: 2
20:59:10:     Memory: 1.96GiB
20:59:10:Free Memory: 1.74GiB
20:59:10:    Threads: POSIX_THREADS
20:59:10: On Battery: false
20:59:10: UTC offset: -4
20:59:10:        PID: 2726
20:59:10:        CWD: /var/lib/fahclient
20:59:10:         OS: Linux 2.6.18-308.8.2.el5 x86_64
20:59:10:    OS Arch: AMD64
20:59:10:       GPUs: 1
20:59:10:      GPU 0: UNSUPPORTED: ES1000
20:59:10:       CUDA: Not detected
20:59:10:***********************************************************************
20:59:10:<config>
20:59:10:  <!-- Folding Slot Configuration -->
20:59:10:  <client-type v='advanced'/>
20:59:10:  <max-packet-size v='big'/>
20:59:10:
20:59:10:  <!-- Network -->
20:59:10:  <proxy v='x.x.x.x:3128'/>
20:59:10:  <proxy-enable v='true'/>
20:59:10:
20:59:10:  <!-- User Information -->
20:59:10:  <passkey v='********************************'/>
20:59:10:  <team v='1115'/>
20:59:10:  <user v='brilong'/>
20:59:10:
20:59:10:  <!-- Folding Slots -->
20:59:10:</config>
20:59:10:Switching to user fahclient
20:59:10:Trying to access database...
20:59:10:Successfully acquired database lock
20:59:10:Enabled folding slot 00: READY smp:2
20:59:10:WU00:FS00:Connecting to x.x.x.x:3128
20:59:10:WU00:FS00:News: Welcome to Folding@Home
20:59:10:WU00:FS00:Assigned to work server 129.74.85.15
20:59:10:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
20:59:10:WU00:FS00:Connecting to x.x.x.x:3128
20:59:11:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
20:59:11:WU00:FS00:Connecting to x.x.x.x:3128
20:59:11:WU00:FS00:News: Welcome to Folding@Home
20:59:11:WU00:FS00:Assigned to work server 129.74.85.15
20:59:11:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
20:59:11:WU00:FS00:Connecting to x.x.x.x:3128
20:59:11:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
21:00:11:WU00:FS00:Connecting to x.x.x.x:3128
21:00:11:WU00:FS00:News: Welcome to Folding@Home
21:00:11:WU00:FS00:Assigned to work server 129.74.85.15
21:00:11:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
21:00:11:WU00:FS00:Connecting to x.x.x.x:3128
21:00:11:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
[snip lots of timeouts]
22:33:47:WU01:FS00:Connecting to x.x.x.x:3128
22:33:48:WU01:FS00:News: Welcome to Folding@Home
22:33:48:WU01:FS00:Assigned to work server 129.74.85.15
22:33:48:WU01:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
22:33:48:WU01:FS00:Connecting to x.x.x.x:3128
22:33:48:ERROR:WU01:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
22:36:25:WU01:FS00:Connecting to x.x.x.x:3128
22:36:25:WU01:FS00:News: Welcome to Folding@Home
22:36:25:WU01:FS00:Assigned to work server 129.74.85.15
22:36:25:WU01:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
22:36:25:WU01:FS00:Connecting to x.x.x.x:3128
22:36:25:ERROR:WU01:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
22:40:39:WU01:FS00:Connecting to x.x.x.x:3128
22:40:39:WU01:FS00:News: Welcome to Folding@Home
22:40:39:WU01:FS00:Assigned to work server 171.67.108.60
22:40:39:WU01:FS00:Requesting new work unit for slot 00: READY smp:2 from 171.67.108.60
22:40:39:WU01:FS00:Connecting to x.x.x.x:3128
22:40:40:WU01:FS00:Downloading 1.17MiB
22:40:41:WU01:FS00:Download complete
22:40:41:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:8049 run:621 clone:14 gen:0 core:0xa4 unit:0x000000006652edcc5013372c69bf2917
22:40:41:WU01:FS00:Starting
22:40:41:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 701 -lifeline 2726 -checkpoint 15 -np 2
22:40:41:WU01:FS00:Started FahCore on PID 2934
22:40:41:WU01:FS00:Core PID:2938
22:40:41:WU01:FS00:FahCore 0xa4 started
22:40:42:WU01:FS00:0xa4:
22:40:42:WU01:FS00:0xa4:*------------------------------*
22:40:42:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
22:40:42:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
22:40:42:WU01:FS00:0xa4:
22:40:42:WU01:FS00:0xa4:Preparing to commence simulation
22:40:42:WU01:FS00:0xa4:- Looking at optimizations...
22:40:42:WU01:FS00:0xa4:- Created dyn
22:40:42:WU01:FS00:0xa4:- Files status OK
22:40:42:WU01:FS00:0xa4:- Expanded 1229139 -> 2209388 (decompressed 179.7 percent)
22:40:42:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=1229139 data_size=2209388, decompressed_data_size=2209388 diff=0
22:40:42:WU01:FS00:0xa4:- Digital signature verified
22:40:42:WU01:FS00:0xa4:
22:40:42:WU01:FS00:0xa4:Project: 8049 (Run 621, Clone 14, Gen 0)
22:40:42:WU01:FS00:0xa4:
22:40:42:WU01:FS00:0xa4:Assembly optimizations on if available.
22:40:42:WU01:FS00:0xa4:Entering M.D.
22:40:48:WU01:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
22:44:12:WU01:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
[snip steps]
04:14:21:WU01:FS00:0xa4:Completed 247500 out of 250000 steps  (99%)
04:17:42:WU01:FS00:0xa4:Completed 250000 out of 250000 steps  (100%)
04:17:42:WU01:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
04:17:43:WU00:FS00:Connecting to x.x.x.x:3128
04:17:43:WU00:FS00:News: Welcome to Folding@Home
04:17:43:WU00:FS00:Assigned to work server 129.74.85.15
04:17:43:WU00:FS00:Requesting new work unit for slot 00: RUNNING smp:2 from 129.74.85.15
04:17:43:WU00:FS00:Connecting to x.x.x.x:3128
04:17:43:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
04:17:43:WU00:FS00:Connecting to x.x.x.x:3128
04:17:44:WU00:FS00:News: Welcome to Folding@Home
04:17:44:WU00:FS00:Assigned to work server 129.74.85.15
04:17:44:WU00:FS00:Requesting new work unit for slot 00: RUNNING smp:2 from 129.74.85.15
04:17:44:WU00:FS00:Connecting to x.x.x.x:3128
04:17:44:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
04:17:52:WU01:FS00:0xa4:
04:17:52:WU01:FS00:0xa4:Finished Work Unit:
04:17:52:WU01:FS00:0xa4:- Reading up to 1405500 from "01/wudata_01.trr": Read 1405500
04:17:52:WU01:FS00:0xa4:trr file hash check passed.
04:17:52:WU01:FS00:0xa4:- Reading up to 850948 from "01/wudata_01.xtc": Read 850948
04:17:52:WU01:FS00:0xa4:xtc file hash check passed.
04:17:52:WU01:FS00:0xa4:edr file hash check passed.
04:17:52:WU01:FS00:0xa4:logfile size: 23474
04:17:52:WU01:FS00:0xa4:Leaving Run
04:17:55:WU01:FS00:0xa4:- Writing 2285326 bytes of core data to disk...
04:17:56:WU01:FS00:0xa4:Done: 2284814 -> 2183171 (compressed to 95.5 percent)
04:17:56:WU01:FS00:0xa4:  ... Done.
04:18:14:WU01:FS00:0xa4:- Shutting down core
04:18:14:WU01:FS00:0xa4:
04:18:14:WU01:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
04:18:16:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
04:18:16:WU01:FS00:Sending unit results: id:01 state:SEND error:OK project:8049 run:621 clone:14 gen:0 core:0xa4 unit:0x000000006652edcc5013372c69bf2917
04:18:16:WU01:FS00:Uploading 2.08MiB to 171.67.108.60
04:18:16:WU01:FS00:Connecting to x.x.x.x:3128
04:18:19:WU01:FS00:Upload complete
04:18:19:WU01:FS00:Server responded WORK_ACK (400)
04:18:19:WU01:FS00:Final credit estimate, 1033.00 points
04:18:19:WU01:FS00:Cleaning up
04:18:44:WU00:FS00:Connecting to x.x.x.x:3128
04:18:44:WU00:FS00:News: Welcome to Folding@Home
04:18:44:WU00:FS00:Assigned to work server 129.74.85.15
04:18:44:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
04:18:44:WU00:FS00:Connecting to x.x.x.x:3128
04:18:44:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
[snip more timeouts]
04:36:43:WU00:FS00:Connecting to x.x.x.x:3128
04:36:43:WU00:FS00:News: Welcome to Folding@Home
04:36:43:WU00:FS00:Assigned to work server 129.74.85.15
04:36:43:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
04:36:43:WU00:FS00:Connecting to x.x.x.x:3128
04:36:43:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
04:37:43:WU00:FS00:Connecting to x.x.x.x:3128
04:37:44:WU00:FS00:News: Welcome to Folding@Home
04:37:44:WU00:FS00:Assigned to work server 171.67.108.60
04:37:44:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 171.67.108.60
04:37:44:WU00:FS00:Connecting to x.x.x.x:3128
04:37:44:WU00:FS00:Downloading 1.17MiB
04:37:45:WU00:FS00:Download complete
04:37:45:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:OK project:8049 run:541 clone:17 gen:0 core:0xa4 unit:0x000000006652edcc5013355d14501723
04:37:45:WU00:FS00:Starting
04:37:45:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 00 -suffix 01 -version 701 -lifeline 2726 -checkpoint 15 -np 2
04:37:45:WU00:FS00:Started FahCore on PID 3640
04:37:45:WU00:FS00:Core PID:3644
04:37:45:WU00:FS00:FahCore 0xa4 started
04:37:46:WU00:FS00:0xa4:
04:37:46:WU00:FS00:0xa4:*------------------------------*
04:37:46:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
04:37:46:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
04:37:46:WU00:FS00:0xa4:
04:37:46:WU00:FS00:0xa4:Preparing to commence simulation
04:37:46:WU00:FS00:0xa4:- Looking at optimizations...
04:37:46:WU00:FS00:0xa4:- Created dyn
04:37:46:WU00:FS00:0xa4:- Files status OK
04:37:46:WU00:FS00:0xa4:- Expanded 1231391 -> 2213132 (decompressed 179.7 percent)
04:37:46:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=1231391 data_size=2213132, decompressed_data_size=2213132 diff=0
04:37:46:WU00:FS00:0xa4:- Digital signature verified
04:37:46:WU00:FS00:0xa4:
04:37:46:WU00:FS00:0xa4:Project: 8049 (Run 541, Clone 17, Gen 0)
04:37:46:WU00:FS00:0xa4:
04:37:46:WU00:FS00:0xa4:Assembly optimizations on if available.
04:37:46:WU00:FS00:0xa4:Entering M.D.
04:37:52:WU00:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
04:41:15:WU00:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
[snip steps]
10:22:09:WU00:FS00:0xa4:Completed 247500 out of 250000 steps  (99%)
10:25:41:WU00:FS00:0xa4:Completed 250000 out of 250000 steps  (100%)
10:25:41:WU00:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
10:25:42:WU01:FS00:Connecting to x.x.x.x:3128
10:25:42:WU01:FS00:News: Welcome to Folding@Home
10:25:42:WU01:FS00:Assigned to work server 171.67.108.59
10:25:42:WU01:FS00:Requesting new work unit for slot 00: RUNNING smp:2 from 171.67.108.59
10:25:42:WU01:FS00:Connecting to x.x.x.x:3128
10:25:43:WU01:FS00:Downloading 531.52KiB
10:25:43:WU01:FS00:Download complete
10:25:43:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:8004 run:97 clone:17 gen:126 core:0xa4 unit:0x000000a56652edcb4ee901131ec3c664
10:25:51:WU00:FS00:0xa4:
10:25:51:WU00:FS00:0xa4:Finished Work Unit:
10:25:51:WU00:FS00:0xa4:- Reading up to 1408308 from "00/wudata_01.trr": Read 1408308
10:25:51:WU00:FS00:0xa4:trr file hash check passed.
10:25:51:WU00:FS00:0xa4:- Reading up to 852800 from "00/wudata_01.xtc": Read 852800
10:25:51:WU00:FS00:0xa4:xtc file hash check passed.
10:25:51:WU00:FS00:0xa4:edr file hash check passed.
10:25:51:WU00:FS00:0xa4:logfile size: 23534
10:25:51:WU00:FS00:0xa4:Leaving Run
10:25:55:WU00:FS00:0xa4:- Writing 2290046 bytes of core data to disk...
10:25:55:WU00:FS00:0xa4:Done: 2289534 -> 2187583 (compressed to 95.5 percent)
10:25:55:WU00:FS00:0xa4:  ... Done.
10:26:13:WU00:FS00:0xa4:- Shutting down core
10:26:13:WU00:FS00:0xa4:
10:26:13:WU00:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
10:26:16:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
10:26:16:WU00:FS00:Sending unit results: id:00 state:SEND error:OK project:8049 run:541 clone:17 gen:0 core:0xa4 unit:0x000000006652edcc5013355d14501723
10:26:16:WU00:FS00:Uploading 2.09MiB to 171.67.108.60
10:26:16:WU00:FS00:Connecting to x.x.x.x:3128
10:26:16:WU01:FS00:Starting
10:26:16:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 701 -lifeline 2726 -checkpoint 15 -np 2
10:26:16:WU01:FS00:Started FahCore on PID 4777
10:26:16:WU01:FS00:Core PID:4781
10:26:16:WU01:FS00:FahCore 0xa4 started
10:26:17:WU01:FS00:0xa4:
10:26:17:WU01:FS00:0xa4:*------------------------------*
10:26:17:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
10:26:17:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
10:26:17:WU01:FS00:0xa4:
10:26:17:WU01:FS00:0xa4:Preparing to commence simulation
10:26:17:WU01:FS00:0xa4:- Looking at optimizations...
10:26:17:WU01:FS00:0xa4:- Created dyn
10:26:17:WU01:FS00:0xa4:- Files status OK
10:26:17:WU01:FS00:0xa4:- Expanded 543765 -> 1303440 (decompressed 239.7 percent)
10:26:17:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=543765 data_size=1303440, decompressed_data_size=1303440 diff=0
10:26:17:WU01:FS00:0xa4:- Digital signature verified
10:26:17:WU01:FS00:0xa4:
10:26:17:WU01:FS00:0xa4:Project: 8004 (Run 97, Clone 17, Gen 126)
10:26:17:WU01:FS00:0xa4:
10:26:17:WU01:FS00:0xa4:Assembly optimizations on if available.
10:26:17:WU01:FS00:0xa4:Entering M.D.
10:26:18:WU00:FS00:Upload complete
10:26:18:WU00:FS00:Server responded WORK_ACK (400)
10:26:18:WU00:FS00:Final credit estimate, 1017.00 points
10:26:18:WU00:FS00:Cleaning up
10:26:23:WU01:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
10:28:06:WU01:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
[snip steps]
13:16:56:WU01:FS00:0xa4:Completed 247500 out of 250000 steps  (99%)
13:18:39:WU01:FS00:0xa4:Completed 250000 out of 250000 steps  (100%)
13:18:39:WU01:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
13:18:40:WU00:FS00:Connecting to x.x.x.x:3128
13:18:40:WU00:FS00:News: Welcome to Folding@Home
13:18:40:WU00:FS00:Assigned to work server 129.74.85.15
13:18:40:WU00:FS00:Requesting new work unit for slot 00: RUNNING smp:2 from 129.74.85.15
13:18:40:WU00:FS00:Connecting to x.x.x.x:3128
13:18:40:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
13:18:40:WU00:FS00:Connecting to x.x.x.x:3128
13:18:41:WU00:FS00:News: Welcome to Folding@Home
13:18:41:WU00:FS00:Assigned to work server 129.74.85.15
13:18:41:WU00:FS00:Requesting new work unit for slot 00: RUNNING smp:2 from 129.74.85.15
13:18:41:WU00:FS00:Connecting to x.x.x.x:3128
13:18:41:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
13:18:49:WU01:FS00:0xa4:
13:18:49:WU01:FS00:0xa4:Finished Work Unit:
13:18:49:WU01:FS00:0xa4:- Reading up to 768480 from "01/wudata_01.trr": Read 768480
13:18:49:WU01:FS00:0xa4:trr file hash check passed.
13:18:49:WU01:FS00:0xa4:- Reading up to 455420 from "01/wudata_01.xtc": Read 455420
13:18:49:WU01:FS00:0xa4:xtc file hash check passed.
13:18:49:WU01:FS00:0xa4:edr file hash check passed.
13:18:49:WU01:FS00:0xa4:logfile size: 23044
13:18:49:WU01:FS00:0xa4:Leaving Run
13:18:53:WU01:FS00:0xa4:- Writing 1252348 bytes of core data to disk...
13:18:54:WU01:FS00:0xa4:Done: 1251836 -> 1191440 (compressed to 95.1 percent)
13:18:54:WU01:FS00:0xa4:  ... Done.
13:19:04:WU01:FS00:0xa4:- Shutting down core
13:19:04:WU01:FS00:0xa4:
13:19:04:WU01:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
13:19:05:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
13:19:05:WU01:FS00:Sending unit results: id:01 state:SEND error:OK project:8004 run:97 clone:17 gen:126 core:0xa4 unit:0x000000a56652edcb4ee901131ec3c664
13:19:05:WU01:FS00:Uploading 1.14MiB to 171.67.108.59
13:19:05:WU01:FS00:Connecting to x.x.x.x:3128
13:19:07:WU01:FS00:Upload complete
13:19:07:WU01:FS00:Server responded WORK_ACK (400)
13:19:07:WU01:FS00:Final credit estimate, 415.00 points
13:19:07:WU01:FS00:Cleaning up
13:19:41:WU00:FS00:Connecting to x.x.x.x:3128
13:19:41:WU00:FS00:News: Welcome to Folding@Home
13:19:41:WU00:FS00:Assigned to work server 129.74.85.15
13:19:41:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
13:19:41:WU00:FS00:Connecting to x.x.x.x:3128
13:19:41:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
13:21:18:WU00:FS00:Connecting to x.x.x.x:3128
13:21:18:WU00:FS00:News: Welcome to Folding@Home
13:21:18:WU00:FS00:Assigned to work server 129.74.85.15
13:21:18:WU00:FS00:Requesting new work unit for slot 00: READY smp:2 from 129.74.85.15
13:21:18:WU00:FS00:Connecting to x.x.x.x:3128
13:21:18:ERROR:WU00:FS00:Exception: 10001: Server responded: HTTP_GATEWAY_TIME_OUT
Mod edit - added code tags
plext0r
Posts: 9
Joined: Fri Jun 11, 2010 12:55 pm

Problem with 129.74.85.15

Post by plext0r »

My 7.1.52 Linux clients keep trying to get work units from 129.74.85.15 but server status shows "accept". Why am I getting assigned this server when it's not dishing out new work units?

According to the status page, "Accept" means the server is only accepting WUs, not assigning. I have multiple clients requesting work units from this server and getting HTTP_GATEWAY_TIME_OUT. I have to use a proxy server in my corporate network to transition from RFC1918 space to the Internet.

I first posted my log at viewtopic.php?f=18&t=21173#p221120, then realized the server status was "Accept".

Thanks for any help resolving this issue. I'm not sure how to force my clients to try for assignments from other working servers.
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Problem with 129.74.85.15

Post by sortofageek »

Have you worked through the stickied Troubleshooting Tips?
plext0r
Posts: 9
Joined: Fri Jun 11, 2010 12:55 pm

Re: Problem with 129.74.85.15

Post by plext0r »

sortofageek wrote:Have you worked through the stickied Troubleshooting Tips?
Yes, that's how I figured out I was getting assigned to a server in "Active" status. If I'm reading this right, this server is not assigning new WUs, yet I'm getting sent to it anyway.

My v7.1.52 client is configured for "advanced", "big" units, so it should get assigned normal or small units as needed. I've noticed that many of my hosts have retried tens of times with 129.74.85.15 and then got assigned to 171.67.108.59 which immediately downloaded and started working.

My beef is why I'm getting assigned to work server 129.74.85.15 when it claims to have no new work units available.

Thanks!
codysluder
Posts: 1024
Joined: Sun Dec 02, 2007 12:43 pm

Re: Problem with 129.74.85.15

Post by codysluder »

I don't read the server status page the same way you do. The Status column says full, not accept, which means it should be assigning work. This is confirmed by the green number (currently 12, but it changes) in the %Ass column which indicates that it is successfully assigning work. Don't be confused by the Connect column saying accepting. That's simply an indication that it is accepting connections, not that the assignment status is limited to accept-only.

My concern is the red 0 in the 80 column which indicates it isn't accepting connections on port 80, but the lack of any information in the %Ass80 column seems to indicate that the Assignment Servers understand the status correctly.

I'd like to be able to help answer your original question about the timeout, but I don't understand that problem. All I know is that serverstat doesn't explain what's wrong.

Post the output from tracert 129.74.85.15
Joe_H
Site Admin
Posts: 8002
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Studio M1 Max 32 GB smp6
Mac Hack i7-7700K 48 GB smp4
Location: W. MA

Re: Problem with 129.74.85.15

Post by Joe_H »

There may be a problem with the server or your connecting to it, but it is not from lack of work being available. The status reported on the server status page, http://fah-web.stanford.edu/pybeta/serverstat.html, has been "full" and "accepting" for over 24 hours into the past. The number for "WU's avail" has been in excess of 300K as well. Where did you see a status of just "accept"? Or did you misread the server status page?
Image
plext0r
Posts: 9
Joined: Fri Jun 11, 2010 12:55 pm

Re: Problem with 129.74.85.15

Post by plext0r »

This was my first attempt reading the server status and your post clued me in. It appears I was reading it wrong. I cannot traceroute to the server from behind my NAT. foo.bar.com and x.x.x.x are my HTTP proxy. wget output follows:

# wget http://129.74.85.15
--2012-07-31 12:10:13-- http://129.74.85.15/
Resolving foo.bar.com... x.x.x.x
Connecting to foo.bar.com|x.x.x.x|:80... connected.
Proxy request sent, awaiting response... No data received.
Retrying.
[hit Ctrl-C]
# wget http://129.74.85.15:8080
--2012-07-31 12:10:40-- http://129.74.85.15:8080/
Resolving foo.bar.com... x.x.x.x
Connecting to foo.bar.com|x.x.x.x|:80... connected.
Proxy request sent, awaiting response... 200 HTTP_OK
Length: unspecified
Saving to: `index.html'

[ <=> ] 0 --.-K/s in 0s

2012-07-31 12:10:40 (0.00 B/s) - `index.html' saved [0]

# cat index.html
#

As you said, port 80 is not responding. Port 8080 is returning an empty file.
plext0r
Posts: 9
Joined: Fri Jun 11, 2010 12:55 pm

Re: Problem with 129.74.85.15

Post by plext0r »

Joe_H wrote:There may be a problem with the server or your connecting to it, but it is not from lack of work being available. The status reported on the server status page, http://fah-web.stanford.edu/pybeta/serverstat.html, has been "full" and "accepting" for over 24 hours into the past. The number for "WU's avail" has been in excess of 300K as well. Where did you see a status of just "accept"? Or did you misread the server status page?
I was reading http://fah-web.stanford.edu/serverstat.html where it says "accept", not "full". This is my first attempt at checking server status, so I Google'd "folding server status" and it gave me the non-pybeta page which appears to be wrong even though it says it was updated this morning. :)
Joe_H
Site Admin
Posts: 8002
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Studio M1 Max 32 GB smp6
Mac Hack i7-7700K 48 GB smp4
Location: W. MA

Re: 129.74.85.15

Post by Joe_H »

That is interesting. The older status page has been deprecated for a while, but usually gives the same information as the newer page. All of the links on the F@H pages at Stanford that I have gone to recently have been updated to point to the pybeta version, so I would assume that is considered the authoritative source for the server status.
plext0r
Posts: 9
Joined: Fri Jun 11, 2010 12:55 pm

Re: 129.74.85.15

Post by plext0r »

I'm still having problems with this host on multiple machines. I removed 7.1.52 and reinstalled 6.34 and pulled a work unit from another server just fine. Not sure why I'm getting so many HTTP_GATEWAY_TIMEOUTs for 129.74.85.15 when I can reach all the other F@H servers.
Post Reply