Several console clients no longer getting work
Posted: Wed Feb 27, 2008 1:58 pm
The affected clients include 5.04beta, 6.01beta4, and SMP 5.91beta6.
I get an OK from http://assign.stanford.edu:8080/
I get a connection refused from http://assign2.stanford.edu/
We use an ISA server as the firewall for our network. I double-checked, and it isn't blocking anything in 171.xxx.xxx.xxx. These boxes were turning in results and getting new work units before, so I'm not sure what changed. They are all on a shared T1 line. If I switch a box over to our backup DSL line, it does download a new work unit, but it isn't practical for me to switch them over every time they are ready for a new work unit. When I ping assign2.stanford.edu, it shows up as 171.64.65.121.
Any suggestions?
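To compare the T1 and DSL lines directly, a small script along these lines might help. It is only a sketch: it attempts a plain TCP connect to the hosts and ports that appear in this post and in the logs, and reports whether each one is reachable. The function names (`check_endpoint`, `check_all`) and the 5-second timeout are my own assumptions, not anything from the client.

```python
import socket

# Hosts and ports copied from the post and the client logs.
# The 5-second timeout used below is an arbitrary assumption.
ENDPOINTS = [
    ("assign.stanford.edu", 8080),
    ("assign2.stanford.edu", 80),
    ("171.64.122.82", 80),
    ("171.64.122.76", 80),
]


def check_endpoint(host, port, timeout=5.0):
    """Attempt a plain TCP connect and return a short status string."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "ok"
    except ConnectionRefusedError:
        # The remote side (or something in between) actively rejected us.
        return "refused"
    except socket.timeout:
        # No answer at all, which often points at a silent drop.
        return "timeout"
    except OSError as exc:
        # DNS failures, unreachable networks, and similar errors.
        return f"error: {exc}"


def check_all(endpoints=ENDPOINTS, timeout=5.0):
    """Return a dict mapping 'host:port' to its connect status."""
    return {f"{h}:{p}": check_endpoint(h, p, timeout) for h, p in endpoints}
```

Running `check_all()` once from a box on the T1 and once from a box on the DSL line, then comparing the two results, should show whether the failures are specific to one route. Note that this only tests the TCP connect, not the HTTP request the client sends afterwards.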
Log from the 5.04beta client:
--- Opening Log file [February 26 17:09:14]
# Windows Console Edition #####################################################
###############################################################################
Folding@Home Client Version 5.04beta
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Program Files\fah1
Executable: C:\Program Files\fah1\FAH504-Console.exe
Arguments: -verbosity 9 -forceasm -advmethods
Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.
[17:09:29] - Ask before connecting: No
[17:09:30] - User name: Torin3 (Team 32)
[17:09:31] - User ID: 37E65BE7264C6495
[17:09:31] - Machine ID: 1
[17:09:31]
[17:09:32] Loaded queue successfully.
[17:09:32] + Benchmarking ...
[17:09:35] The benchmark result is 3580
[17:09:37]
[17:09:37] - Autosending finished units...
[17:09:37] + Processing work unit
[17:09:37] Trying to send all finished work units
[17:09:37] Core required: FahCore_81.exe
[17:09:37] + No unsent completed units remaining.
[17:09:37] Core found.
[17:09:38] - Autosend completed
[17:09:39] Working on Unit 05 [February 26 17:09:39]
[17:09:39] + Working ...
[17:09:39] - Calling 'FahCore_81.exe -dir work/ -suffix 05 -checkpoint 30 -forceasm -verbose -lifeline 3916 -version 504'
[17:09:41]
[17:09:41] *------------------------------*
[17:09:42] Folding@Home Gromacs Simulated Tempering Core
[17:09:42] Version 1.10 (Oct 4, 2007)
[17:09:42]
[17:09:42] Preparing to commence simulation
[17:09:43] - Assembly optimizations manually forced on.
[17:09:43] - Not checking prior termination.
[17:09:43] - Expanded 364133 -> 1799134 (decompressed 494.0 percent)
[17:09:47]
[17:09:47] Project: 3620 (Run 101, Clone 2, Gen 10)
[17:09:47]
[17:09:48] Assembly optimizations on if available.
[17:09:48] Entering M.D.
[17:10:08] (Starting from checkpoint)
[17:10:09] Protein: p3620_Seq26_Amber03_Extended
[17:10:09]
[17:10:11] Writing local files
[17:11:05] Completed 1255857 out of 1500000 steps (84)
[17:11:05] Extra SSE boost OK.
[17:24:49] Writing local files
[17:24:49] Completed 1260000 out of 1500000 steps (84)
[17:55:48] Timered checkpoint triggered.
[18:09:09] Writing local files
[18:09:09] Completed 1275000 out of 1500000 steps (85)
[18:40:08] Timered checkpoint triggered.
[18:54:09] Writing local files
[18:54:09] Completed 1290000 out of 1500000 steps (86)
[19:25:09] Timered checkpoint triggered.
[19:38:28] Writing local files
[19:38:28] Completed 1305000 out of 1500000 steps (87)
[20:09:28] Timered checkpoint triggered.
[20:22:02] Writing local files
[20:22:02] Completed 1320000 out of 1500000 steps (88)
[20:53:01] Timered checkpoint triggered.
[21:06:08] Writing local files
[21:06:09] Completed 1335000 out of 1500000 steps (89)
[21:37:08] Timered checkpoint triggered.
[21:50:10] Writing local files
[21:50:10] Completed 1350000 out of 1500000 steps (90)
[22:21:10] Timered checkpoint triggered.
[22:34:31] Writing local files
[22:34:31] Completed 1365000 out of 1500000 steps (91)
[23:05:32] Timered checkpoint triggered.
[23:09:39] - Autosending finished units...
[23:09:39] Trying to send all finished work units
[23:09:39] + No unsent completed units remaining.
[23:09:39] - Autosend completed
[23:18:36] Writing local files
[23:18:36] Completed 1380000 out of 1500000 steps (92)
[23:49:36] Timered checkpoint triggered.
[00:02:37] Writing local files
[00:02:38] Completed 1395000 out of 1500000 steps (93)
[00:33:38] Timered checkpoint triggered.
[00:47:03] Writing local files
[00:47:03] Completed 1410000 out of 1500000 steps (94)
[01:18:02] Timered checkpoint triggered.
[01:31:12] Writing local files
[01:31:12] Completed 1425000 out of 1500000 steps (95)
[02:02:13] Timered checkpoint triggered.
[02:15:03] Writing local files
[02:15:03] Completed 1440000 out of 1500000 steps (96)
[02:46:03] Timered checkpoint triggered.
[02:59:25] Writing local files
[02:59:25] Completed 1455000 out of 1500000 steps (97)
[03:30:26] Timered checkpoint triggered.
[03:43:49] Writing local files
[03:43:49] Completed 1470000 out of 1500000 steps (98)
[04:14:49] Timered checkpoint triggered.
[04:27:45] Writing local files
[04:27:45] Completed 1485000 out of 1500000 steps (99)
[04:58:46] Timered checkpoint triggered.
[05:09:39] - Autosending finished units...
[05:09:39] Trying to send all finished work units
[05:09:39] + No unsent completed units remaining.
[05:09:39] - Autosend completed
[05:12:05] Writing local files
[05:12:05] Completed 1500000 out of 1500000 steps (100)
[05:12:05] Writing final coordinates.
[05:12:06] Past main M.D. loop
[05:13:06]
[05:13:06] Finished Work Unit:
[05:13:06] - Reading up to 297504 from "work/wudata_05.arc": Read 297504
[05:13:06] - Reading up to 478844 from "work/wudata_05.xtc": Read 478844
[05:13:06] goefile size: 0
[05:13:06] logfile size: 59666
[05:13:06] Leaving Run
[05:13:08] - Writing 968672 bytes of core data to disk...
[05:13:09] Done: 968160 -> 781937 (compressed to 80.7 percent)
[05:13:09] ... Done.
[05:13:09] - Shutting down core
[05:13:09]
[05:13:09] Folding@home Core Shutdown: FINISHED_UNIT
[05:13:12] CoreStatus = 64 (100)
[05:13:12] Unit 5 finished with 97 percent of time to deadline remaining.
[05:13:12] Updated performance fraction: 0.943991
[05:13:12] Sending work to server
[05:13:12] + Attempting to send results
[05:13:12] - Reading file work/wuresults_05.dat from core
[05:13:12] (Read 782449 bytes from disk)
[05:13:12] Connecting to http://171.64.122.82:80/
[05:13:24] Posted data.
[05:13:24] Initial: 0000; - Uploaded at ~63 kB/s
[05:13:24] - Averaged speed for that direction ~67 kB/s
[05:13:24] + Results successfully sent
[05:13:24] Thank you for your contribution to Folding@Home.
[05:13:24] + Number of Units Completed: 5
[05:13:28] Trying to send all finished work units
[05:13:28] + No unsent completed units remaining.
[05:13:28] - Preparing to get new work unit...
[05:13:28] + Attempting to get work packet
[05:13:28] - Will indicate memory of 2013 MB
[05:13:28] - Connecting to assignment server
[05:13:28] Connecting to http://assign.stanford.edu:8080/
[05:13:28] - Couldn't send HTTP request to server
[05:13:28] + Could not connect to Assignment Server
[05:13:28] Connecting to http://assign2.stanford.edu:80/
[05:13:30] - Couldn't send HTTP request to server
[05:13:30] + Could not connect to Assignment Server 2
[05:13:30] + Couldn't get work instructions.
[05:13:30] - Error: Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[05:13:41] + Attempting to get work packet
[05:13:41] - Will indicate memory of 2013 MB
[05:13:41] - Connecting to assignment server
[05:13:41] Connecting to http://assign.stanford.edu:8080/
[05:13:41] - Couldn't send HTTP request to server
[05:13:41] + Could not connect to Assignment Server
[05:13:41] Connecting to http://assign2.stanford.edu:80/
[05:13:42] - Couldn't send HTTP request to server
[05:13:42] + Could not connect to Assignment Server 2
[05:13:42] + Couldn't get work instructions.
[05:13:42] - Error: Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
Log from the 6.01beta4 client:
--- Opening Log file [February 25 22:09:20]
# Windows Console Edition #####################################################
###############################################################################
Folding@Home Client Version 6.01beta4
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Program Files\fah
Service: C:\Program Files\fah\fah6-win-x86-console.exe
Arguments: -svcstart
Launched as a service.
Entered C:\Program Files\fah to do work.
[22:09:20] - Ask before connecting: No
[22:09:20] - User name: Torin3 (Team 32)
[22:09:20] - User ID: 32A551543679AFAF
[22:09:20] - Machine ID: 1
[22:09:20]
[22:09:20] Loaded queue successfully.
[22:09:20]
[22:09:20] + Processing work unit
[22:09:20] Core required: FahCore_81.exe
[22:09:20] Core found.
[22:09:20] Working on Unit 01 [February 25 22:09:20]
[22:09:20] + Working ...
[22:09:20]
[22:09:20] *------------------------------*
[22:09:20] Folding@Home Gromacs Simulated Tempering Core
[22:09:20] Version 1.10 (Oct 4, 2007)
[22:09:20]
[22:09:20] Preparing to commence simulation
[22:09:20] - Looking at optimizations...
[22:09:20] - Files status OK
[22:09:21] - Expanded 365631 -> 1800420 (decompressed 492.4 percent)
[22:09:21]
[22:09:21] Project: 3640 (Run 14, Clone 9, Gen 8)
[22:09:21]
[22:09:21] Assembly optimizations on if available.
[22:09:21] Entering M.D.
[22:09:41] (Starting from checkpoint)
[22:09:41] Protein: p3640_Seq14_Amber03_Extended
[22:09:41]
[22:09:41] Writing local files
[22:10:27] Completed 1365000 out of 1500000 steps (91%)
[22:10:27] Extra SSE boost OK.
[22:49:54] Writing local files
[22:49:54] Completed 1380000 out of 1500000 steps (92%)
[23:28:08] Writing local files
[23:28:08] Completed 1395000 out of 1500000 steps (93%)
[00:06:25] Writing local files
[00:06:25] Completed 1410000 out of 1500000 steps (94%)
[00:44:33] Writing local files
[00:44:33] Completed 1425000 out of 1500000 steps (95%)
[01:22:41] Writing local files
[01:22:41] Completed 1440000 out of 1500000 steps (96%)
[02:00:55] Writing local files
[02:00:55] Completed 1455000 out of 1500000 steps (97%)
[02:39:03] Writing local files
[02:39:03] Completed 1470000 out of 1500000 steps (98%)
[03:17:13] Writing local files
[03:17:13] Completed 1485000 out of 1500000 steps (99%)
[03:55:26] Writing local files
[03:55:26] Completed 1500000 out of 1500000 steps (100%)
[03:55:26] Writing final coordinates.
[03:55:26] Past main M.D. loop
[03:56:26]
[03:56:26] Finished Work Unit:
[03:56:26] - Reading up to 297528 from "work/wudata_01.arc": Read 297528
[03:56:26] - Reading up to 507832 from "work/wudata_01.xtc": Read 507832
[03:56:26] goefile size: 0
[03:56:26] logfile size: 110214
[03:56:27] Leaving Run
[03:56:27] - Writing 1061231 bytes of core data to disk...
[03:56:28] Done: 1060719 -> 810249 (compressed to 76.3 percent)
[03:56:28] ... Done.
[03:56:28] - Shutting down core
[03:56:28]
[03:56:28] Folding@home Core Shutdown: FINISHED_UNIT
[03:56:31] CoreStatus = 64 (100)
[03:56:31] Sending work to server
[03:56:31] - Read packet limit of 540015616... Set to 524286976.
[03:56:31] + Attempting to send results
[03:56:33] - Couldn't send HTTP request to server
[03:56:33] + Could not connect to Work Server (results)
[03:56:33] (171.64.122.82:80)
[03:56:33] - Error: Could not transmit unit 01 (completed February 26) to work server.
[03:56:33] Keeping unit 01 in queue.
[03:56:33] - Read packet limit of 540015616... Set to 524286976.
[03:56:33] + Attempting to send results
[03:56:35] - Couldn't send HTTP request to server
[03:56:35] + Could not connect to Work Server (results)
[03:56:35] (171.64.122.82:80)
[03:56:35] - Error: Could not transmit unit 01 (completed February 26) to work server.
[03:56:35] - Read packet limit of 540015616... Set to 524286976.
[03:56:35] + Attempting to send results
[03:56:36] - Couldn't send HTTP request to server
[03:56:36] + Could not connect to Work Server (results)
[03:56:36] (171.64.122.76:80)
[03:56:36] Could not transmit unit 01 to Collection server; keeping in queue.
[03:56:36] - Preparing to get new work unit...
[03:56:36] + Attempting to get work packet
[03:56:36] - Connecting to assignment server
[03:56:44] - Couldn't send HTTP request to server
[03:56:44] + Could not connect to Assignment Server
[03:56:51] - Couldn't send HTTP request to server
[03:56:51] + Could not connect to Assignment Server 2
[03:56:51] + Couldn't get work instructions.
[03:56:51] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[03:57:03] + Attempting to get work packet
[03:57:03] - Connecting to assignment server
[03:57:04] - Couldn't send HTTP request to server
[03:57:04] + Could not connect to Assignment Server
[03:57:06] - Couldn't send HTTP request to server
[03:57:06] + Could not connect to Assignment Server 2
[03:57:06] + Couldn't get work instructions.
[03:57:06] - Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
Log from the SMP 5.91beta6 client:
--- Opening Log file [February 26 11:10:54]
# SMP Client ##################################################################
###############################################################################
Folding@Home Client Version 5.91beta6
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Program Files\fah
Executable: C:\Program Files\fah\fah.exe
Arguments: -verbosity 9 -forceasm -advmethods
Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.
[11:10:54] - Ask before connecting: No
[11:10:54] - User name: Torin3 (Team 32)
[11:10:54] - User ID: 4FC70C1768D014E8
[11:10:55] - Machine ID: 1
[11:10:55]
[11:10:55] Loaded queue successfully.
[11:10:55]
[11:10:55] - Autosending finished units...
[11:10:55] + Processing work unit
[11:10:55] Trying to send all finished work units
[11:10:55] Core required: FahCore_a1.exe
[11:10:56] + No unsent completed units remaining.
[11:10:56] Core found.
[11:10:56] - Autosend completed
[11:10:56] Working on Unit 07 [February 26 11:10:56]
[11:10:56] + Working ...
[11:10:56] - Calling 'mpiexec -channel auto -np 4 FahCore_a1.exe -dir work/ -suffix 07 -checkpoint 30 -forceasm -verbose -lifeline 2184 -version 591'
[11:10:57]
[11:10:57] *------------------------------*
[11:10:57] Folding@Home Gromacs SMP Core
[11:10:57] Version 1.74 (March 10, 2007)
[11:10:57]
[11:10:58] Preparing to commence simulation
[11:10:58] - Ensuring status. Please wait.
[11:11:14] - Assembly optimizations manually forced on.
[11:11:14] - Not checking prior termination.
[11:11:20] - Expanded 2965840 -> 15212615 (decompressed 512.9 percent)
[11:11:21]
[11:11:21] Project: 2653 (Run 15, Clone 188, Gen 63)
[11:11:21]
[11:11:24] Assembly optimizations on if available.
[11:11:24] Entering M.D.
[11:11:30] Calling FAH init
[11:11:31] in POPC
[11:11:31] Writing local files
[11:11:31] checkpoint)
[11:11:31] Read checkpoint
[11:11:31] Protein: Protein in POPC
[11:11:31] ra SSE boost OK.
[11:11:31] es
[11:11:31] Completed 465000 out of 500000 steps (93 percent)
[11:11:32] Extra SSE boost OK.
[11:27:25] Writing local files
[11:27:25] Completed 470000 out of 500000 steps (94 percent)
[11:41:57] Writing local files
[11:41:57] Completed 475000 out of 500000 steps (95 percent)
[11:57:34] Writing local files
[11:57:34] Completed 480000 out of 500000 steps (96 percent)
[12:13:26] Writing local files
[12:13:26] Completed 485000 out of 500000 steps (97 percent)
[12:33:50] Writing local files
[12:33:50] Completed 490000 out of 500000 steps (98 percent)
[12:49:17] Writing local files
[12:49:17] Completed 495000 out of 500000 steps (99 percent)
[13:04:39] Writing local files
[13:04:40] Completed 500000 out of 500000 steps (100 percent)
[13:04:40] Writing final coordinates.
[13:04:41] Past main M.D. loop
[13:04:41] Will end MPI now
[13:05:41]
[13:05:41] Finished Work Unit:
[13:05:41] - Reading up to 3724272 from "work/wudata_07.arc": Read 3724272
[13:05:42] - Reading up to 1780324 from "work/wudata_07.xtc": Read 1780324
[13:05:43] goefile size: 0
[13:05:43] logfile size: 0
[13:05:43] Warning: Core could not open logfile.
[13:05:43] Leaving Run
[13:05:47] - Writing 5508996 bytes of core data to disk...
[13:05:48] ... Done.
[13:05:48] - Failed to delete work/wudata_07.sas
[13:05:48] - Failed to delete work/wudata_07.goe
[13:05:48] Warning: check for stray files
[13:05:48] - Shutting down core
[13:07:48]
[13:07:48] Folding@home Core Shutdown: FINISHED_UNIT
[13:07:48]
[13:07:48] Folding@home Core Shutdown: FINISHED_UNIT
[13:07:51] CoreStatus = 64 (100)
[13:07:51] Unit 7 finished with 71 percent of time to deadline remaining.
[13:07:51] Updated performance fraction: 0.716120
[13:07:51] Sending work to server
[13:07:51] + Attempting to send results
[13:07:51] - Reading file work/wuresults_07.dat from core
[13:07:51] (Read 5508996 bytes from disk)
[13:07:51] Connecting to http://171.64.65.64:80/
[13:09:11] Posted data.
[13:09:12] Initial: 0000; - Uploaded at ~65 kB/s
[13:09:13] - Averaged speed for that direction ~66 kB/s
[13:09:13] + Results successfully sent
[13:09:13] Thank you for your contribution to Folding@Home.
[13:09:13] + Number of Units Completed: 7
[13:11:59] - Warning: Could not delete all work unit files (7): Core returned invalid code
[13:11:59] Trying to send all finished work units
[13:11:59] + No unsent completed units remaining.
[13:11:59] - Preparing to get new work unit...
[13:11:59] + Attempting to get work packet
[13:11:59] - Will indicate memory of 2045 MB
[13:11:59] - Connecting to assignment server
[13:11:59] Connecting to http://assign.stanford.edu:8080/
[13:11:59] - Couldn't send HTTP request to server
[13:11:59] + Could not connect to Assignment Server
[13:11:59] Connecting to http://assign2.stanford.edu:80/
[13:12:00] - Couldn't send HTTP request to server
[13:12:00] + Could not connect to Assignment Server 2
[13:12:00] + Couldn't get work instructions.
[13:12:00] - Error: Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[13:12:13] + Attempting to get work packet
[13:12:13] - Will indicate memory of 2045 MB
[13:12:13] - Connecting to assignment server
[13:12:13] Connecting to http://assign.stanford.edu:8080/
[13:12:13] - Couldn't send HTTP request to server
[13:12:13] + Could not connect to Assignment Server
[13:12:13] Connecting to http://assign2.stanford.edu:80/
[13:12:14] - Couldn't send HTTP request to server
[13:12:14] + Could not connect to Assignment Server 2
[13:12:14] + Couldn't get work instructions.
[13:12:14] - Error: Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[13:12:26] + Attempting to get work packet
[13:12:26] - Will indicate memory of 2045 MB
[13:12:26] - Connecting to assignment server
[13:12:26] Connecting to http://assign.stanford.edu:8080/
[13:12:26] - Couldn't send HTTP request to server
[13:12:26] + Could not connect to Assignment Server
[13:12:26] Connecting to http://assign2.stanford.edu:80/
[13:12:27] - Couldn't send HTTP request to server
[13:12:27] + Could not connect to Assignment Server 2
[13:12:27] + Couldn't get work instructions.
[13:12:27] - Error: Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
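With several boxes involved, it may be easier to tally the failures than to read each log by eye. The following is a rough sketch, assuming the log lines keep the "[hh:mm:ss] + Could not connect to ..." shape seen in the logs above. The function name `summarize_failures` is hypothetical and not part of the client.

```python
import re
from collections import Counter

# Matches client log lines such as
# "[05:13:30] + Could not connect to Assignment Server 2"
FAILURE = re.compile(r"^\[(\d\d:\d\d:\d\d)\] \+ Could not connect to (.+?)\s*$")


def summarize_failures(lines):
    """Count 'Could not connect' lines per server label."""
    counts = Counter()
    for line in lines:
        match = FAILURE.match(line.strip())
        if match:
            # group(2) is the label after "Could not connect to".
            counts[match.group(2)] += 1
    return counts
```

Passing it an open log file (for example `summarize_failures(open("FAHlog.txt"))`, assuming that is the log file name on your install) should show at a glance whether a box is failing only against the assignment servers or against the work servers as well.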