Page 1 of 4
CSs 171.67.108.17 - 171.65.103.100 - 171.67.108.25
Posted: Thu Feb 26, 2009 2:13 pm
by SyntorX
I have also been having probs with the 171.67.108.17:8080 and :80 collection servers.. Have been getting rejections for roughly 24 hours now.
Code: Select all
[01:33:45] + Attempting to get work packet
[01:33:45] - Connecting to assignment server
[01:33:45] - Successful: assigned to (171.64.122.136).
[01:33:45] + News From Folding@Home: Welcome to Folding@Home
[01:33:45] Loaded queue successfully.
[01:33:49] + Closed connections
[01:33:49]
[01:33:49] + Processing work unit
[01:33:49] Core required: FahCore_78.exe
[01:33:49] Core found.
[01:33:49] Working on queue slot 04 [February 23 01:33:49 UTC]
[01:33:49] + Working ...
[01:33:49]
[01:33:49] *------------------------------*
[01:33:49] Folding@Home Gromacs Core
[01:33:49] Version 1.90 (March 8, 2006)
[01:33:49]
[01:33:49] Preparing to commence simulation
[01:33:49] - Looking at optimizations...
[01:33:49] - Created dyn
[01:33:49] - Files status OK
[01:33:49] - Expanded 238942 -> 1190885 (decompressed 498.3 percent)
[01:33:49] - Starting from initial work packet
[01:33:49]
[01:33:49] Project: 2527 (Run 19, Clone 13, Gen 41)
[01:33:49]
[01:33:49] Assembly optimizations on if available.
[01:33:49] Entering M.D.
[01:33:55] Protein: p2527_Am22-43
Re: Problems with CS 171.67.108.17
Posted: Sun Mar 01, 2009 4:03 am
by MstrBlstr
That log snip only shows you getting work from server 171.64.122.136.
Where does it say anything pertaining to this thread topic?
Edit by Mod: Thread split into a new topic.
=b
Re: Problems with CS 171.67.108.17
Posted: Sun Mar 01, 2009 7:58 am
by bruce
When a WU is completed, the client attempts to return it to two servers. First, it tries the primary Work Server which issued it, and if that fails, it will try to send it to a Collection Server such as 171.67.108.17. We do need to see the portion of the log identifying both servers to be helpful. Specifically, we need to know the status of the primary Work Server for your assignment(s).
All of the Collection Servers have been running at their maximum capacity for quite a while and when the servers mentioned in the announcements went down, it made matters worse. The new server code that is soon to be available is expected to cure this issue quite quickly.
Welcome to the foldingforum, SyntorX
Re: Problems with CS 171.67.108.17
Posted: Sun Mar 01, 2009 2:33 pm
by MstrBlstr
Yes, welcome SyntorX.
I apologize for the abrupt statement that I made before. I should have explained a bit more carefully, which Bruce has done well. I really didn't mean to sound rude, it was not my intention. I was in a hurry, and should not have posted until I had the time to respond fully.
Re: Problems with CS 171.67.108.17
Posted: Sun Mar 01, 2009 3:14 pm
by dgermann
I'm having problems with the same collection server -- unable to upload results since 2/25. Ditto for 171.64.122.136, the associated work server. Completed work units to other servers get through OK.
Code: Select all
[03:44:52] Project: 2527 (Run 60, Clone 79, Gen 12)
[03:44:52]
[03:44:52] Assembly optimizations on if available.
[03:44:52] Entering M.D.
[03:44:58] Protein: p2527_Am22-43
[03:44:58]
[03:44:58] Writing local files
[03:44:58] Extra SSE boost OK.
[03:44:58] Writing local files
[03:44:58] Completed 0 out of 2000000 steps (0%)
[04:01:39] Writing local files
[04:01:39] Completed 20000 out of 2000000 steps (1%)
.....
[07:17:22] Completed 1980000 out of 2000000 steps (99%)
[07:35:07] Writing local files
[07:35:08] Completed 2000000 out of 2000000 steps (100%)
[07:35:08] Writing final coordinates.
[07:35:08] Past main M.D. loop
[07:36:08]
[07:36:08] Finished Work Unit:
[07:36:08] - Reading up to 193464 from "work/wudata_08.arc": Read 193464
[07:36:08] - Reading up to 51712 from "work/wudata_08.xtc": Read 51712
[07:36:08] goefile size: 0
[07:36:08] logfile size: 87630
[07:36:08] Leaving Run
[07:36:09] - Writing 382550 bytes of core data to disk...
[07:36:09] ... Done.
[07:36:09] - Shutting down core
[07:36:09]
[07:36:09] Folding@home Core Shutdown: FINISHED_UNIT
[07:36:09] CoreStatus = 64 (100)
[07:36:09] Sending work to server
[07:36:09] + Attempting to send results
[07:36:09] - Couldn't send HTTP request to server
[07:36:09] + Could not connect to Work Server (results)
[07:36:09] (171.64.122.136:8080)
[07:36:09] - Error: Could not transmit unit 08 (completed February 25) to work server.
[07:36:09] Keeping unit 08 in queue.
[07:36:09] + Attempting to send results
[07:36:09] - Couldn't send HTTP request to server
[07:36:09] + Could not connect to Work Server (results)
[07:36:09] (171.64.122.136:8080)
[07:36:09] - Error: Could not transmit unit 08 (completed February 25) to work server.
[07:36:09] + Attempting to send results
[07:36:09] - Couldn't send HTTP request to server
[07:36:09] + Could not connect to Work Server (results)
[07:36:09] (171.67.108.17:8080)
[07:36:09] Could not transmit unit 08 to Collection server; keeping in queue.
[07:36:09] - Preparing to get new work unit...
.....
(lots more failures to send completed work unit over the next several days)
(next failure is from about 2 hours ago)
.....
[13:01:53] + Attempting to send results
[13:01:53] - Couldn't send HTTP request to server
[13:01:53] + Could not connect to Work Server (results)
[13:01:53] (171.64.122.136:8080)
[13:01:53] - Error: Could not transmit unit 08 (completed February 25) to work server.
[13:01:53] + Attempting to send results
[13:01:53] - Couldn't send HTTP request to server
[13:01:53] + Could not connect to Work Server (results)
[13:01:53] (171.67.108.17:8080)
[13:01:53] Could not transmit unit 08 to Collection server; keeping in queue.
[13:08:50] Writing local files
And on a related note, I've got a Windows machine that's having problems sending its completed units to 128.59.74.4 and 171.65.103.100. I'm running two clients (one for each CPU), and both are having the same problem sending a completed unit back. One of the clients was able to send back a different completed work unit to another server.
CPU1:
Code: Select all
[13:44:03] Project: 3858 (Run 9951, Clone 0, Gen 22)
[13:44:03] + Attempting to send results [March 1 13:44:03 UTC]
[13:44:24] - Couldn't send HTTP request to server
[13:44:24] + Could not connect to Work Server (results)
[13:44:24] (128.59.74.4:8080)
[13:44:24] + Retrying using alternative port
[13:44:45] - Couldn't send HTTP request to server
[13:44:45] + Could not connect to Work Server (results)
[13:44:45] (128.59.74.4:80)
[13:44:45] - Error: Could not transmit unit 09 (completed February 27) to work server.
[13:44:45] + Attempting to send results [March 1 13:44:45 UTC]
[13:44:46] - Couldn't send HTTP request to server
[13:44:46] (Got status 503)
[13:44:46] + Could not connect to Work Server (results)
[13:44:46] (171.65.103.100:8080)
[13:44:46] + Retrying using alternative port
[13:44:46] - Couldn't send HTTP request to server
[13:44:46] (Got status 503)
[13:44:46] + Could not connect to Work Server (results)
[13:44:46] (171.65.103.100:80)
[13:44:46] Could not transmit unit 09 to Collection server; keeping in queue.
CPU2:
Code: Select all
[13:44:20] Project: 3859 (Run 4521, Clone 0, Gen 15)
[13:44:20] + Attempting to send results [March 1 13:44:20 UTC]
[13:44:41] - Couldn't send HTTP request to server
[13:44:41] + Could not connect to Work Server (results)
[13:44:41] (128.59.74.4:8080)
[13:44:41] + Retrying using alternative port
[13:45:02] - Couldn't send HTTP request to server
[13:45:02] + Could not connect to Work Server (results)
[13:45:02] (128.59.74.4:80)
[13:45:02] - Error: Could not transmit unit 01 (completed February 27) to work server.
[13:45:02] + Attempting to send results [March 1 13:45:02 UTC]
[13:45:02] - Couldn't send HTTP request to server
[13:45:02] (Got status 503)
[13:45:02] + Could not connect to Work Server (results)
[13:45:02] (171.65.103.100:8080)
[13:45:02] + Retrying using alternative port
[13:45:02] - Couldn't send HTTP request to server
[13:45:02] (Got status 503)
[13:45:02] + Could not connect to Work Server (results)
[13:45:02] (171.65.103.100:80)
[13:45:02] Could not transmit unit 01 to Collection server; keeping in queue.
Re: Problems with CS 171.67.108.17
Posted: Sun Mar 01, 2009 4:46 pm
by MstrBlstr
dgermann wrote:Ditto for 171.64.122.136, the associated work server. Completed work units to other servers get through OK.
Server .136 is in reject mode, and has been reported >> viewtopic.php?p=85884#p85884 .
dgermann wrote:And on a related note, I've got a Windows machine that's having problems sending its completed units to 128.59.74.4 and 171.65.103.100.
Issues with 74.4 were announced here >> viewtopic.php?p=85096#p85096. Might want to keep an eye out for an update there.
As Bruce stated, the CS's have been under heavy load.
Re: Problems with CS 171.67.108.17
Posted: Mon Mar 02, 2009 3:15 am
by bruce
MstrBlstr wrote:As Bruce stated, the CS's have been under heavy load.
The collection servers are currently:
171.65.103.100 and
171.67.108.17 and
171.67.108.25
Posting about problems with any of them isn't likely to help since they're all saturated and keeping the WU in your local queue is the only option available unless you're very lucky. The Pande Group can do nothing about it until new resources can be brought on-line . . . which is probably going to be when the new server code completes testing and can be rolled out to the various servers.
Because of that, I'm locking this thread.
Do check the status of your Work Server on the ServerStatus page and if it needs help, you'll probably find a thread already discussing it's status.
171.64.65.71 accepting... but
Posted: Sat Feb 06, 2010 4:48 pm
by tobor
Cannot connect to server and Server does not have record of this unit....
Code: Select all
--- Opening Log file [February 6 15:35:59 UTC]
# Windows GPU Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.23
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Documents and Settings\steve\Application Data\Folding@home-gpu
Arguments: -gpu 0
[15:35:59] - Ask before connecting: No
[15:35:59] - User name: stv911 (Team 4)
[15:35:59] - User ID: BE4069840F022A4
[15:35:59] - Machine ID: 2
[15:35:59]
[15:35:59] Loaded queue successfully.
[15:35:59] Initialization complete
[15:35:59]
[15:35:59] + Processing work unit
[15:35:59] Project: 10102 (Run 167, Clone 7, Gen 3)
[15:35:59] - Read packet limit of 540015616... Set to 524286976.
[15:35:59] + Attempting to send results [February 6 15:35:59 UTC]
[15:35:59] Core required: FahCore_11.exe
[15:35:59] Core found.
[15:35:59] Working on queue slot 00 [February 6 15:35:59 UTC]
[15:35:59] + Working ...
[15:35:59]
[15:35:59] *------------------------------*
[15:35:59] Folding@Home GPU Core
[15:35:59] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[15:35:59]
[15:35:59] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[15:35:59] Build host: amoeba
[15:35:59] Board Type: Nvidia
[15:35:59] Core :
[15:35:59] Preparing to commence simulation
[15:35:59] - Looking at optimizations...
[15:35:59] - Files status OK
[15:35:59] - Expanded 65060 -> 344387 (decompressed 529.3 percent)
[15:35:59] Called DecompressByteArray: compressed_data_size=65060 data_size=344387, decompressed_data_size=344387 diff=0
[15:35:59] - Digital signature verified
[15:35:59]
[15:35:59] Project: 5781 (Run 31, Clone 121, Gen 2)
[15:35:59]
[15:35:59] Assembly optimizations on if available.
[15:35:59] Entering M.D.
[15:36:05] Will resume from checkpoint file
[15:36:05] Tpr hash work/wudata_00.tpr: 4203502414 2089759323 3102399191 97257872 3050345625
[15:36:05]
[15:36:05] Calling fah_main args: 14 usage=100
[15:36:05]
[15:36:05] - Couldn't send HTTP request to server
[15:36:05] + Could not connect to Work Server (results)
[15:36:05] (171.64.65.71:8080)
[15:36:05] + Retrying using alternative port
[15:36:05] Working on Great Red Owns Many ACres of Sand
[15:36:06] Client config found, loading data.
[15:36:06] Starting GUI Server
[15:36:06] Resuming from checkpoint
[15:36:06] fcCheckPointResume: retreived and current tpr file hash:
[15:36:06] 0 4203502414 4203502414
[15:36:06] 1 2089759323 2089759323
[15:36:06] 2 3102399191 3102399191
[15:36:06] 3 97257872 97257872
[15:36:06] 4 3050345625 3050345625
[15:36:06] fcCheckPointResume: file hashes same.
[15:36:06] fcCheckPointResume: state restored.
[15:36:06] Verified work/wudata_00.log
[15:36:06] Verified work/wudata_00.edr
[15:36:06] Verified work/wudata_00.xtc
[15:36:06] Completed 86%
[15:36:17] - Couldn't send HTTP request to server
[15:36:17] + Could not connect to Work Server (results)
[15:36:17] (171.64.65.71:80)
[15:36:17] - Error: Could not transmit unit 03 (completed February 6) to work server.
[15:36:17] - Read packet limit of 540015616... Set to 524286976.
[15:36:17] + Attempting to send results [February 6 15:36:17 UTC]
[15:36:31] - Server does not have record of this unit. Will try again later.
[15:36:31] Could not transmit unit 03 to Collection server; keeping in queue.
[15:37:17] Completed 87%
[15:38:27] Completed 88%
[15:39:37] Completed 89%
[15:40:48] Completed 90%
[15:41:58] Completed 91%
[15:43:09] Completed 92%
[15:44:19] Completed 93%
[15:45:30] Completed 94%
[15:46:40] Completed 95%
[15:47:50] Completed 96%
[15:49:01] Completed 97%
[15:50:11] Completed 98%
[15:51:22] Completed 99%
[15:52:32] Completed 100%
[15:52:32] Successful run
[15:52:32] DynamicWrapper: Finished Work Unit: sleep=10000
[15:52:42] Reserved 147416 bytes for xtc file; Cosm status=0
[15:52:42] Allocated 147416 bytes for xtc file
[15:52:42] - Reading up to 147416 from "work/wudata_00.xtc": Read 147416
[15:52:42] Read 147416 bytes from xtc file; available packet space=786283048
[15:52:42] xtc file hash check passed.
[15:52:42] Reserved 22248 22248 786283048 bytes for arc file=<work/wudata_00.trr> Cosm status=0
[15:52:42] Allocated 22248 bytes for arc file
[15:52:42] - Reading up to 22248 from "work/wudata_00.trr": Read 22248
[15:52:42] Read 22248 bytes from arc file; available packet space=786260800
[15:52:42] trr file hash check passed.
[15:52:42] Allocated 560 bytes for edr file
[15:52:42] Read bedfile
[15:52:42] edr file hash check passed.
[15:52:42] Logfile not read.
[15:52:42] GuardedRun: success in DynamicWrapper
[15:52:42] GuardedRun: done
[15:52:42] Run: GuardedRun completed.
[15:52:44] + Opened results file
[15:52:44] - Writing 170736 bytes of core data to disk...
[15:52:44] Done: 170224 -> 168788 (compressed to 99.1 percent)
[15:52:44] ... Done.
[15:52:44] DeleteFrameFiles: successfully deleted file=work/wudata_00.ckp
[15:52:44] Shutting down core
[15:52:44]
[15:52:44] Folding@home Core Shutdown: FINISHED_UNIT
[15:52:47] CoreStatus = 64 (100)
[15:52:47] Sending work to server
[15:52:47] Project: 5781 (Run 31, Clone 121, Gen 2)
[15:52:47] - Read packet limit of 540015616... Set to 524286976.
[15:52:47] + Attempting to send results [February 6 15:52:47 UTC]
[15:53:06] + Results successfully sent
[15:53:06] Thank you for your contribution to Folding@Home.
[15:53:06] + Number of Units Completed: 18
[15:53:10] Project: 10102 (Run 167, Clone 7, Gen 3)
[15:53:10] - Read packet limit of 540015616... Set to 524286976.
[15:53:10] + Attempting to send results [February 6 15:53:10 UTC]
[15:53:18] - Couldn't send HTTP request to server
[15:53:18] + Could not connect to Work Server (results)
[15:53:18] (171.64.65.71:8080)
[15:53:18] + Retrying using alternative port
[15:53:27] - Couldn't send HTTP request to server
[15:53:27] + Could not connect to Work Server (results)
[15:53:27] (171.64.65.71:80)
[15:53:27] - Error: Could not transmit unit 03 (completed February 6) to work server.
[15:53:27] - Read packet limit of 540015616... Set to 524286976.
[15:53:27] + Attempting to send results [February 6 15:53:27 UTC]
[15:53:45] - Server does not have record of this unit. Will try again later.
[15:53:45] Could not transmit unit 03 to Collection server; keeping in queue.
[15:53:45] - Preparing to get new work unit...
[15:53:45] + Attempting to get work packet
[15:53:45] - Connecting to assignment server
[15:53:45] - Successful: assigned to (171.67.108.21).
[15:53:45] + News From Folding@Home: Welcome to Folding@Home
[15:53:45] Loaded queue successfully.
[15:53:46] Project: 10102 (Run 167, Clone 7, Gen 3)
[15:53:46] - Read packet limit of 540015616... Set to 524286976.
[15:53:46] + Attempting to send results [February 6 15:53:46 UTC]
[15:53:52] - Couldn't send HTTP request to server
[15:53:52] + Could not connect to Work Server (results)
[15:53:52] (171.64.65.71:8080)
[15:53:52] + Retrying using alternative port
[15:53:59] - Couldn't send HTTP request to server
[15:53:59] + Could not connect to Work Server (results)
[15:53:59] (171.64.65.71:80)
[15:53:59] - Error: Could not transmit unit 03 (completed February 6) to work server.
[15:53:59] - Read packet limit of 540015616... Set to 524286976.
[15:53:59] + Attempting to send results [February 6 15:53:59 UTC]
[15:54:06] - Server does not have record of this unit. Will try again later.
[15:54:06] Could not transmit unit 03 to Collection server; keeping in queue.
[15:54:06] + Closed connections
[15:54:06]
[15:54:06] + Processing work unit
[15:54:06] Core required: FahCore_11.exe
[15:54:06] Core found.
[15:54:06] Working on queue slot 01 [February 6 15:54:06 UTC]
[15:54:06] + Working ...
[15:54:06]
[15:54:06] *------------------------------*
[15:54:06] Folding@Home GPU Core
[15:54:06] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[15:54:06]
[15:54:06] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[15:54:06] Build host: amoeba
[15:54:06] Board Type: Nvidia
[15:54:06] Core :
[15:54:06] Preparing to commence simulation
[15:54:06] - Looking at optimizations...
[15:54:06] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[15:54:06] - Created dyn
[15:54:06] - Files status OK
[15:54:06] - Expanded 65451 -> 344335 (decompressed 526.0 percent)
[15:54:06] Called DecompressByteArray: compressed_data_size=65451 data_size=344335, decompressed_data_size=344335 diff=0
[15:54:06] - Digital signature verified
[15:54:06]
[15:54:06] Project: 5782 (Run 9, Clone 63, Gen 14)
[15:54:06]
[15:54:06] Assembly optimizations on if available.
[15:54:06] Entering M.D.
[15:54:12] Tpr hash work/wudata_01.tpr: 2362307664 1886526845 643368671 1159712277 3155362848
[15:54:12]
[15:54:12] Calling fah_main args: 14 usage=100
[15:54:12]
[15:54:12] Working on Giving Russians Opium May Alter Current Situation
[15:54:13] Client config found, loading data.
[15:54:13] Starting GUI Server
[15:55:24] Completed 1%
[15:56:34] Completed 2%
[15:57:45] Completed 3%
[15:58:56] Completed 4%
[16:00:07] Completed 5%
[16:01:17] Completed 6%
[16:02:28] Completed 7%
[16:03:39] Completed 8%
[16:04:49] Completed 9%
[16:06:00] Completed 10%
[16:07:11] Completed 11%
[16:08:21] Completed 12%
[16:09:32] Completed 13%
[16:10:43] Completed 14%
[16:11:53] Completed 15%
[16:13:04] Completed 16%
[16:14:15] Completed 17%
[16:15:25] Completed 18%
[16:16:36] Completed 19%
[16:17:47] Completed 20%
[16:18:58] Completed 21%
[16:20:08] Completed 22%
[16:21:19] Completed 23%
[16:22:30] Completed 24%
[16:23:40] Completed 25%
[16:24:51] Completed 26%
[16:26:02] Completed 27%
[16:27:12] Completed 28%
[16:28:23] Completed 29%
[16:29:36] Completed 30%
[16:31:14] Completed 31%
[16:32:53] Completed 32%
[16:35:16] Completed 33%
[16:36:37] Completed 34%
[16:37:49] Completed 35%
[16:39:48] Completed 36%
[16:41:46] Completed 37%
Re: 171.64.65.71 accepting... but
Posted: Sat Feb 06, 2010 7:32 pm
by bruce
Server 171.64.65.71 is currently accepting some 2000 uploads per hour. I'm not sure why your client is having trouble connecting -- or whether others are also experiencing a similar problem.
The message "Read packet limit of 540015616... Set to 524286976." bothers me. That often means that the WU is too big to upload. Have you configured your client for Big WUs or is it set to Normal? Has the WU expired?
There is a distinct possibility that because of the size problem, the server may have removed the WU from the list of WUs assigned to you. I'm not sure how that part of the server code works. You may have already received credit (partial or full, I don't know) for that WU but there's no way for me to check.
Re: 171.64.65.71 accepting... but
Posted: Sat Feb 06, 2010 9:46 pm
by tobor
The client was set fr big WUs . I just set it to normal.
I dont know if the WU is expired or not...
thnx
Re: 171.64.65.71 accepting... but
Posted: Mon Feb 08, 2010 12:39 am
by Bobby-Uschi
I also have problems with this server
"Could not send HTTP request to server
[23:53:19] + Could not connect to Work Server (results)
[23:53:19] (171.64.65.71:80)
[23:53:19] - Error: Could not transmit unit 01 completed (February 6) to work server.
[23:53:19] - Read packet limit of 540,015,616th .. Set to 524,286,976th "
And 5 more
Re: 171.64.65.71 accepting... but
Posted: Tue Feb 09, 2010 8:58 am
by lambdapro
This morning, I am having an issue with this server:
[08:33:03] Run: GuardedRun completed.
[08:33:06] + Opened results file
[08:33:06] - Writing 132540 bytes of core data to disk...
[08:33:06] Done: 132028 -> 131537 (compressed to 99.6 percent)
[08:33:06] ... Done.
[08:33:06] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[08:33:06] Shutting down core
[08:33:06]
[08:33:06] Folding@home Core Shutdown: FINISHED_UNIT
[08:33:09] CoreStatus = 64 (100)
[08:33:09] Sending work to server
[08:33:09] Project: 10103 (Run 875, Clone 1, Gen 2)
[08:33:09] + Attempting to send results [February 9 08:33:09 UTC]
[08:33:12] - Couldn't send HTTP request to server
[08:33:12] + Could not connect to Work Server (results)
[08:33:12] (171.64.65.71:8080)
[08:33:12] + Retrying using alternative port
[08:33:14] - Couldn't send HTTP request to server
[08:33:14] + Could not connect to Work Server (results)
[08:33:14] (171.64.65.71:80)
[08:33:14] - Error: Could not transmit unit 02 (completed February 9) to work server.
[08:33:14] Keeping unit 02 in queue.
[08:33:14] Project: 10103 (Run 875, Clone 1, Gen 2)
[08:33:14] + Attempting to send results [February 9 08:33:14 UTC]
[08:33:17] - Couldn't send HTTP request to server
[08:33:17] + Could not connect to Work Server (results)
[08:33:17] (171.64.65.71:8080)
[08:33:17] + Retrying using alternative port
[08:33:20] - Couldn't send HTTP request to server
[08:33:20] + Could not connect to Work Server (results)
[08:33:20] (171.64.65.71:80)
[08:33:20] - Error: Could not transmit unit 02 (completed February 9) to work server.
[08:33:20] + Attempting to send results [February 9 08:33:20 UTC]
I just now started up this machine for folding and earlier in the night I got this when trying to get started:
[03:44:29] Initialization complete
[03:44:29] - Preparing to get new work unit...
[03:44:29] + Attempting to get work packet
[03:44:29] - Connecting to assignment server
[03:44:29] - Successful: assigned to (171.64.65.71).
[03:44:29] + News From Folding@Home: Welcome to Folding@Home
[03:44:29] Loaded queue successfully.
[03:44:50] - Couldn't send HTTP request to server
[03:44:50] + Could not connect to Work Server
[03:44:50] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[03:44:57] + Attempting to get work packet
[03:44:57] - Connecting to assignment server
[03:44:57] - Successful: assigned to (171.64.65.71).
[03:44:57] + News From Folding@Home: Welcome to Folding@Home
[03:44:57] Loaded queue successfully.
[03:45:19] - Couldn't send HTTP request to server
[03:45:19] + Could not connect to Work Server
[03:45:19] - Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[03:45:33] + Attempting to get work packet
[03:45:33] - Connecting to assignment server
[03:45:33] - Successful: assigned to (171.64.65.71).
[03:45:33] + News From Folding@Home: Welcome to Folding@Home
[03:45:33] Loaded queue successfully.
[03:49:31] + Could not connect to Work Server
[03:49:31] - Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[03:49:54] + Attempting to get work packet
[03:49:54] - Connecting to assignment server
[03:49:54] - Successful: assigned to (171.64.65.71).
[03:49:54] + News From Folding@Home: Welcome to Folding@Home
[03:49:54] Loaded queue successfully.
[03:50:15] - Couldn't send HTTP request to server
[03:50:15] + Could not connect to Work Server
[03:50:15] - Attempt #4 to get work failed, and no other work to do.
Waiting before retry.
[03:51:00] + Attempting to get work packet
[03:51:00] - Connecting to assignment server
[03:51:00] - Successful: assigned to (171.64.65.71).
[03:51:00] + News From Folding@Home: Welcome to Folding@Home
[03:51:00] Loaded queue successfully.
[03:51:21] - Couldn't send HTTP request to server
[03:51:21] + Could not connect to Work Server
[03:51:21] - Attempt #5 to get work failed, and no other work to do.
Waiting before retry.
[03:52:56] + Attempting to get work packet
[03:52:56] - Connecting to assignment server
[03:52:56] - Successful: assigned to (171.67.108.21).
[03:52:56] + News From Folding@Home: Welcome to Folding@Home
[03:52:56] Loaded queue successfully.
[03:52:57] + Closed connections
[03:52:57]
[03:52:57] + Processing work unit
[03:52:57] Core required: FahCore_11.exe
[03:52:57] Core not found.
[03:52:57] - Core is not present or corrupted.
[03:52:57] - Attempting to download new core...
[03:52:57] + Downloading new core: FahCore_11.exe
[03:52:58] + 10240 bytes downloaded
[03:52:58] + 20480 bytes downloaded
...
David
Re: 171.64.65.71 accepting... but
Posted: Tue Feb 09, 2010 10:22 am
by Pette Broad
I'm getting allocated to this server but I'm not getting work. The machines just sit there, no error messages or anything....
O.K, 40 minutes between attempts but still no work.
Code: Select all
[09:46:39] Folding@home Core Shutdown: FINISHED_UNIT
[09:46:42] CoreStatus = 64 (100)
[09:46:42] Sending work to server
[09:46:42] Project: 5769 (Run 2, Clone 66, Gen 1819)
[09:46:42] + Attempting to send results [February 9 09:46:42 UTC]
[09:46:51] + Results successfully sent
[09:46:51] Thank you for your contribution to Folding@Home.
[09:46:51] + Number of Units Completed: 682
[09:46:55] - Preparing to get new work unit...
[09:46:55] + Attempting to get work packet
[09:46:55] - Connecting to assignment server
[09:46:56] - Successful: assigned to (171.64.65.71).
[09:46:56] + News From Folding@Home: Welcome to Folding@Home
[09:46:56] Loaded queue successfully.
[10:28:56] + Could not connect to Work Server
[10:28:56] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[10:29:09] + Attempting to get work packet
[10:29:09] - Connecting to assignment server
[10:29:10] - Successful: assigned to (171.64.65.71).
[10:29:10] + News From Folding@Home: Welcome to Folding@Home
[10:29:10] Loaded queue successfully.
Pete
Re: 171.64.65.71 accepting... but
Posted: Tue Feb 09, 2010 12:27 pm
by Bobby-Uschi
My 4 computers are out of work
Code: Select all
0:44:45] Completed 99%
[10:45:19] Completed 100%
[10:45:19] Successful run
[10:45:19] DynamicWrapper: Finished Work Unit: sleep=10000
[10:45:29] Reserved 75940 bytes for xtc file; Cosm status=0
[10:45:29] Allocated 75940 bytes for xtc file
[10:45:29] - Reading up to 75940 from "work/wudata_08.xtc": Read 75940
[10:45:29] Read 75940 bytes from xtc file; available packet space=786354524
[10:45:29] xtc file hash check passed.
[10:45:29] Reserved 15168 15168 786354524 bytes for arc file=<work/wudata_08.trr> Cosm status=0
[10:45:29] Allocated 15168 bytes for arc file
[10:45:29] - Reading up to 15168 from "work/wudata_08.trr": Read 15168
[10:45:29] Read 15168 bytes from arc file; available packet space=786339356
[10:45:29] trr file hash check passed.
[10:45:29] Allocated 560 bytes for edr file
[10:45:29] Read bedfile
[10:45:29] edr file hash check passed.
[10:45:29] Allocated 33416 bytes for logfile
[10:45:29] Read logfile
[10:45:29] GuardedRun: success in DynamicWrapper
[10:45:29] GuardedRun: done
[10:45:29] Run: GuardedRun completed.
[10:45:30] + Opened results file
[10:45:30] - Writing 125596 bytes of core data to disk...
[10:45:30] Done: 125084 -> 99469 (compressed to 79.5 percent)
[10:45:30] ... Done.
[10:45:30] DeleteFrameFiles: successfully deleted file=work/wudata_08.ckp
[10:45:30] Shutting down core
[10:45:30]
[10:45:30] Folding@home Core Shutdown: FINISHED_UNIT
[10:45:33] CoreStatus = 64 (100)
[10:45:33] Sending work to server
[10:45:33] Project: 5771 (Run 3, Clone 222, Gen 1394)
[10:45:33] - Read packet limit of 540015616... Set to 524286976.
[10:45:33] + Attempting to send results [February 9 10:45:33 UTC]
[10:45:36] + Results successfully sent
[10:45:36] Thank you for your contribution to Folding@Home.
[10:45:36] + Number of Units Completed: 2508
[10:45:40] Project: 10103 (Run 812, Clone 9, Gen 2)
[10:45:40] - Read packet limit of 540015616... Set to 524286976.
[10:45:40] + Attempting to send results [February 9 10:45:40 UTC]
[10:46:01] - Couldn't send HTTP request to server
[10:46:01] + Could not connect to Work Server (results)
[10:46:01] (171.64.65.71:8080)
[10:46:01] + Retrying using alternative port
[10:46:22] - Couldn't send HTTP request to server
[10:46:22] + Could not connect to Work Server (results)
[10:46:22] (171.64.65.71:80)
[10:46:22] - Error: Could not transmit unit 06 (completed February 9) to work server.
[10:46:22] - Read packet limit of 540015616... Set to 524286976.
[10:46:22] + Attempting to send results [February 9 10:46:22 UTC]
[11:34:50] + Could not connect to Work Server (results)
[11:34:50] (171.67.108.26:8080)
[11:34:50] + Retrying using alternative port
[11:34:50] - Couldn't send HTTP request to server
[11:34:50] (Got status 503)
[11:34:50] + Could not connect to Work Server (results)
[11:34:50] (171.67.108.26:80)
[11:34:50] Could not transmit unit 06 to Collection server; keeping in queue.
[11:34:50] - Preparing to get new work unit...
[11:34:50] + Attempting to get work packet
[11:34:50] - Connecting to assignment server
[11:34:51] - Successful: assigned to (171.64.65.71).
[11:34:51] + News From Folding@Home: Welcome to Folding@Home
[11:34:52] Loaded queue successfully.
[12:16:52] + Could not connect to Work Server
[12:16:52] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[12:16:58] + Attempting to get work packet
[12:16:58] - Connecting to assignment server
[12:16:59] - Successful: assigned to (171.67.108.21).
[12:16:59] + News From Folding@Home: Welcome to Folding@Home
[12:16:59] Loaded queue successfully.
[12:17:02] Project: 10103 (Run 812, Clone 9, Gen 2)
[12:17:02] - Read packet limit of 540015616... Set to 524286976.
[12:17:02] + Attempting to send results [February 9 12:17:02 UTC]
[12:17:05] - Couldn't send HTTP request to server
[12:17:05] + Could not connect to Work Server (results)
[12:17:05] (171.64.65.71:8080)
[12:17:05] + Retrying using alternative port
[12:17:12] - Couldn't send HTTP request to server
[12:17:12] + Could not connect to Work Server (results)
[12:17:12] (171.64.65.71:80)
[12:17:12] - Error: Could not transmit unit 06 (completed February 9) to work server.
[12:17:12] - Read packet limit of 540015616... Set to 524286976.
What's up? I send a total of 12 WU in queue
Code: Select all
olding@Home Client Version 6.23
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Dokumente und Einstellungen\Bobby2\Anwendungsdaten\Folding@home-gpu1
Arguments: -gpu 1 -forcegpu nvidia_g80
[10:28:17] - Ask before connecting: No
[10:28:17] - User name: Bobby-Uschi (Team 34361)
[10:28:17] - Usexxxx
[10:28:17] - Machine ID: 3
[10:28:17]
[10:28:17] Loaded queue successfully.
[10:28:17] Initialization complete
[10:28:17] - Preparing to get new work unit...
[10:28:17] + Attempting to get work packet
[10:28:17] Project: 10103 (Run 835, Clone 4, Gen 2)
[10:28:17] - Read packet limit of 540015616... Set to 524286976.
[10:28:17] + Attempting to send results [February 9 10:28:17 UTC]
[10:28:17] - Connecting to assignment server
[10:28:19] - Successful: assigned to (171.64.65.71).
[10:28:19] + News From Folding@Home: Welcome to Folding@Home
[10:28:19] Loaded queue successfully.
[10:47:33] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[10:47:41] + Attempting to get work packet
[10:47:41] - Connecting to assignment server
[10:47:42] - Successful: assigned to (171.64.65.71).
[10:47:42] + News From Folding@Home: Welcome to Folding@Home
[10:47:42] Loaded queue successfully.
[10:47:49] + Could not connect to Work Server (results)
[10:47:49] (171.64.65.71:8080)
[10:47:49] + Retrying using alternative port
[10:47:50] + Closed connections
[10:47:50]
[10:47:50] + Processing work unit
[10:47:50] Core required: FahCore_11.exe
[10:47:50] Core found.
[10:47:50] Working on queue slot 04 [February 9 10:47:50 UTC]
[10:47:50] + Working ...
[10:47:50]
[10:47:50] *------------------------------*
[10:47:50] Folding@Home GPU Core
[10:47:50] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[10:47:50]
[10:47:50] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[10:47:50] Build host: amoeba
[10:47:50] Board Type: Nvidia
[10:47:50] Core :
[10:47:50] Preparing to commence simulation
[10:47:50] - Looking at optimizations...
[10:47:50] DeleteFrameFiles: successfully deleted file=work/wudata_04.ckp
[10:47:50] - Created dyn
[10:47:50] - Files status OK
[10:47:50] - Expanded 88679 -> 447307 (decompressed 504.4 percent)
[10:47:50] Called DecompressByteArray: compressed_data_size=88679 data_size=447307, decompressed_data_size=447307 diff=0
[10:47:50] - Digital signature verified
[10:47:50]
[10:47:50] Project: 10103 (Run 941, Clone 5, Gen 2)
[10:47:50]
[10:47:50] Assembly optimizations on if available.
[10:47:50] Entering M.D.
[10:47:56] Tpr hash work/wudata_04.tpr: 919476651 3123908192 2734029337 3753397868 900102366
[10:47:56]
[10:47:56] Calling fah_main args: 14 usage=100
[10:47:56]
[10:47:57] Working on p10103_lambda_370K
[10:47:59] Client config found, loading data.
[10:47:59] Starting GUI Server
[10:48:00] - Couldn't send HTTP request to server
[10:48:00] + Could not connect to Work Server (results)
[10:48:00] (171.64.65.71:80)
[10:48:00] - Error: Could not transmit unit 02 (completed February 9) to work server.
[10:48:00] - Read packet limit of 540015616... Set to 524286976.
[10:48:00] + Attempting to send results [February 9 10:48:00 UTC]
[10:48:54] Completed 1%
[10:49:49] Completed 2%
[10:50:44] Completed 3%
[10:51:39] Completed 4%
[10:52:34] Completed 5%
[10:53:30] Completed 6%
[10:54:25] Completed 7%
[10:55:20] Completed 8%
[10:56:15] Completed 9%
[10:57:10] Completed 10%
[10:58:05] Completed 11%
[10:59:00] Completed 12%
[10:59:55] Completed 13%
[11:00:50] Completed 14%
[11:01:46] Completed 15%
[11:02:41] Completed 16%
[11:03:36] Completed 17%
[11:04:31] Completed 18%
[11:05:26] Completed 19%
[11:06:21] Completed 20%
[11:07:16] Completed 21%
[11:08:11] Completed 22%
[11:09:07] Completed 23%
[11:10:02] Completed 24%
[11:10:57] Completed 25%
[11:11:52] Completed 26%
[11:12:47] Completed 27%
[11:13:42] Completed 28%
[11:14:37] Completed 29%
[11:15:32] Completed 30%
[11:16:27] Completed 31%
[11:17:23] Completed 32%
[11:18:18] Completed 33%
[11:19:13] Completed 34%
[11:20:08] Completed 35%
[11:21:03] Completed 36%
[11:21:58] Completed 37%
[11:22:53] Completed 38%
[11:23:48] Completed 39%
[11:24:44] Completed 40%
[11:25:39] Completed 41%
[11:26:34] Completed 42%
[11:27:29] Completed 43%
[11:28:24] Completed 44%
[11:29:19] Completed 45%
[11:30:14] Completed 46%
[11:31:09] Completed 47%
[11:32:04] Completed 48%
[11:33:00] Completed 49%
[11:33:55] Completed 50%
[11:34:50] Completed 51%
[11:35:45] Completed 52%
[11:36:27] + Could not connect to Work Server (results)
[11:36:27] (171.67.108.26:8080)
[11:36:27] + Retrying using alternative port
[11:36:27] - Couldn't send HTTP request to server
[11:36:27] (Got status 503)
[11:36:27] + Could not connect to Work Server (results)
[11:36:27] (171.67.108.26:80)
[11:36:27] Could not transmit unit 02 to Collection server; keeping in queue.
[11:36:40] Completed 53%
[11:37:35] Completed 54%
[11:38:30] Completed 55%
[11:39:25] Completed 56%
[11:40:21] Completed 57%
[11:41:16] Completed 58%
[11:42:11] Completed 59%
[11:43:06] Completed 60%
[11:44:01] Completed 61%
[11:44:56] Completed 62%
[11:45:51] Completed 63%
[11:46:46] Completed 64%
[11:47:41] Completed 65%
[11:48:37] Completed 66%
[11:49:32] Completed 67%
[11:50:27] Completed 68%
[11:51:22] Completed 69%
[11:52:17] Completed 70%
[11:53:12] Completed 71%
[11:54:07] Completed 72%
[11:55:02] Completed 73%
[11:55:58] Completed 74%
[11:56:53] Completed 75%
[11:57:48] Completed 76%
[11:58:43] Completed 77%
[11:59:38] Completed 78%
[12:00:33] Completed 79%
[12:01:28] Completed 80%
[12:02:23] Completed 81%
[12:03:18] Completed 82%
[12:04:14] Completed 83%
[12:05:09] Completed 84%
[12:06:04] Completed 85%
[12:06:59] Completed 86%
[12:07:55] Completed 87%
[12:08:50] Completed 88%
[12:09:45] Completed 89%
[12:10:40] Completed 90%
[12:11:35] Completed 91%
[12:12:30] Completed 92%
[12:13:25] Completed 93%
[12:14:20] Completed 94%
[12:15:16] Completed 95%
[12:16:11] Completed 96%
[12:17:06] Completed 97%
[12:18:01] Completed 98%
[12:18:56] Completed 99%
[12:19:51] Completed 100%
[12:19:51] Successful run
[12:19:51] DynamicWrapper: Finished Work Unit: sleep=10000
[12:20:01] Reserved 101248 bytes for xtc file; Cosm status=0
[12:20:01] Allocated 101248 bytes for xtc file
[12:20:01] - Reading up to 101248 from "work/wudata_04.xtc": Read 101248
[12:20:01] Read 101248 bytes from xtc file; available packet space=786329216
[12:20:01] xtc file hash check passed.
[12:20:01] Reserved 30216 30216 786329216 bytes for arc file=<work/wudata_04.trr> Cosm status=0
[12:20:01] Allocated 30216 bytes for arc file
[12:20:01] - Reading up to 30216 from "work/wudata_04.trr": Read 30216
[12:20:01] Read 30216 bytes from arc file; available packet space=786299000
[12:20:01] trr file hash check passed.
[12:20:01] Allocated 560 bytes for edr file
[12:20:01] Read bedfile
[12:20:01] edr file hash check passed.
[12:20:01] Logfile not read.
[12:20:01] GuardedRun: success in DynamicWrapper
[12:20:01] GuardedRun: done
[12:20:01] Run: GuardedRun completed.
[12:20:06] + Opened results file
[12:20:06] - Writing 132536 bytes of core data to disk...
[12:20:06] Done: 132024 -> 131566 (compressed to 99.6 percent)
[12:20:06] ... Done.
[12:20:06] DeleteFrameFiles: successfully deleted file=work/wudata_04.ckp
[12:20:06] Shutting down core
[12:20:06]
[12:20:06] Folding@home Core Shutdown: FINISHED_UNIT
[12:20:08] CoreStatus = 64 (100)
[12:20:08] Sending work to server
[12:20:08] Project: 10103 (Run 941, Clone 5, Gen 2)
[12:20:08] - Read packet limit of 540015616... Set to 524286976.
[12:20:08] + Attempting to send results [February 9 12:20:08 UTC]
[12:53:25] - Unknown packet returned from server, expected ACK for results
[12:53:25] - Error: Could not transmit unit 04 (completed February 9) to work server.
[12:53:25] Keeping unit 04 in queue.
[12:53:25] Project: 10103 (Run 835, Clone 4, Gen 2)
[12:53:25] - Read packet limit of 540015616... Set to 524286976.
[12:53:25] + Attempting to send results [February 9 12:53:25 UTC]
[12:53:33] - Couldn't send HTTP request to server
[12:53:33] + Could not connect to Work Server (results)
[12:53:33] (171.64.65.71:8080)
[12:53:33] + Retrying using alternative port
[12:53:51] - Couldn't send HTTP request to server
[12:53:51] + Could not connect to Work Server (results)
[12:53:51] (171.64.65.71:80)
[12:53:51] - Error: Could not transmit unit 02 (completed February 9) to work server.
[12:53:51] - Read packet limit of 540015616... Set to 524286976.
[12:53:51] + Attempting to send results [February 9 12:53:51 UTC]
[13:23:57] + Could not connect to Work Server (results)
[13:23:57] (171.67.108.26:8080)
[13:23:57] + Retrying using alternative port
[13:23:59] - Couldn't send HTTP request to server
[13:23:59] (Got status 503)
[13:23:59] + Could not connect to Work Server (results)
[13:23:59] (171.67.108.26:80)
[13:23:59] Could not transmit unit 02 to Collection server; keeping in queue.
[13:23:59] Project: 10103 (Run 941, Clone 5, Gen 2)
[13:23:59] - Read packet limit of 540015616... Set to 524286976.
[13:23:59] + Attempting to send results [February 9 13:23:59 UTC]
[13:24:03] - Server has already received unit.
[13:24:03] - Preparing to get new work unit...
[13:24:03] + Attempting to get work packet
[13:24:03] - Connecting to assignment server
[13:24:04] - Successful: assigned to (171.67.108.11).
[13:24:04] + News From Folding@Home: Welcome to Folding@Home
[13:24:04] Loaded queue successfully.
[13:24:06] Project: 10103 (Run 835, Clone 4, Gen 2)
[13:24:06] - Read packet limit of 540015616... Set to 524286976.
[13:24:06] + Attempting to send results [February 9 13:24:06 UTC]
[13:27:48] - Couldn't send HTTP request to server
[13:27:48] + Could not connect to Work Server (results)
[13:27:48] (171.64.65.71:8080)
[13:27:48] + Retrying using alternative port
[13:27:56] - Couldn't send HTTP request to server
[13:27:56] + Could not connect to Work Server (results)
[13:27:56] (171.64.65.71:80)
[13:27:56] - Error: Could not transmit unit 02 (completed February 9) to work server.
[13:27:56] - Read packet limit of 540015616... Set to 524286976.
[13:27:56] + Attempting to send results [February 9 13:27:56 UTC]
Frieden
Re: 171.64.65.71 accepting... but
Posted: Tue Feb 09, 2010 12:48 pm
by jevans64
Same thing here on the 8th and 9th. Problems sending and getting work from 171.64.65.71. I have had to reset some of my clients because they are hanging at this point... They send/grab work right away after being stopped and re-started.
[11:57:12] + Attempting to send results [February 9 11:57:12 UTC]
[11:57:12] - Reading file work/wuresults_09.dat from core
[11:57:12] (Read 132064 bytes from disk)
[11:57:12] Connecting to
http://171.64.65.71:8080/
[12:05:39] Posted data.
They don't spit out the usual
Initial: xxxx line after that. My read packets have been reported as limit 540... set to 524... for as long as I can remember.
Also have a case here where it didn't even attempt to send a queued unit...
[12:10:30] + Attempting to send results [February 9 12:10:30 UTC]
[12:10:30] - Will indicate memory of 2047 MB
[12:10:30] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 11
[12:10:30] - Connecting to assignment server
[12:10:30] Connecting to
http://assign-GPU.stanford.edu:8080/
[12:10:30] - Reading file work/wuresults_04.dat from core
[12:10:30] (Read 132444 bytes from disk)
[12:10:30] Connecting to
http://171.64.65.71:8080/
[12:10:31] Posted data.
[12:10:31] Initial: 43AB; - Successful: assigned to (171.67.108.21).
[12:10:31] + News From Folding@Home: Welcome to Folding@Home
[12:10:31] Loaded queue successfully.
[12:10:31] Connecting to
http://171.67.108.21:8080/
[12:10:31] Posted data.
[12:10:31] Initial: 0000; - Receiving payload (expected size: 65498)
[12:10:32] - Downloaded at ~63 kB/s
[12:10:32] - Averaged speed for that direction ~90 kB/s
[12:10:32] + Received work.
[12:10:32] + Closed connections
And just a Posted Data line here...
[12:17:38] Completed 6%
[12:18:47] Completed 7%
[12:18:57] Posted data.
[12:19:57] Completed 8%
[12:21:06] Completed 9%
And the Initial: xxx line 20 minutes later...
[12:37:18] Completed 23%
[12:38:27] Completed 24%
[12:38:57] Initial: 00FA; Completed 25%
[12:40:46] Completed 26%
[12:41:55] Completed 27%