171.67.108.33 & 171.67.108.26 (cant submit completed work)

Moderators: Site Moderators, FAHC Science Team

nogginthenog
Posts: 29
Joined: Mon Nov 28, 2011 3:42 pm

171.67.108.33 & 171.67.108.26 (cant submit completed work)

Post by nogginthenog »

3 machines running the 6.23 systray client on XP, plain Gromacs core, have had trouble submitting finished work for a couple of days. Here's a sample verbose log.

Code:


--- Opening Log file [March 16 12:52:50 UTC] 


# Windows CPU Systray Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files\Folding@home\Folding@home-x86
Arguments: -verbosity 9 

[12:52:50] - Ask before connecting: No
[12:52:50] - User name: **********************
[12:52:50] - User ID: *******************
[12:52:50] - Machine ID: 1
[12:52:50] 
[12:52:51] Loaded queue successfully.
[12:52:51] Initialization complete
[12:52:51] 
[12:52:51] + Processing work unit
[12:52:51] Core required: FahCore_78.exe
[12:52:51] Core found.
[12:52:51] - Autosending finished units... [March 16 12:52:51 UTC]
[12:52:51] Trying to send all finished work units
[12:52:51] Project: 6881 (Run 604, Clone 13, Gen 362)


[12:52:51] + Attempting to send results [March 16 12:52:51 UTC]
[12:52:51] - Reading file work/wuresults_02.dat from core
[12:52:51]   (Read 560849 bytes from disk)
[12:52:51] Connecting to http://171.67.108.33:8080/
[12:52:51] Working on queue slot 03 [March 16 12:52:51 UTC]
[12:52:51] + Working ...
[12:52:51] - Calling '.\FahCore_78.exe -dir work/ -suffix 03 -checkpoint 9 -verbose -lifeline 5268 -version 623'

[12:52:51] 
[12:52:51] *------------------------------*
[12:52:51] Folding@Home Gromacs Core
[12:52:51] Version 1.90 (March 8, 2006)
[12:52:51] 
[12:52:51] Preparing to commence simulation
[12:52:51] - Looking at optimizations...
[12:52:51] - Files status OK
[12:52:52] - Expanded 375221 -> 1805564 (decompressed 481.2 percent)
[12:52:52] 
[12:52:52] Project: 6888 (Run 792, Clone 4, Gen 91)
[12:52:52] 
[12:52:52] Assembly optimizations on if available.
[12:52:52] Entering M.D.
[12:52:52] - Couldn't send HTTP request to server
[12:52:52] + Could not connect to Work Server (results)
[12:52:52]     (171.67.108.33:8080)
[12:52:52] + Retrying using alternative port
[12:52:52] Connecting to http://171.67.108.33:80/
[12:52:54] - Couldn't send HTTP request to server
[12:52:54] + Could not connect to Work Server (results)
[12:52:54]     (171.67.108.33:80)
[12:52:54] - Error: Could not transmit unit 02 (completed March 14) to work server.
[12:52:54] - 13 failed uploads of this unit.


[12:52:54] + Attempting to send results [March 16 12:52:54 UTC]
[12:52:54] - Reading file work/wuresults_02.dat from core
[12:52:54]   (Read 560849 bytes from disk)
[12:52:54] Connecting to http://171.67.108.26:8080/
[12:53:07] - Couldn't send HTTP request to server
[12:53:07] + Could not connect to Work Server (results)
[12:53:07]     (171.67.108.26:8080)
[12:53:07] + Retrying using alternative port
[12:53:07] Connecting to http://171.67.108.26:80/
[12:53:08] - Couldn't send HTTP request to server
[12:53:08] + Could not connect to Work Server (results)
[12:53:08]     (171.67.108.26:80)
[12:53:08]   Could not transmit unit 02 to Collection server; keeping in queue.
[12:53:08] + Sent 0 of 1 completed units to the server
[12:53:08] - Autosend completed
[12:53:13] (Starting from checkpoint)
[12:53:13] Protein: ALZHEIMER DISEASE AMYLOID
[12:53:13] 
[12:53:13] Writing local files
[12:53:57] Completed 217500 out of 250000 steps  (87%)
[12:53:58] Extra SSE boost OK.
[13:02:59] Timered checkpoint triggered.
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by bruce »

I hope you started with the sticky topic in this forum: Subject: Troubleshooting Server Connectivity Issues (Do This First)

One of the first things you should have checked is the serverstat page. Based on my reading of the history of that page, 171.67.108.33 has been having troubles since Tue Mar 13 13:00:10 PST 2012. I suspect your report is the first one to arrive here on the forum (at least it's the first one I've seen). I'll notify the owner of that server. Hopefully they just haven't noticed it's down rather than it being a major problem that they have been working on all this time.
nogginthenog
Posts: 29
Joined: Mon Nov 28, 2011 3:42 pm

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by nogginthenog »

I did indeed read the DoItFirst page, hence my verbose log snippet. That serverstats page has way too much info for me!
On one PC, the last successful submission was at [March 11 20:40:30 UTC], then it failed at [March 14 10:18:27 UTC]. I let it restart a few times in case it was a temporary problem, but thought I'd report it when it started saying it had failed 13 times!
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by bruce »

viewtopic.php?f=24&t=21045

171.67.108.33 is also known as vsp05c

171.67.108.26 is the collection server for GPU projects and it's one of those collection servers that still needs to have its software upgraded.
ejsanyo
Posts: 20
Joined: Sat Mar 17, 2012 11:03 am

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by ejsanyo »

Same problem with those servers.

Code:

--- Opening Log file [March 17 19:18:30 UTC] 

# Windows CPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: R:\Folding
Executable: R:\Folding\Folding@home-Win32-x86.exe


[19:18:30] - Ask before connecting: No
[19:18:30] - User name: EJSanYo (Team 47191)
[19:18:30] - User ID: 7B0ABA767BE80576
[19:18:30] - Machine ID: 1
[19:18:30] 
[19:18:30] Loaded queue successfully.
[19:18:30] 
[19:18:30] + Processing work unit
[19:18:30] Core required: FahCore_78.exe
[19:18:30] Core found.
[19:18:30] Project: 6883 (Run 360, Clone 3, Gen 361)
[19:18:30] - Read packet limit of 540015616... Set to 524286976.


[19:18:30] + Attempting to send results [March 17 19:18:30 UTC]
[19:18:30] Working on queue slot 02 [March 17 19:18:30 UTC]
[19:18:30] + Working ...
[19:18:30] 
[19:18:30] *------------------------------*
[19:18:30] Folding@Home Gromacs Core
[19:18:30] Version 1.90 (March 8, 2006)
[19:18:30] 
[19:18:30] Preparing to commence simulation
[19:18:30] - Looking at optimizations...
[19:18:30] - Files status OK
[19:18:31] - Expanded 668756 -> 3308516 (decompressed 494.7 percent)
[19:18:32] 
[19:18:32] Project: 6894 (Run 943, Clone 2, Gen 28)
[19:18:32] 
[19:18:32] Assembly optimizations on if available.
[19:18:32] Entering M.D.
[19:18:32] - Couldn't send HTTP request to server
[19:18:32] + Could not connect to Work Server (results)
[19:18:32]     (171.67.108.33:8080)
[19:18:32] + Retrying using alternative port
[19:18:34] - Couldn't send HTTP request to server
[19:18:34] + Could not connect to Work Server (results)
[19:18:34]     (171.67.108.33:80)
[19:18:34] - Error: Could not transmit unit 01 (completed March 15) to work server.
[19:18:34] - Read packet limit of 540015616... Set to 524286976.


[19:18:34] + Attempting to send results [March 17 19:18:34 UTC]
[19:18:35] - Couldn't send HTTP request to server
[19:18:35] + Could not connect to Work Server (results)
[19:18:35]     (171.67.108.26:8080)
[19:18:35] + Retrying using alternative port
[19:18:37] - Couldn't send HTTP request to server
[19:18:37] + Could not connect to Work Server (results)
[19:18:37]     (171.67.108.26:80)
[19:18:37]   Could not transmit unit 01 to Collection server; keeping in queue.
[19:18:52] (Starting from checkpoint)
[19:18:52] Protein: ALZHEIMER DISEASE AMYLOID
[19:18:52] 
[19:18:52] Writing local files
[19:22:48] Completed 212500 out of 250000 steps  (85%)
[19:22:49] Extra SSE boost OK.
The best material model of a cat is another, or preferably the same, cat. ©Norbert Wiener
nogginthenog
Posts: 29
Joined: Mon Nov 28, 2011 3:42 pm

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by nogginthenog »

bruce wrote:http://foldingforum.org/viewtopic.php?f=24&t=21045

171.67.108.33 is also known as vsp05c

171.67.108.26 is the collection server for GPU projects and it's one of those collection servers that still needs to have its software upgraded.

That's odd, the PCs that are having this problem are AFAIK just running on the CPU: 2 x Dell C610 1 GHz laptops and 1 Sempron 2200 tower.
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by bruce »

Sorry: typing error.

Some of the collection servers that still need to have their software upgraded are running CPU-based projects (I think just uniprocessor projects) and others are running GPU-based projects.

I'm seeing a similar problem with a couple of my uniprocessor machines. I'm running the latest V7 beta client, so there's probably nothing you can change to bypass the problem until the hardware is repaired. Maybe it's a coincidence, but I only see the problem on the machines with WinXP.
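
To confirm from your own machine that the fault is on the server side and not your client, firewall, or router, a plain TCP check against the same two ports the client tries (8080 first, then 80) can help. The following is only a sketch: the `probe` helper is a made-up name, and it checks nothing beyond whether a TCP connection opens. A server that accepts the connection may still refuse uploads, so the serverstat page remains the authority.

```python
import socket


def probe(host, ports=(8080, 80), timeout=5.0):
    """Return {port: True/False} for whether a TCP connection could be opened.

    The default ports mirror the order seen in the client logs:
    8080 first, then the alternative port 80.
    """
    results = {}
    for port in ports:
        try:
            # create_connection raises OSError on refusal, timeout, or no route.
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            results[port] = False
    return results


if __name__ == "__main__":
    for server in ("171.67.108.33", "171.67.108.26"):
        print(server, probe(server))
```

If both ports come back False for these addresses while other sites load normally, the problem is almost certainly at the server end, and the finished unit will simply stay in the queue until the server returns.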
rhysmacs
Posts: 1
Joined: Sat Mar 24, 2012 6:59 pm

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by rhysmacs »

I have the same issue with the same servers. According to the server stats page, both servers that the client tries to send the finished work unit to are either not accepting or rejecting.
I am running v5.04b on PC-BSD 8.2 on a Dell Inspiron 1000 laptop with a Mobile Intel Celeron 2.20 GHz CPU.
I am hoping someone fixes the servers so that they can accept the finished WU.
I am assuming this would resolve the issue.
However, new WUs are being worked on in the meantime. :-)

[19:12:40] - Couldn't send HTTP request to server
[19:12:40] + Could not connect to Work Server (results)
[19:12:40] (171.67.108.33:8080)
[19:12:40] - Error: Could not transmit unit 02 (completed March 14) to work server.


[19:12:40] + Attempting to send results
[19:12:40] - Couldn't send HTTP request to server
[19:12:40] + Could not connect to Work Server (results)
[19:12:40] (171.67.108.26:8080)
[19:12:40] Could not transmit unit 02 to Collection server; keeping in queue.
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 171.67.108.33 & 171.67.108.26 (cant submit completed wor

Post by bruce »

Welcome to foldingforum.org, rhysmacs.

Please note that the top topic in this forum is "DO THIS FIRST" and it would have narrowed down the answers for you.

Server 171.67.108.33 shows REJECT on the serverstat page. Of course you can't upload to it when it's not working. Server 171.67.108.26 is a Collection Server, and that's also mentioned in the DO THIS FIRST topic as well as my previous post.

There is an official announcement about 171.67.108.33: viewtopic.php?f=24&t=21045