Project 11294 (R3,C325,G14) [Resolved]

Moderators: Site Moderators, FAHC Science Team

Locked
PerryC
Posts: 3
Joined: Sat Oct 22, 2011 1:17 pm

Project 11294 (R3,C325,G14) [Resolved]

Post by PerryC »

My folding pc has been trying upload this unit around 9-10 hours now. Spent some time digging around trying to find a solution. The only thing I have changed since I noticed the issue was upgrade the client from v7.1.24 to the most recent, unfortunately I forgot to save the log from the old client before the upgrade, but the WU was completed, just can't return it.


Image

Image

Code: Select all

*********************** Log Started 2011-10-22T12:36:36 ************************
12:36:36:************************* Folding@home Client *************************
12:36:36:      Website: http://folding.stanford.edu/
12:36:36:    Copyright: (c) 2009-2011 Stanford University
12:36:36:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
12:36:36:         Args: --lifeline 2604 --command-port=36330
12:36:36:       Config: C:/Documents and Settings/Perry.PERRY-03C7DDE4D/Application
12:36:36:               Data/FAHClient/config.xml
12:36:36:******************************** Build ********************************
12:36:36:      Version: 7.1.38
12:36:36:         Date: Oct 6 2011
12:36:36:         Time: 19:57:04
12:36:36:      SVN Rev: 3080
12:36:36:       Branch: fah/trunk/client
12:36:36:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
12:36:36:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
12:36:36:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
12:36:36:     Platform: win32 XP
12:36:36:         Bits: 32
12:36:36:         Mode: Release
12:36:36:******************************* System ********************************
12:36:36:          CPU: Intel(R) Core(TM)2 CPU 6300 @ 1.86GHz
12:36:36:       CPU ID: GenuineIntel Family 6 Model 15 Stepping 6
12:36:36:         CPUs: 2
12:36:36:       Memory: 3.00GiB
12:36:36:  Free Memory: 2.39GiB
12:36:36:      Threads: WINDOWS_THREADS
12:36:36:   On Battery: false
12:36:36:   UTC offset: -3
12:36:36:          PID: 268
12:36:36:          CWD: C:/Documents and Settings/Perry.PERRY-03C7DDE4D/Application
12:36:36:               Data/FAHClient
12:36:36:           OS: Microsoft Windows XP Service Pack 3
12:36:36:      OS Arch: X86
12:36:36:         GPUs: 1
12:36:36:        GPU 0: ATI:4 Juniper [Radeon HD 5750 Series]
12:36:36:         CUDA: Not detected
12:36:36:Win32 Service: false
12:36:36:***********************************************************************
12:36:36:<config>
12:36:36:  <!-- Folding Slot Configuration -->
12:36:36:  <gpu v='true'/>
12:36:36:
12:36:36:  <!-- Network -->
12:36:36:  <proxy v=':8080'/>
12:36:36:
12:36:36:  <!-- User Information -->
12:36:36:  <passkey v='********************************'/>
12:36:36:  <team v='54196'/>
12:36:36:  <user v='PerryC'/>
12:36:36:
12:36:36:  <!-- Folding Slots -->
12:36:36:  <slot id='0' type='GPU'>
12:36:36:    <client-type v='advanced'/>
12:36:36:  </slot>
12:36:36:</config>
12:36:36:Trying to access database...
12:36:37:Upgrading database schema from version 9 to 10
12:36:37:Successfully acquired database lock
12:36:37:Enabled folding slot 00: READY gpu:0:"Juniper [Radeon HD 5750 Series]"
12:36:37:Downloading project 11293 description
12:36:37:Connecting to fah-web.stanford.edu:80
12:36:37:Sending unit results: id:00 state:SEND error:OK project:11294 run:3 clone:325 gen:14 core:0x16 unit:0x000000160a3b1e5c4d9a1c7df2a97f80
12:36:37:Unit 00: Uploading 2.38MiB to 171.64.65.56
12:36:37:Starting Unit 02
12:36:37:Connecting to 171.64.65.56:8080
12:36:37:Running core: "C:/Documents and Settings/Perry.PERRY-03C7DDE4D/Application Data/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/ATI/R600/Core_16.fah/FahCore_16.exe" -dir 02 -suffix 01 -lifeline 268 -version 701 -checkpoint 15 -gpu 0
12:36:37:Started core on PID 3216
12:36:37:FahCore 0x16 started
12:36:38:Downloading project 11294 description
12:36:38:Connecting to fah-web.stanford.edu:80
12:36:38:Unit 02:
12:36:38:Unit 02:*------------------------------*
12:36:38:Unit 02:Folding@Home GPU Core
12:36:38:Unit 02:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
12:36:38:Unit 02:
12:36:38:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
12:36:38:Unit 02:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
12:36:38:Unit 02:Build host: user-f6d030f24f
12:36:38:Unit 02:Board Type: AMD/OpenCL
12:36:38:Unit 02:Core      : x=16
12:36:38:Unit 02: Window's signal control handler registered.
12:36:38:Unit 02:Preparing to commence simulation
12:36:38:Unit 02:- Ensuring status. Please wait.
12:36:47:Unit 02:- Looking at optimizations...
12:36:47:Unit 02:- Working with standard loops on this execution.
12:36:47:Unit 02:- Previous termination of core was improper.
12:36:47:Unit 02:- Files status OK
12:36:47:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
12:36:47:Unit 02:- Expanded 42498 -> 171163 (decompressed 402.7 percent)
12:36:47:Unit 02:Called DecompressByteArray: compressed_data_size=42498 data_size=171163, decompressed_data_size=171163 diff=0
12:36:47:Unit 02:- Digital signature verified
12:36:47:Unit 02:
12:36:47:Unit 02:Project: 11293 (Run 27, Clone 161, Gen 0)
12:36:47:Unit 02:
12:36:47:Unit 02:Entering M.D.
12:36:49:Unit 02:Will resume from checkpoint file 02/wudata_01.ckp
12:36:49:Unit 02:Tpr hash 02/wudata_01.tpr:  2928458515 2167640363 33375329 4042530662 3748155633
12:36:49:Unit 02:Working on ALZHEIMER DISEASE AMYLOID
12:36:49:Unit 02:Client config unavailable.
12:36:49:Unit 02:Starting GUI Server
12:36:55:Unit 02:Resuming from checkpoint
12:36:55:Unit 02:fcCheckPointResume: retreived and current tpr file hash:
12:36:55:Unit 02:   0   2928458515   2928458515
12:36:55:Unit 02:   1   2167640363   2167640363
12:36:55:Unit 02:   2     33375329     33375329
12:36:55:Unit 02:   3   4042530662   4042530662
12:36:55:Unit 02:   4   3748155633   3748155633
12:36:55:Unit 02:fcCheckPointResume: file hashes same.
12:36:55:Unit 02:fcCheckPointResume: state restored.
12:36:55:Unit 02:fcCheckPointResume: name 02/wudata_01.log Verified 02/wudata_01.log
12:36:55:Unit 02:fcCheckPointResume: name 02/wudata_01.trr Verified 02/wudata_01.trr
12:36:55:Unit 02:fcCheckPointResume: name 02/wudata_01.xtc Verified 02/wudata_01.xtc
12:36:55:Unit 02:fcCheckPointResume: name 02/wudata_01.edr Verified 02/wudata_01.edr
12:36:55:Unit 02:fcCheckPointResume: state restored 2
12:36:55:Unit 02:Resumed from checkpoint
12:36:55:Unit 02:Setting checkpoint frequency: 500000
12:36:55:Unit 02:Completed  43000001 out of 50000000 steps (86%).
12:40:11:WARNING: Exception: Failed to send results to work server: Upload failed
12:40:11:Trying to send results to collection server
12:40:11:Unit 00: Uploading 2.38MiB to 171.67.108.26
12:40:11:Connecting to 171.67.108.26:8080
12:40:12:WARNING: WorkServer connection failed on port 8080 trying 80
12:40:12:Connecting to 171.67.108.26:80
12:40:14:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
12:40:14:Sending unit results: id:00 state:SEND error:OK project:11294 run:3 clone:325 gen:14 core:0x16 unit:0x000000160a3b1e5c4d9a1c7df2a97f80
12:40:14:Unit 00: Uploading 2.38MiB to 171.64.65.56
12:40:14:Connecting to 171.64.65.56:8080
12:41:01:Unit 02:Completed  43500000 out of 50000000 steps (87%).
12:43:48:WARNING: Exception: Failed to send results to work server: Upload failed
12:43:48:Trying to send results to collection server
12:43:48:Unit 00: Uploading 2.38MiB to 171.67.108.26
12:43:48:Connecting to 171.67.108.26:8080
12:43:49:WARNING: WorkServer connection failed on port 8080 trying 80
12:43:49:Connecting to 171.67.108.26:80
12:43:50:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
12:43:50:Sending unit results: id:00 state:SEND error:OK project:11294 run:3 clone:325 gen:14 core:0x16 unit:0x000000160a3b1e5c4d9a1c7df2a97f80
12:43:50:Unit 00: Uploading 2.38MiB to 171.64.65.56
12:43:50:Connecting to 171.64.65.56:8080
12:45:30:Unit 02:Completed  44000000 out of 50000000 steps (88%).
12:47:24:WARNING: Exception: Failed to send results to work server: Upload failed
12:47:24:Trying to send results to collection server
12:47:24:Unit 00: Uploading 2.38MiB to 171.67.108.26
12:47:24:Connecting to 171.67.108.26:8080
12:47:25:WARNING: WorkServer connection failed on port 8080 trying 80
12:47:25:Connecting to 171.67.108.26:80
12:47:26:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
12:47:26:Sending unit results: id:00 state:SEND error:OK project:11294 run:3 clone:325 gen:14 core:0x16 unit:0x000000160a3b1e5c4d9a1c7df2a97f80
12:47:26:Unit 00: Uploading 2.38MiB to 171.64.65.56
12:47:26:Connecting to 171.64.65.56:8080
12:49:59:Unit 02:Completed  44500000 out of 50000000 steps (89%).
12:51:24:WARNING: Exception: Failed to send results to work server: Upload failed
12:51:24:Trying to send results to collection server
12:51:24:Unit 00: Uploading 2.38MiB to 171.67.108.26
12:51:24:Connecting to 171.67.108.26:8080
12:51:25:WARNING: WorkServer connection failed on port 8080 trying 80
12:51:25:Connecting to 171.67.108.26:80
12:51:27:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
12:51:27:Sending unit results: id:00 state:SEND error:OK project:11294 run:3 clone:325 gen:14 core:0x16 unit:0x000000160a3b1e5c4d9a1c7df2a97f80
12:51:27:Unit 00: Uploading 2.38MiB to 171.64.65.56
12:51:27:Connecting to 171.64.65.56:8080
12:51:48:WARNING: WorkServer connection failed on port 8080 trying 80
12:51:48:Connecting to 171.64.65.56:80
12:52:09:WARNING: Exception: Failed to send results to work server: Failed to connect to 171.64.65.56:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
12:52:09:Trying to send results to collection server
12:52:09:Unit 00: Uploading 2.38MiB to 171.67.108.26
12:52:09:Connecting to 171.67.108.26:8080
12:52:10:WARNING: WorkServer connection failed on port 8080 trying 80
12:52:10:Connecting to 171.67.108.26:80
12:52:11:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
12:54:32:Unit 02:Completed  45000000 out of 50000000 steps (90%).
12:55:41:Sending unit results: id:00 state:SEND error:OK project:11294 run:3 clone:325 gen:14 core:0x16 unit:0x000000160a3b1e5c4d9a1c7df2a97f80
12:55:41:Unit 00: Uploading 2.38MiB to 171.64.65.56
12:55:41:Connecting to 171.64.65.56:8080
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Project 11294 (R3,C325,G14)

Post by sortofageek »

Hello, Perry. Welcome to Folding Forum. :)

Looking at the Server Stats page, the work server for that WU (161.64.65.56) has a high CPU load right now, which could explain your problem uploading. As long as you are continuing to get work, just let it keep trying. The client will do that periodically for you and the WU will go home when the server can accept it.

Server Stats page is here: http://fah-web.stanford.edu/serverstat.html
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Project 11294 (R3,C325,G14)

Post by sortofageek »

12:36:47:Unit 02:- Looking at optimizations...
12:36:47:Unit 02:- Working with standard loops on this execution.
12:36:47:Unit 02:- Previous termination of core was improper.
12:36:47:Unit 02:- Files status OK
It looks like you had a rough shutdown and are now folding without optimizations. I think I would do a proper shutdown/restart to see if all is well with optimizations again after a smooth start.
PerryC
Posts: 3
Joined: Sat Oct 22, 2011 1:17 pm

Re: Project 11294 (R3,C325,G14)

Post by PerryC »

I saw that in the log, but it didn't appear until after I upgraded the client so that might have been shutting the client down improperly for that.

The other WU in the screen shots uploaded no problem, and I am now about 25% through the next. I'll let it go till the expiry date and see if it uploads between now and then.
Last edited by PerryC on Sat Oct 22, 2011 3:38 pm, edited 1 time in total.
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Project 11294 (R3,C325,G14)

Post by sortofageek »

In regard to the server, you may also want to follow this topic ---> viewtopic.php?f=18&t=19874&p=197713#p197713
PerryC
Posts: 3
Joined: Sat Oct 22, 2011 1:17 pm

Re: Project 11294 (R3,C325,G14)

Post by PerryC »

Does it matter if the collection server for that WU shows as "standby" and "not accepting"?
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Project 11294 (R3,C325,G14)

Post by sortofageek »

PerryC wrote:Does it matter if the collection server for that WU shows as "standby" and "not accepting"?

Please see my post about the collection server in that other topic I mentioned: viewtopic.php?f=18&t=19874&p=197716#p197716

To keep the discussion on this issue together, let's close this topic now and follow up on the overloaded work server in that other topic ---> viewtopic.php?f=18&t=19874&start=0
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Project 11294 (R3,C325,G14) [Resolved]

Post by sortofageek »

The server issue has been resolved. See Professor Pande's update here ---> http://foldingforum.org/viewtopic.php?f ... 83#p197783
Locked