FAH stopped folding

Konan
Posts: 12
Joined: Thu Mar 05, 2015 5:56 pm

FAH stopped folding

Post by Konan »

I came into work after the weekend and my client seems to have stopped folding. I have rebooted the machine and tried pausing and restarting it, but it still seems to be stuck. I have included my log file from after I restarted the machine. I am looking for any suggestions as to what may have happened and what I need to do to get it working again.

Thanks
Joel

*********************** Log Started 2015-03-23T12:08:13Z ***********************
12:08:13:************************* Folding@home Client *************************
12:08:13:      Website: http://folding.stanford.edu/
12:08:13:    Copyright: (c) 2009-2014 Stanford University
12:08:13:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
12:08:13:         Args: 
12:08:13:       Config: C:/Users/jkohn/AppData/Roaming/FAHClient/config.xml
12:08:13:******************************** Build ********************************
12:08:13:      Version: 7.4.4
12:08:13:         Date: Mar 4 2014
12:08:13:         Time: 20:26:54
12:08:13:      SVN Rev: 4130
12:08:13:       Branch: fah/trunk/client
12:08:13:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
12:08:13:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
12:08:13:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
12:08:13:     Platform: win32 XP
12:08:13:         Bits: 32
12:08:13:         Mode: Release
12:08:13:******************************* System ********************************
12:08:13:          CPU: Intel(R) Core(TM) i5-4570S CPU @ 2.90GHz
12:08:13:       CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
12:08:13:         CPUs: 4
12:08:13:       Memory: 7.95GiB
12:08:13:  Free Memory: 6.86GiB
12:08:13:      Threads: WINDOWS_THREADS
12:08:13:   OS Version: 6.1
12:08:13:  Has Battery: false
12:08:13:   On Battery: false
12:08:13:   UTC Offset: -5
12:08:13:          PID: 3776
12:08:13:          CWD: C:/Users/jkohn/AppData/Roaming/FAHClient
12:08:13:           OS: Windows 7 Professional
12:08:13:      OS Arch: AMD64
12:08:13:         GPUs: 1
12:08:13:        GPU 0: NVIDIA:2 GF119 [GeForce GT 610]
12:08:13:         CUDA: 2.1
12:08:13:  CUDA Driver: 6050
12:08:13:Win32 Service: false
12:08:13:***********************************************************************
12:08:13:<config>
12:08:13:  <!-- Network -->
12:08:13:  <proxy v=':8080'/>
12:08:13:
12:08:13:  <!-- Slot Control -->
12:08:13:  <power v='full'/>
12:08:13:
12:08:13:  <!-- User Information -->
12:08:13:  <passkey v='********************************'/>
12:08:13:  <team v='227768'/>
12:08:13:  <user v='CrazyCruncher'/>
12:08:13:
12:08:13:  <!-- Folding Slots -->
12:08:13:  <slot id='0' type='CPU'/>
12:08:13:  <slot id='1' type='GPU'/>
12:08:13:</config>
12:08:13:Trying to access database...
12:08:13:Successfully acquired database lock
12:08:13:Enabled folding slot 00: READY cpu:3
12:08:13:Enabled folding slot 01: READY gpu:0:GF119 [GeForce GT 610]
12:08:13:WU00:FS01:Starting
12:08:13:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/jkohn/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 00 -suffix 01 -version 704 -lifeline 3776 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
12:08:13:WU00:FS01:Started FahCore on PID 3576
12:08:14:WU00:FS01:Core PID:1496
12:08:14:WU00:FS01:FahCore 0x18 started
12:08:14:WU01:FS00:Connecting to 171.67.108.200:8080
12:08:14:WU00:FS01:0x18:*********************** Log Started 2015-03-23T12:08:14Z ***********************
12:08:14:WU00:FS01:0x18:Project: 10477 (Run 1, Clone 30, Gen 13)
12:08:14:WU00:FS01:0x18:Unit: 0x00000019538b3dba548b264208a79362
12:08:14:WU00:FS01:0x18:CPU: 0x00000000000000000000000000000000
12:08:14:WU00:FS01:0x18:Machine: 1
12:08:14:WU00:FS01:0x18:Digital signatures verified
12:08:14:WU00:FS01:0x18:Folding@home GPU core18
12:08:14:WU00:FS01:0x18:Version 0.0.3
12:08:15:WU00:FS01:0x18:  Found a checkpoint file
12:08:15:WU01:FS00:Assigned to work server 171.64.65.124
12:08:15:WU01:FS00:Requesting new work unit for slot 00: READY cpu:3 from 171.64.65.124
12:08:15:WU01:FS00:Connecting to 171.64.65.124:8080
12:08:17:WU01:FS00:Downloading 902.13KiB
12:08:37:WU00:FS01:0x18:Completed 1500000 out of 5000000 steps (30%)
12:08:37:WU00:FS01:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
12:11:11:FS01:Paused
12:11:11:FS01:Shutting core down
12:11:11:WU00:FS01:0x18:WARNING:Console control signal 1 on PID 1496
12:11:11:WU00:FS01:0x18:Exiting, please wait. . .
12:11:13:WU00:FS01:0x18:Lost lifeline PID 3576, exiting
12:11:13:WU00:FS01:0x18:ERROR:103: Lost client lifeline
12:11:13:WU00:FS01:0x18:Folding@home Core Shutdown: CLIENT_DIED
12:11:13:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
12:11:16:Removing old file 'configs/config-20150317-120812.xml'
12:11:16:Saving configuration to config.xml
12:11:16:<config>
12:11:16:  <!-- Network -->
12:11:16:  <proxy v=':8080'/>
12:11:16:
12:11:16:  <!-- Slot Control -->
12:11:16:  <power v='full'/>
12:11:16:
12:11:16:  <!-- User Information -->
12:11:16:  <passkey v='********************************'/>
12:11:16:  <team v='227768'/>
12:11:16:  <user v='CrazyCruncher'/>
12:11:16:
12:11:16:  <!-- Folding Slots -->
12:11:16:  <slot id='0' type='CPU'/>
12:11:16:  <slot id='1' type='GPU'>
12:11:16:    <paused v='true'/>
12:11:16:  </slot>
12:11:16:</config>
12:21:25:FS01:Unpaused
12:21:25:WU00:FS01:Starting
12:21:25:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/jkohn/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 00 -suffix 01 -version 704 -lifeline 3776 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
12:21:25:WU00:FS01:Started FahCore on PID 6004
12:21:25:WU00:FS01:Core PID:5760
12:21:25:WU00:FS01:FahCore 0x18 started
12:21:25:WU00:FS01:0x18:*********************** Log Started 2015-03-23T12:21:25Z ***********************
12:21:25:WU00:FS01:0x18:Project: 10477 (Run 1, Clone 30, Gen 13)
12:21:25:WU00:FS01:0x18:Unit: 0x00000019538b3dba548b264208a79362
12:21:25:WU00:FS01:0x18:CPU: 0x00000000000000000000000000000000
12:21:25:WU00:FS01:0x18:Machine: 1
12:21:25:WU00:FS01:0x18:Digital signatures verified
12:21:25:WU00:FS01:0x18:Folding@home GPU core18
12:21:25:WU00:FS01:0x18:Version 0.0.3
12:21:25:WU00:FS01:0x18:  Found a checkpoint file
12:21:26:Removing old file 'configs/config-20150317-170100.xml'
12:21:26:Saving configuration to config.xml
12:21:26:<config>
12:21:26:  <!-- Network -->
12:21:26:  <proxy v=':8080'/>
12:21:26:
12:21:26:  <!-- Slot Control -->
12:21:26:  <power v='full'/>
12:21:26:
12:21:26:  <!-- User Information -->
12:21:26:  <passkey v='********************************'/>
12:21:26:  <team v='227768'/>
12:21:26:  <user v='CrazyCruncher'/>
12:21:26:
12:21:26:  <!-- Folding Slots -->
12:21:26:  <slot id='0' type='CPU'/>
12:21:26:  <slot id='1' type='GPU'/>
12:21:26:</config>
12:21:42:FS01:Paused
12:21:42:FS01:Shutting core down
12:21:42:WU00:FS01:0x18:WARNING:Console control signal 1 on PID 5760
12:21:42:WU00:FS01:0x18:Exiting, please wait. . .
12:21:48:WU00:FS01:0x18:Completed 1500000 out of 5000000 steps (30%)
12:21:48:WU00:FS01:0x18:Lost lifeline PID 6004, exiting
12:21:48:WU00:FS01:0x18:ERROR:103: Lost client lifeline
12:21:48:WU00:FS01:0x18:Folding@home Core Shutdown: CLIENT_DIED
12:21:48:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
12:22:27:Removing old file 'configs/config-20150317-190704.xml'
12:22:27:Saving configuration to config.xml
12:22:27:<config>
12:22:27:  <!-- Network -->
12:22:27:  <proxy v=':8080'/>
12:22:27:
12:22:27:  <!-- Slot Control -->
12:22:27:  <power v='full'/>
12:22:27:
12:22:27:  <!-- User Information -->
12:22:27:  <passkey v='********************************'/>
12:22:27:  <team v='227768'/>
12:22:27:  <user v='CrazyCruncher'/>
12:22:27:
12:22:27:  <!-- Folding Slots -->
12:22:27:  <slot id='0' type='CPU'/>
12:22:27:  <slot id='1' type='GPU'>
12:22:27:    <paused v='true'/>
12:22:27:  </slot>
12:22:27:</config>
15:24:00:FS00:Paused
15:24:03:FS00:Unpaused
Konan
Posts: 12
Joined: Thu Mar 05, 2015 5:56 pm

Re: FAH stopped folding

Post by Konan »

I should probably give a little more information. The issue I am having is with the CPU slot. I have the GPU paused since I cannot work on the machine while it is crunching. In the folding slot window the client shows the CPU slot as ready, and in the work queue window it shows "Download".
Joe_H
Site Admin
Posts: 8002
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Studio M1 Max 32 GB smp6
Mac Hack i7-7700K 48 GB smp4
Location: W. MA

Re: FAH stopped folding

Post by Joe_H »

The log shows the download of a WU for your CPU slot starting, but there is no further information about it in the 15 or so minutes of log after that.

There is a known issue with the network code in the folding client: it can sometimes fail to retry a download or upload that has failed for some reason. When that happens, the client just sits there and never retries the connection. It is more commonly seen when there are network problems. The only way to get the download or upload to resume is to restart the FAHClient process, either by rebooting the system or by manually stopping and restarting that process.

So restarting FAHClient would be my first suggestion. If that takes care of it, you should be good until the next time this problem occurs. Things that can interfere with your connection to the WS include interference on a wireless connection, or heavy traffic on your network or on your connection to your ISP from streaming video or torrent sharing.
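
A stalled download like this can be spotted in a log without reading it line by line. The following is only a rough sketch in Python, not part of FAHClient: the function name find_stalled_downloads and the 10 minute threshold are made up for illustration. It assumes that a healthy download is followed by at least one more log line for the same WU and slot, and that timestamps are time-of-day only, as in the log above.

```python
import re

# Matches "HH:MM:SS:" followed by an optional "WUxx:FSyy:" prefix.
LINE_RE = re.compile(r"^(\d{2}):(\d{2}):(\d{2}):(?:(WU\d+):(FS\d+):)?(.*)$")


def to_seconds(hours, minutes, seconds):
    """Convert a time-of-day stamp to seconds since midnight."""
    return int(hours) * 3600 + int(minutes) * 60 + int(seconds)


def find_stalled_downloads(log_text, min_gap_seconds=600):
    """Return (wu, slot, gap_seconds) for downloads with no follow-up line.

    Assumption: any later line for the same WU/slot means the download
    made progress. The threshold is an arbitrary illustrative value.
    """
    pending = {}      # (wu, slot) -> time of the "Downloading" line
    last_seen = None  # time of the last timestamped line in the log
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue
        hours, minutes, seconds, wu, slot, message = match.groups()
        now = to_seconds(hours, minutes, seconds)
        last_seen = now
        if wu is None:
            continue
        key = (wu, slot)
        if message.startswith("Downloading "):
            pending[key] = now
        elif key in pending:
            del pending[key]
    stalled = []
    for (wu, slot), started in pending.items():
        # The modulo handles a log that runs past midnight once.
        gap = (last_seen - started) % 86400
        if gap >= min_gap_seconds:
            stalled.append((wu, slot, gap))
    return stalled
```

Run against the log text from the first post, this would be expected to flag WU01 on FS00, since nothing is logged for that WU after the "Downloading 902.13KiB" line. It only reports the symptom; restarting FAHClient is still the fix.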
Konan
Posts: 12
Joined: Thu Mar 05, 2015 5:56 pm

Re: FAH stopped folding

Post by Konan »

Restarted the machine and it is working now. I restarted it before and it still sat there. Must have had a networking issue when I did that.

Thanks for your help
Joel
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: FAH stopped folding

Post by bruce »

Konan wrote: I have the gpu paused since I cannot work on the machine if it is crunching.
Though this isn't the issue you asked about, I would consider configuring your GPU slot with idle=true. With that setting, the GPU should pause whenever your mouse or keyboard is active, but it should still fold overnight (even if nothing much is accomplished during the day). I would also set max-packet-size=small so that the servers assign you WUs with relatively long deadlines, which would be appropriate for your part-time GT 610 GPU.
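
For anyone curious what those two slot options might look like in config.xml, here is a hedged Python sketch that adds them to the GPU slot of a config like the one in the log above. It assumes slot options are stored as child elements of the form <name v='value'/>, the same way the log shows <paused v='true'/>. The helper name set_gpu_slot_options is made up for illustration. Normally these options would be set through the client's own configuration interface; editing the file by hand is an assumption here and would have to be done with FAHClient stopped, since the log shows the client rewriting config.xml.

```python
import xml.etree.ElementTree as ET


def set_gpu_slot_options(config_xml, options):
    """Return config XML with the given options set on every GPU slot.

    Assumption: options live inside <slot> as <name v='value'/> children,
    mirroring the <paused v='true'/> entry seen in the log.
    Note: the default parser drops XML comments such as <!-- Network -->.
    """
    root = ET.fromstring(config_xml)
    for slot in root.findall("slot"):
        if slot.get("type") != "GPU":
            continue
        for name, value in options.items():
            node = slot.find(name)
            if node is None:
                node = ET.SubElement(slot, name)
            node.set("v", value)
    return ET.tostring(root, encoding="unicode")


config = (
    "<config>"
    "<slot id='0' type='CPU'/>"
    "<slot id='1' type='GPU'><paused v='true'/></slot>"
    "</config>"
)
updated = set_gpu_slot_options(
    config, {"idle": "true", "max-packet-size": "small"}
)
print(updated)
```

The CPU slot is left untouched and any existing option on the GPU slot, such as paused, is preserved.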