
CPU fails to get work assignment, but GPU okay

Posted: Thu Feb 27, 2014 12:28 am
by tj3f3rsn
Hi all.
I've been running fah for years. I started running on my work machine a few weeks ago. For a week or two both the CPU and GPU work units ran as expected.
About a week ago (I think) the CPU began failing to get work assignments, although the GPU continues to run great.
When I check the assignment servers in my browser, the "assign" servers fail to load but the "gpu" server returns OK (as expected from my symptoms).
I suspect it is a firewall/antivirus issue, but the confusing part for me is that the cpu WAS getting assignments and the gpu still IS getting assignments.

Any ideas?


*********************** Log Started 2014-02-26T23:49:46Z ***********************
23:49:46:************************* Folding@home Client *************************
23:49:46:      Website: http://folding.stanford.edu/
23:49:46:    Copyright: (c) 2009-2013 Stanford University
23:49:46:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
23:49:46:         Args: 
23:49:46:       Config: C:/temp/FAH/FAHClient/config.xml
23:49:46:******************************** Build ********************************
23:49:46:      Version: 7.3.6
23:49:46:         Date: Feb 18 2013
23:49:46:         Time: 15:25:17
23:49:46:      SVN Rev: 3923
23:49:46:       Branch: fah/trunk/client
23:49:46:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
23:49:46:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
23:49:46:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
23:49:46:     Platform: win32 XP
23:49:46:         Bits: 32
23:49:46:         Mode: Release
23:49:46:******************************* System ********************************
23:49:46:          CPU: Intel(R) Core(TM) i7-3740QM CPU @ 2.70GHz
23:49:46:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
23:49:46:         CPUs: 4
23:49:46:       Memory: 15.88GiB
23:49:46:  Free Memory: 11.51GiB
23:49:46:      Threads: WINDOWS_THREADS
23:49:46:  Has Battery: true
23:49:46:   On Battery: false
23:49:46:   UTC offset: -8
23:49:46:          PID: 4196
23:49:46:          CWD: C:/temp/FAH/FAHClient
23:49:46:           OS: Windows 7 Enterprise
23:49:46:      OS Arch: AMD64
23:49:46:         GPUs: 1
23:49:46:        GPU 0: NVIDIA:3 GK107 [Quadro 2100M]
23:49:46:         CUDA: 3.0
23:49:46:  CUDA Driver: 5050
23:49:46:Win32 Service: false
23:49:46:***********************************************************************
23:49:46:<config>
23:49:46:  <!-- Folding Core -->
23:49:46:  <checkpoint v='10'/>
23:49:46:
23:49:46:  <!-- Folding Slot Configuration -->
23:49:46:  <power v='full'/>
23:49:46:
23:49:46:  <!-- Network -->
23:49:46:  <proxy v=':8080'/>
23:49:46:
23:49:46:  <!-- User Information -->
23:49:46:  <team v='182116'/>
23:49:46:  <user v='tj3f3rsn'/>
23:49:46:
23:49:46:  <!-- Folding Slots -->
23:49:46:  <slot id='0' type='GPU'/>
23:49:46:  <slot id='1' type='CPU'>
23:49:46:    <cpus v='-1'/>
23:49:46:  </slot>
23:49:46:</config>
23:49:46:Trying to access database...
23:49:46:Successfully acquired database lock
23:49:46:Enabled folding slot 00: READY gpu:0:GK107 [Quadro 2100M]
23:49:46:Enabled folding slot 01: READY cpu:4
23:49:46:WU01:FS00:Starting
23:49:46:WU01:FS00:Running FahCore: C:\temp\FAH\FAHClient/FAHCoreWrapper.exe C:/temp/FAH/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 703 -lifeline 4196 -checkpoint 10 -gpu 0 -gpu-vendor nvidia
23:49:46:WU01:FS00:Started FahCore on PID 7788
23:49:46:WU01:FS00:Core PID:9316
23:49:46:WU01:FS00:FahCore 0x15 started
23:49:46:WU00:FS01:Connecting to assign3.stanford.edu:8080
23:49:46:WU01:FS00:0x15:
23:49:46:WU01:FS00:0x15:*------------------------------*
23:49:46:WU01:FS00:0x15:Folding@Home GPU Core
23:49:46:WU01:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
23:49:46:WU01:FS00:0x15:Build host             AmoebaRemote
23:49:46:WU01:FS00:0x15:Board Type             NVIDIA/CUDA
23:49:46:WU01:FS00:0x15:Core                   15
23:49:46:WU01:FS00:0x15:
23:49:46:WU01:FS00:0x15:Window's signal control handler registered.
23:49:46:WU01:FS00:0x15:Preparing to commence simulation
23:49:46:WU01:FS00:0x15:- Looking at optimizations...
23:49:46:WU01:FS00:0x15:- Files status OK
23:49:46:WU01:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
23:49:46:WU01:FS00:0x15:- Expanded 145724 -> 660986 (decompressed 453.5 percent)
23:49:46:WU01:FS00:0x15:Called DecompressByteArray: compressed_data_size=145724 data_size=660986, decompressed_data_size=660986 diff=0
23:49:46:WU01:FS00:0x15:- Digital signature verified
23:49:46:WU01:FS00:0x15:
23:49:46:WU01:FS00:0x15:Project: 8018 (Run 10, Clone 1, Gen 206)
23:49:46:WU01:FS00:0x15:
23:49:46:WU01:FS00:0x15:Assembly optimizations on if available.
23:49:46:WU01:FS00:0x15:Entering M.D.
23:49:47:WARNING:WU00:FS01:Failed to get assignment from 'assign3.stanford.edu:8080': Failed to connect to assign3.stanford.edu:8080: No connection could be made because the target machine actively refused it.
23:49:47:WU00:FS01:Connecting to assign4.stanford.edu:80
23:49:48:WU01:FS00:0x15:Will resume from checkpoint file 01/wudata_01.ckp
23:49:48:WU01:FS00:0x15:Tpr hash 01/wudata_01.tpr:  1272161019 2027377412 1023562327 708743017 2769342692
23:49:48:WU01:FS00:0x15:GPU device id=0
23:49:48:WU01:FS00:0x15:Working on GRowing Old MAkes el Chrono Sweat
23:49:48:WU01:FS00:0x15:Client config unavailable.
23:49:48:WU01:FS00:0x15:Starting GUI Server
23:49:49:WARNING:WU00:FS01:Failed to get assignment from 'assign4.stanford.edu:80': Failed to connect to assign4.stanford.edu:80: No connection could be made because the target machine actively refused it.
23:49:49:ERROR:WU00:FS01:Exception: Could not get an assignment
23:49:49:WU00:FS01:Connecting to assign3.stanford.edu:8080
23:49:50:WARNING:WU00:FS01:Failed to get assignment from 'assign3.stanford.edu:8080': Failed to connect to assign3.stanford.edu:8080: No connection could be made because the target machine actively refused it.
23:49:50:WU00:FS01:Connecting to assign4.stanford.edu:80
23:49:51:WARNING:WU00:FS01:Failed to get assignment from 'assign4.stanford.edu:80': Failed to connect to assign4.stanford.edu:80: No connection could be made because the target machine actively refused it.
23:49:51:ERROR:WU00:FS01:Exception: Could not get an assignment
23:50:49:WU00:FS01:Connecting to assign3.stanford.edu:8080
23:50:50:WU01:FS00:0x15:Resuming from checkpoint
23:50:50:WU01:FS00:0x15:fcCheckPointResume: retreived and current tpr file hash:
23:50:50:WU01:FS00:0x15:   0   1272161019   1272161019
23:50:50:WU01:FS00:0x15:   1   2027377412   2027377412
23:50:50:WU01:FS00:0x15:   2   1023562327   1023562327
23:50:50:WU01:FS00:0x15:   3    708743017    708743017
23:50:50:WU01:FS00:0x15:   4   2769342692   2769342692
23:50:50:WU01:FS00:0x15:fcCheckPointResume: file hashes same.
23:50:50:WU01:FS00:0x15:fcCheckPointResume: state restored.
23:50:50:WU01:FS00:0x15:fcCheckPointResume: name 01/wudata_01.log Verified 01/wudata_01.log
23:50:50:WU01:FS00:0x15:fcCheckPointResume: name 01/wudata_01.trr Verified 01/wudata_01.trr
23:50:50:WU01:FS00:0x15:fcCheckPointResume: name 01/wudata_01.xtc Verified 01/wudata_01.xtc
23:50:50:WU01:FS00:0x15:fcCheckPointResume: name 01/wudata_01.edr Verified 01/wudata_01.edr
23:50:50:WU01:FS00:0x15:fcCheckPointResume: state restored 2
23:50:50:WU01:FS00:0x15:Resumed from checkpoint
23:50:50:WU01:FS00:0x15:Setting checkpoint frequency: 250000
23:50:50:WU01:FS00:0x15:Completed   2500001 out of 25000000 steps (10%).
23:50:50:WARNING:WU00:FS01:Failed to get assignment from 'assign3.stanford.edu:8080': Failed to connect to assign3.stanford.edu:8080: No connection could be made because the target machine actively refused it.
23:50:50:WU00:FS01:Connecting to assign4.stanford.edu:80
23:50:51:WARNING:WU00:FS01:Failed to get assignment from 'assign4.stanford.edu:80': Failed to connect to assign4.stanford.edu:80: No connection could be made because the target machine actively refused it.
23:50:51:ERROR:WU00:FS01:Exception: Could not get an assignment
23:52:26:WU00:FS01:Connecting to assign3.stanford.edu:8080
23:52:27:WARNING:WU00:FS01:Failed to get assignment from 'assign3.stanford.edu:8080': Failed to connect to assign3.stanford.edu:8080: No connection could be made because the target machine actively refused it.
23:52:27:WU00:FS01:Connecting to assign4.stanford.edu:80
23:52:29:WARNING:WU00:FS01:Failed to get assignment from 'assign4.stanford.edu:80': Failed to connect to assign4.stanford.edu:80: No connection could be made because the target machine actively refused it.
23:52:29:ERROR:WU00:FS01:Exception: Could not get an assignment
23:55:03:WU00:FS01:Connecting to assign3.stanford.edu:8080
23:55:05:WARNING:WU00:FS01:Failed to get assignment from 'assign3.stanford.edu:8080': Failed to connect to assign3.stanford.edu:8080: No connection could be made because the target machine actively refused it.
23:55:05:WU00:FS01:Connecting to assign4.stanford.edu:80
23:55:06:WARNING:WU00:FS01:Failed to get assignment from 'assign4.stanford.edu:80': Failed to connect to assign4.stanford.edu:80: No connection could be made because the target machine actively refused it.
23:55:06:ERROR:WU00:FS01:Exception: Could not get an assignment
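
To see at a glance which servers are refusing connections, the WARNING lines in a log like the one above can be tallied per server. This is only a rough sketch: the function name `count_failures` and the regular expression are illustrative assumptions based on the line format shown in this log, not part of the Folding@home client.

```python
import re
from collections import Counter

# Matches lines such as:
#   23:49:47:WARNING:WU00:FS01:Failed to get assignment from 'assign3.stanford.edu:8080': ...
# The pattern is an assumption derived from the log format shown in this thread.
FAILED = re.compile(r"WARNING:WU\d+:FS\d+:Failed to get assignment from '([^']+)'")


def count_failures(log_text):
    """Count failed assignment attempts per 'host:port' in a client log."""
    counts = Counter()
    for line in log_text.splitlines():
        match = FAILED.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

Run on the log above, this should report five failures each for assign3.stanford.edu:8080 and assign4.stanford.edu:80, which shows that both CPU assignment servers are refusing connections, not just one.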

Re: CPU fails to get work assignment, but GPU okay

Posted: Thu Feb 27, 2014 12:43 am
by Jesse_V
Can you open assign3.stanford.edu:8080 in your browser? If you can, it will say "OK".
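
The same check can be done without a browser by attempting a plain TCP connection to the server and port. The snippet below is a minimal sketch; the helper name `can_connect` is made up for illustration. It connects directly, so it ignores any proxy that the browser or the client may be configured to use, and a successful connection only shows that the port is reachable, not that the server answers "OK".

```python
import socket


def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS lookup failures.
        return False
```

For example, `can_connect("assign3.stanford.edu", 8080)` returning False would be consistent with the "actively refused" messages in the log.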

Re: CPU fails to get work assignment, but GPU okay

Posted: Thu Feb 27, 2014 10:17 pm
by tj3f3rsn
Jesse_V wrote: Can you open assign3.stanford.edu:8080 in your browser? If you can, it will say "OK".
No. As I mentioned in the original post, I only get "OK" for the GPU server, not the regular servers. (All of the "assign" servers give a "This page can't be displayed" message in my browser.)

Again, I suspect there is some conflict with firewall/antivirus, but that doesn't explain to me why the "assign" servers USED to work, while the GPU server STILL works.
Talking to my IT Dept is not really an option, so I was just hoping someone here might have an easy answer off the top of their head.

Re: CPU fails to get work assignment, but GPU okay

Posted: Fri Feb 28, 2014 1:55 am
by bruce
According to serverstat, http://assign.stanford.edu:8080 is working but http://assign2.stanford.edu:80 is not, which matches what my browser tells me. The same is true for assign3 (OK) and assign4 (not OK).

I'll find somebody at Stanford who can fix it.

Re: CPU fails to get work assignment, but GPU okay

Posted: Fri Feb 28, 2014 4:01 pm
by tj3f3rsn
Awesome, thanks Bruce!

My machine was able to download a CPU work unit about 5 hours ago, so it looks like you got in touch with the right person.