Page 1 of 1

P7200 - several bad WUs?

Posted: Tue Mar 05, 2013 4:21 pm
by art_l_j_PlanetAMD64
I had a PRCG 7200 (11, 3, 497) with a TPF of 6 hours 8 minutes, so I "dumped" it the "correct" way by doing this:
  • Quit FAHControl and FAHClient
  • Open a Command Prompt window
  • cd to C:\Users\Art Johnson\AppData/Roaming\FAHClient
  • Execute the command:

    Code: Select all

    FAHClient --dump 02
    (02 was the Work Queue ID of the bad WU)
  • Here is the log file from doing that:

    Code: Select all

    *********************** Log Started 2013-03-05T15:40:06Z ***********************
    15:40:06:************************* Folding@home Client *************************
    15:40:06:      Website: http://folding.stanford.edu/
    15:40:06:    Copyright: (c) 2009-2012 Stanford University
    15:40:06:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
    15:40:06:         Args: --dump 02
    15:40:06:       Config: C:/Users/Art Johnson/AppData/Roaming/FAHClient/config.xml
    15:40:06:******************************** Build ********************************
    15:40:06:      Version: 7.2.9
    15:40:06:         Date: Oct 3 2012
    15:40:06:         Time: 18:05:48
    15:40:06:      SVN Rev: 3578
    15:40:06:       Branch: fah/trunk/client
    15:40:06:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
    15:40:06:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
    15:40:06:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
    15:40:06:     Platform: win32 XP
    15:40:06:         Bits: 32
    15:40:06:         Mode: Release
    15:40:06:******************************* System ********************************
    15:40:06:          CPU: AMD FX(tm)-8150 Eight-Core Processor
    15:40:06:       CPU ID: AuthenticAMD Family 21 Model 1 Stepping 2
    15:40:06:         CPUs: 8
    15:40:06:       Memory: 3.97GiB
    15:40:06:  Free Memory: 3.00GiB
    15:40:06:      Threads: WINDOWS_THREADS
    15:40:06:   On Battery: false
    15:40:06:   UTC offset: -8
    15:40:06:          PID: 1268
    15:40:06:          CWD: C:/Users/Art Johnson/AppData/Roaming/FAHClient
    15:40:06:           OS: Windows 7 Home Premium
    15:40:06:      OS Arch: AMD64
    15:40:06:         GPUs: 2
    15:40:06:        GPU 0: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
    15:40:06:        GPU 1: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
    15:40:06:         CUDA: 3.0
    15:40:06:  CUDA Driver: 5000
    15:40:06:Win32 Service: false
    15:40:06:***********************************************************************
    15:40:07:<config>
    15:40:07:  <!-- Folding Slot Configuration -->
    15:40:07:  <gpu v='true'/>
    15:40:07:  <max-packet-size v='big'/>
    15:40:07:
    15:40:07:  <!-- Network -->
    15:40:07:  <proxy v=':8080'/>
    15:40:07:
    15:40:07:  <!-- User Information -->
    15:40:07:  <passkey v='********************************'/>
    15:40:07:  <team v='45862'/>
    15:40:07:  <user v='art_l_j_PlanetAMD64'/>
    15:40:07:
    15:40:07:  <!-- Work Unit Control -->
    15:40:07:  <next-unit-percentage v='100'/>
    15:40:07:
    15:40:07:  <!-- Folding Slots -->
    15:40:07:  <slot id='0' type='GPU'>
    15:40:07:    <cuda-index v='0'/>
    15:40:07:    <gpu-index v='0'/>
    15:40:07:    <opencl-index v='0'/>
    15:40:07:  </slot>
    15:40:07:  <slot id='1' type='GPU'>
    15:40:07:    <cuda-index v='1'/>
    15:40:07:    <gpu-index v='1'/>
    15:40:07:    <opencl-index v='1'/>
    15:40:07:  </slot>
    15:40:07:  <slot id='2' type='SMP'>
    15:40:07:    <cpus v='2'/>
    15:40:07:  </slot>
    15:40:07:</config>
    15:40:07:Trying to access database...
    15:40:07:Successfully acquired database lock
    15:40:07:Enabled folding slot 00: READY gpu:0:"GK104 [GeForce GTX 660 Ti]"
    15:40:07:Enabled folding slot 01: READY gpu:1:"GK104 [GeForce GTX 660 Ti]"
    15:40:07:Enabled folding slot 02: READY smp:2
    15:40:07:WARNING:Dumping WU02 per user request
    15:40:07:WU02:FS02:Sending unit results: id:02 state:SEND error:DUMPED project:7200 run:11 clone:3 gen:497 core:0xa4 unit:0x0000027407ba4bae4da71f4218b9bf17
    15:40:07:WU02:FS02:Connecting to 129.64.95.82:8080
    15:40:07:WU02:FS02:Server responded WORK_ACK (400)
    15:40:07:WU02:FS02:Cleaning up
    15:40:07:Unit processing completed
    15:40:09:Clean exit
  • The above log file clearly shows the bad WU being returned to the Work Server, which responds with a WORK_ACK (400)
Now, I have this: PRCG 7200 (12, 10, 458), with a TPF of 1.41 days, and an ETA of 140.62 days. Help!! :egeek:

Re: P7200 - several bad WUs?

Posted: Tue Mar 05, 2013 4:51 pm
by Joe_H
Is that TPF from the log, or just the estimated value shown in FAHControl? Initial estimates shown in FAHControl are often inaccurate, a WU needs to complete a frame or two before they settle down.

Re: P7200 - several bad WUs?

Posted: Tue Mar 05, 2013 10:24 pm
by P5-133XL
I'm sorry, but what you are not showing is the log with the bad TPF's. There are several potential causes of extremely bad TPF's not the least of which is an external application using high amounts of CPU on a single core. Just as a side note, occasionally even FAHControl itself can go nuts and start using a full core continuously.