Page 1 of 1

FAULTY project:8011 run:3 clone:77 gen:9 w/ immediate exit

Posted: Fri Jan 06, 2012 4:58 am
by GreyWhiskers
I've added my report to the existing thread on 8011s. I've pruned the log to have only the offending WU.

BTW, this client (the new i7 2860QM laptop) has successfully completed 6 other P8011 WUs both before and after this Faulty WU for by far the best PPD of any other SMP-8 WUs I've processed on the laptop.
22K, 21K, 25.8K ppd for example.

Stats thanks to MtM's FAHWatch7 Preview version.

Code: Select all

*********************** Log Started 2012-01-01T17:34:09 ************************
17:34:09:************************* Folding@home Client *************************
17:34:09:      Website: http://folding.stanford.edu/
17:34:09:    Copyright: (c) 2009-2011 Stanford University
17:34:09:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
17:34:09:         Args: --lifeline 3732 --command-port=36330
17:34:09:       Config: C:/Users/USER/AppData/Roaming/FAHClient/config.xml
17:34:09:******************************** Build ********************************
17:34:09:      Version: 7.1.38
17:34:09:         Date: Oct 6 2011
17:34:09:         Time: 19:57:04
17:34:09:      SVN Rev: 3080
17:34:09:       Branch: fah/trunk/client
17:34:09:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
17:34:09:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
17:34:09:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
17:34:09:     Platform: win32 XP
17:34:09:         Bits: 32
17:34:09:         Mode: Release
17:34:09:******************************* System ********************************
17:34:09:          CPU: Intel(R) Core(TM) i7-2860QM CPU @ 2.50GHz
17:34:09:       CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
17:34:09:         CPUs: 8
17:34:09:       Memory: 15.98GiB
17:34:09:  Free Memory: 14.29GiB
17:34:09:      Threads: WINDOWS_THREADS
17:34:09:   On Battery: false
17:34:09:   UTC offset: -8
17:34:09:          PID: 4392
17:34:09:          CWD: C:/Users/USER/AppData/Roaming/FAHClient
17:34:09:           OS: Windows 7 Home Premium
17:34:09:      OS Arch: AMD64
17:34:09:         GPUs: 0
17:34:09:         CUDA: 2.1
17:34:09:  CUDA Driver: 4010
17:34:09:Win32 Service: false
17:34:09:***********************************************************************
17:34:09:<config>
17:34:09:  <service-description v='Folding@home Client'/>
17:34:09:  <service-restart v='true'/>
17:34:09:  <service-restart-delay v='5000'/>
17:34:09:
17:34:09:  <!-- Client Control -->
17:34:09:  <cycle-rate v='4'/>
17:34:09:  <cycles v='-1'/>
17:34:09:  <data-directory v='.'/>
17:34:09:  <disable-project-lookup v='false'/>
17:34:09:  <exec-directory v='C:\Program Files (x86)\FAHClient'/>
17:34:09:  <exit-when-done v='false'/>
17:34:09:  <threads v='4'/>
17:34:09:
17:34:09:  <!-- Configuration -->
17:34:09:  <config-rotate v='true'/>
17:34:09:  <config-rotate-dir v='configs'/>
17:34:09:  <config-rotate-max v='16'/>
17:34:09:
17:34:09:  <!-- Debugging -->
17:34:09:  <assignment-servers>
17:34:09:    assign3.stanford.edu:8080 assign4.stanford.edu:80
17:34:09:  </assignment-servers>
17:34:09:  <capture-directory v='capture'/>
17:34:09:  <capture-sockets v='false'/>
17:34:09:  <debug-sockets v='false'/>
17:34:09:  <exception-locations v='true'/>
17:34:09:  <gpu-assignment-servers>
17:34:09:    assign-GPU.stanford.edu:80 assign-GPU.stanford.edu:8080
17:34:09:  </gpu-assignment-servers>
17:34:09:  <stack-traces v='false'/>
17:34:09:
17:34:09:  <!-- Error Handling -->
17:34:09:  <max-slot-errors v='5'/>
17:34:09:  <max-unit-errors v='5'/>
17:34:09:
17:34:09:  <!-- FahCore Control -->
17:34:09:  <checkpoint v='15'/>
17:34:09:  <core-dir v='cores'/>
17:34:09:  <core-priority v='idle'/>
17:34:09:  <cpu-affinity v='false'/>
17:34:09:  <cpu-usage v='100'/>
17:34:09:  <no-assembly v='false'/>
17:34:09:
17:34:09:  <!-- Folding Slot Configuration -->
17:34:09:  <client-subtype v='STDCLI'/>
17:34:09:  <client-type v='normal'/>
17:34:09:  <cpu-species v='X86_PENTIUM_II'/>
17:34:09:  <cpu-type v='AMD64'/>
17:34:09:  <cpus v='-1'/>
17:34:09:  <cuda-index v='0'/>
17:34:09:  <gpu v='false'/>
17:34:09:  <gpu-usage v='100'/>
17:34:09:  <max-packet-size v='normal'/>
17:34:09:  <opencl-index v='0'/>
17:34:09:  <os-species v='UNKNOWN'/>
17:34:09:  <os-type v='WIN32'/>
17:34:09:  <project-key v='0'/>
17:34:09:  <smp v='true'/>
17:34:09:
17:34:09:  <!-- Logging -->
17:34:09:  <log v='log.txt'/>
17:34:09:  <log-color v='false'/>
17:34:09:  <log-crlf v='true'/>
17:34:09:  <log-date v='false'/>
17:34:09:  <log-debug v='true'/>
17:34:09:  <log-domain v='false'/>
17:34:09:  <log-header v='true'/>
17:34:09:  <log-level v='true'/>
17:34:09:  <log-no-info-header v='true'/>
17:34:09:  <log-redirect v='false'/>
17:34:09:  <log-rotate v='true'/>
17:34:09:  <log-rotate-dir v='logs'/>
17:34:09:  <log-rotate-max v='60'/>
17:34:09:  <log-short-level v='false'/>
17:34:09:  <log-simple-domains v='true'/>
17:34:09:  <log-thread-id v='false'/>
17:34:09:  <log-time v='true'/>
17:34:09:  <log-to-screen v='true'/>
17:34:09:  <log-truncate v='false'/>
17:34:09:  <verbosity v='4'/>
17:34:09:
17:34:09:  <!-- Network -->
17:34:09:  <proxy v=':8080'/>
17:34:09:  <proxy-enable v='false'/>
17:34:09:  <proxy-pass v=''/>
17:34:09:  <proxy-user v=''/>
17:34:09:
17:34:09:  <!-- Process Control -->
17:34:09:  <child v='false'/>
17:34:09:  <daemon v='false'/>
17:34:09:  <pid v='false'/>
17:34:09:  <pid-file v='Folding@home Client.pid'/>
17:34:09:  <respawn v='false'/>
17:34:09:  <service v='false'/>
17:34:09:
17:34:09:  <!-- Remote Command Server -->
17:34:09:  <command-address v='0.0.0.0'/>
17:34:09:  <command-allow v='127.0.0.1'/>
17:34:09:  <command-allow-no-pass v='127.0.0.1'/>
17:34:09:  <command-deny v='0.0.0.0/0'/>
17:34:09:  <command-deny-no-pass v='0.0.0.0/0'/>
17:34:09:  <command-port v='36330'/>
17:34:09:  <password v='********************************'/>
17:34:09:
17:34:09:  <!-- Slot Control -->
17:34:09:  <max-shutdown-wait v='60'/>
17:34:09:  <pause-on-battery v='true'/>
17:34:09:  <pause-on-start v='false'/>
17:34:09:
17:34:09:  <!-- User Information -->
17:34:09:  <machine-id v='0'/>
17:34:09:  <passkey v='********************************'/>
17:34:09:  <team v='0'/>
17:34:09:  <user v='GreyWhiskers'/>
17:34:09:
17:34:09:  <!-- Work Unit Control -->
17:34:09:  <dump-after-deadline v='true'/>
17:34:09:  <max-queue v='16'/>
17:34:09:  <max-units v='0'/>
17:34:09:  <next-unit-percentage v='100'/>
17:34:09:
17:34:09:  <!-- Folding Slots -->
17:34:09:  <slot id='0' type='SMP'/>
17:34:09:</config>
17:34:09:Trying to access database...
17:34:10:Successfully acquired database lock

Downloaded: 	2012-01-05T19:59:11

19:59:10:Connecting to assign3.stanford.edu:8080
19:59:10:News: Welcome to Folding@Home
19:59:10:Assigned to work server 171.67.108.60
19:59:10:Requesting new work unit for slot 00: RUNNING smp:8 from 171.67.108.60
19:59:10:Connecting to 171.67.108.60:8080
19:59:11:Slot 00: Downloading 52.17KiB
19:59:11:Slot 00: Download complete
19:59:11:Received Unit: id:01 state:DOWNLOAD error:OK project:8011 run:3 clone:77 gen:9 core:0xa4 unit:0x000000106652edcc4efd7dd561865c3a

SNIP

19:59:25:Starting Unit 01
19:59:25:Connecting to 171.67.108.60:8080
19:59:25:Running core: "C:/Users/USER/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe" -dir 01 -suffix 01 -lifeline 4392 -version 701 -checkpoint 15 -np 8
19:59:25:Started core on PID 3200
19:59:25:FahCore 0xa4 started
19:59:26:Unit 01:
19:59:26:Unit 01:*------------------------------*
19:59:26:Unit 01:Folding@Home Gromacs GB Core
19:59:26:Unit 01:Version 2.27 (Dec. 15, 2010)
19:59:26:Unit 01:
19:59:26:Unit 01:Preparing to commence simulation
19:59:26:Unit 01:- Looking at optimizations...
19:59:26:Unit 01:- Created dyn
19:59:26:Unit 01:- Files status OK
19:59:26:Unit 01:- Expanded 52906 -> 1367528 (decompressed 2584.8 percent)
19:59:26:Unit 01:Called DecompressByteArray: compressed_data_size=52906 data_size=1367528, decompressed_data_size=1367528 diff=0
19:59:26:Unit 01:- Digital signature verified
19:59:26:Unit 01:
19:59:26:Unit 01:Project: 8011 (Run 3, Clone 77, Gen 9)
19:59:26:Unit 01:
19:59:26:Unit 01:Assembly optimizations on if available.
19:59:26:Unit 01:Entering M.D.

19:59:31:Unit 01:Mapping NT from 8 to 8 
19:59:31:Unit 01:mdrun returned 255
19:59:31:Unit 01:Going to send back what have done -- stepsTotalG=250000
19:59:31:Unit 01:Work fraction=0.0000 steps=250000.
19:59:36:Unit 01:logfile size=6836 infoLength=6836 edr=0 trr=25
19:59:36:Unit 01:logfile size: 6836 info=6836 bed=0 hdr=25
19:59:36:Unit 01:- Writing 7374 bytes of core data to disk...
19:59:36:Unit 01:Done: 6862 -> 2457 (compressed to 35.8 percent)
19:59:36:Unit 01:  ... Done.
19:59:36:FahCore, running Unit 01, returned: BAD_WORK_UNIT (114 = 0x72)
19:59:36:Sending unit results: id:01 state:SEND error:FAULTY project:8011 run:3 clone:77 gen:9 core:0xa4 unit:0x000000106652edcc4efd7dd561865c3a
19:59:36:Unit 01: Uploading 2.90KiB to 171.67.108.60
19:59:36:Connecting to 171.67.108.60:8080
19:59:36:Unit 01: Upload complete
19:59:36:Server responded WORK_ACK (400)
19:59:36:Cleaning up Unit 01
Mod Edit: Split From viewtopic.php?f=19&t=20413 - PantherX

Re: FAULTY project:8011 run:3 clone:77 gen:9 w/ immediate ex

Posted: Sat Jan 07, 2012 12:46 pm
by PantherX
There were multiple failures in the WU Database so I have marked it as a bad WU:
The WU (P8011,R3,C77,G9) has been reported as a bad WU.
Thanks for your report.