Project 10121 (Run 2, Clone 392, Gen 0)

Posts: 13
Joined: Tue May 10, 2011 2:53 pm

Post by AlphaWolf »

The core crashed but has started up again. I've never had a problem with this machine before. The CPU is not overclocked; the GPU is factory "superclocked". I do run a more aggressive DEP (Data Execution Prevention) setting: I enforce it for all programs except those I specify, rather than just for "essential Windows programs and services".

Since it is moving again, I almost didn't bother reporting it, except the errors seem remarkably similar to the errors encountered in this thread: viewtopic.php?f=66&t=18541&hilit=10121
I don't have access to post there, as it is the beta forums. I run the v7 client with SMP and a GPU slot. In the log below, I'll cut out as much of the GPU-related stuff as possible. Shortly after 1% on the SMP core, I paused both cores and rebooted my machine, so you'll notice it starts out @ 1%.

Code:

*********************** Log Started 13/May/2011-00:37:19 ***********************
00:37:19:************************* Folding@home Client *************************
00:37:19:      Website:
00:37:19:    Copyright: (c) 2009,2010 Stanford University
00:37:19:       Author: Joseph Coffland <>
00:37:19:         Args: --lifeline 1012 --command-port=36330
00:37:19:       Config: C:/Users/Mike/AppData/Roaming/FAHClient/config.xml
00:37:19:******************************** Build ********************************
00:37:19:      Version: 7.1.24
00:37:19:         Date: Apr 6 2011
00:37:19:         Time: 21:37:58
00:37:19:      SVN Rev: 2908
00:37:19:       Branch: fah/trunk/client
00:37:19:     Compiler: Intel(R) C++ MSVC 1500 mode 1110
00:37:19:      Options: /TP /nologo /EHa /wd4297 /wd4103 /wd1786 /Ox -arch:SSE2
00:37:19:               /QaxSSE3,SSSE3,SSE4.1,SSE4.2 /Qrestrict /MT
00:37:19:     Platform: win32 Vista
00:37:19:         Bits: 32
00:37:19:         Mode: Release
00:37:19:******************************* System ********************************
00:37:19:           OS: Microsoft Windows 7 Home Premium
00:37:19:          CPU: Intel(R) Core(TM)2 Quad CPU Q9550 @ 2.83GHz
00:37:19:       CPU ID: GenuineIntel Family 6 Model 23 Stepping 10
00:37:19:         CPUs: 4
00:37:19:       Memory: 4.00GiB
00:37:19:  Free Memory: 2.98GiB
00:37:19:      Threads: WINDOWS_THREADS
00:37:19:         GPUs: 1
00:37:19:        GPU 0: NVIDIA:1 GT200 [GeForce GTX 260]
00:37:19:         CUDA: 1.3
00:37:19:  CUDA Driver: 4000
00:37:19:   On Battery: false
00:37:19:   UTC offset: -4
00:37:19:          PID: 1400
00:37:19:          CWD: C:/Users/Mike/AppData/Roaming/FAHClient
00:37:19:Win32 Service: false
00:37:20:  <service-description v='Folding@home Client'/>
00:37:20:  <service-restart v='true'/>
00:37:20:  <service-restart-delay v='5000'/>
00:37:20:  <!-- Client Control -->
00:37:20:  <cycle-rate v='4'/>
00:37:20:  <cycles v='-1'/>
00:37:20:  <data-directory v='.'/>
00:37:20:  <exec-directory v='C:\Program Files (x86)\FAHClient'/>
00:37:20:  <exit-when-done v='false'/>
00:37:20:  <max-delay v='21600'/>
00:37:20:  <min-delay v='60'/>
00:37:20:  <threads v='4'/>
00:37:20:  <!-- Configuration -->
00:37:20:  <config-rotate v='true'/>
00:37:20:  <config-rotate-dir v='configs'/>
00:37:20:  <config-rotate-max v='16'/>
00:37:20:  <!-- Debugging -->
00:37:20:  <assignment-servers>
00:37:20:  </assignment-servers>
00:37:20:  <capture-directory v='capture'/>
00:37:20:  <capture-sockets v='false'/>
00:37:20:  <debug-sockets v='false'/>
00:37:20:  <exception-locations v='true'/>
00:37:20:  <gpu-assignment-servers>
00:37:20:  </gpu-assignment-servers>
00:37:20:  <stack-traces v='false'/>
00:37:20:  <!-- Error Handling -->
00:37:20:  <max-slot-errors v='5'/>
00:37:20:  <max-unit-errors v='5'/>
00:37:20:  <!-- FahCore Control -->
00:37:20:  <checkpoint v='15'/>
00:37:20:  <core-dir v='cores'/>
00:37:20:  <core-priority v='idle'/>
00:37:20:  <cpu-affinity v='false'/>
00:37:20:  <cpu-usage v='100'/>
00:37:20:  <no-assembly v='false'/>
00:37:20:  <!-- Folding Slot Configuration -->
00:37:20:  <client-subtype v='STDCLI'/>
00:37:20:  <client-type v='advanced'/>
00:37:20:  <cpu-species v='X86_PENTIUM_II'/>
00:37:20:  <cpu-type v='X86'/>
00:37:20:  <cpus v='4'/>
00:37:20:  <gpu v='true'/>
00:37:20:  <gpu-id v='0'/>
00:37:20:  <max-packet-size v='big'/>
00:37:20:  <os-species v='UNKNOWN'/>
00:37:20:  <os-type v='WIN32'/>
00:37:20:  <project-key v='0'/>
00:37:20:  <smp v='true'/>
00:37:20:  <!-- Logging -->
00:37:20:  <log v='log.txt'/>
00:37:20:  <log-color v='false'/>
00:37:20:  <log-crlf v='true'/>
00:37:20:  <log-date v='false'/>
00:37:20:  <log-debug v='true'/>
00:37:20:  <log-domain v='false'/>
00:37:20:  <log-header v='true'/>
00:37:20:  <log-level v='true'/>
00:37:20:  <log-no-info-header v='true'/>
00:37:20:  <log-redirect v='false'/>
00:37:20:  <log-rotate v='true'/>
00:37:20:  <log-rotate-dir v='logs'/>
00:37:20:  <log-rotate-max v='16'/>
00:37:20:  <log-short-level v='false'/>
00:37:20:  <log-simple-domains v='true'/>
00:37:20:  <log-thread-id v='false'/>
00:37:20:  <log-time v='true'/>
00:37:20:  <log-to-screen v='true'/>
00:37:20:  <log-truncate v='false'/>
00:37:20:  <verbosity v='5'/>
00:37:20:  <!-- Network -->
00:37:20:  <proxy v=':8080'/>
00:37:20:  <proxy-enable v='false'/>
00:37:20:  <proxy-pass v=''/>
00:37:20:  <proxy-user v=''/>
00:37:20:  <!-- Process Control -->
00:37:20:  <child v='false'/>
00:37:20:  <daemon v='false'/>
00:37:20:  <pid v='false'/>
00:37:20:  <pid-file v='Folding@home'/>
00:37:20:  <respawn v='false'/>
00:37:20:  <service v='false'/>
00:37:20:  <!-- Remote Command Server -->
00:37:20:  <command-address v=''/>
00:37:20:  <command-allow v=''/>
00:37:20:  <command-allow-no-pass v=''/>
00:37:20:  <command-deny v=''/>
00:37:20:  <command-deny-no-pass v=''/>
00:37:20:  <command-port v='36330'/>
00:37:20:  <!-- Slot Control -->
00:37:20:  <max-shutdown-wait v='60'/>
00:37:20:  <pause-on-battery v='false'/>
00:37:20:  <pause-on-start v='false'/>
00:37:20:  <!-- User Information -->
00:37:20:  <machine-id v='0'/>
00:37:20:  <passkey v='********************************'/>
00:37:20:  <team v='111065'/>
00:37:20:  <user v='AlphaWolf50'/>
00:37:20:  <!-- Work Unit Control -->
00:37:20:  <dump-after-deadline v='true'/>
00:37:20:  <max-queue v='16'/>
00:37:20:  <max-units v='0'/>
00:37:20:  <next-unit-percentage v='100'/>
00:37:20:  <!-- Folding Slots -->
00:37:20:  <slot id='0' type='GPU'/>
00:37:20:  <slot id='1' type='SMP'/>
00:37:20:Trying to access database...
00:37:21:Database locked
00:37:21:Enabled folding slot 00: READY gpu:0:"GT200 [GeForce GTX 260]"
00:37:21:Enabled folding slot 01: READY smp:4
00:37:21:Started thread 6 on PID 1400
00:37:21:Started thread 5 on PID 1400
00:37:21:Started thread 4 on PID 1400
00:37:21:Started thread 1 on PID 1400
00:37:21:Started thread 3 on PID 1400
00:37:21:Starting Unit 01
00:37:21:Running core: C:/Users/Mike/AppData/Roaming/FAHClient/cores/ -dir 01 -suffix 01 -lifeline 1400 -version 701 -checkpoint 15 -np 4
00:37:21:Started core on PID 760
00:37:21:FahCore 0xa3 started
00:37:21:Started thread 7 on PID 1400
00:37:21:Connecting to
00:37:21:News: Welcome to Folding@Home
00:37:21:Assigned to work server
00:37:21:Requesting new work unit for slot 00: READY gpu:0:"GT200 [GeForce GTX 260]" from
00:37:21:Connecting to
00:37:21:Unit 01:
00:37:21:Unit 01:*------------------------------*
00:37:21:Unit 01:Folding@Home Gromacs SMP Core
00:37:21:Unit 01:Version 2.27 (Dec. 15, 2010)
00:37:21:Unit 01:
00:37:21:Unit 01:Preparing to commence simulation
00:37:21:Unit 01:- Ensuring status. Please wait.
00:37:22:Slot 00: Downloading 72.65KiB
00:37:23:Slot 00: Download complete
00:37:23:Received Unit: id:00 state:DOWNLOAD project:6605 run:6 clone:266 gen:845 core:0x11 unit:0x2a827cbc4dcc7d54034d010a000619cd
00:37:23:Starting Unit 00
00:37:23:Running core: C:/Users/Mike/AppData/Roaming/FAHClient/cores/ -dir 00 -suffix 01 -lifeline 1400 -version 701 -checkpoint 15 -gpu 0
00:37:23:Started core on PID 1672
00:37:23:FahCore 0x11 started
00:37:23:Started thread 8 on PID 1400
00:37:23:Unit 00:
00:37:23:Unit 00:*------------------------------*
00:37:23:Unit 00:Folding@Home GPU Core
00:37:23:Unit 00:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
00:37:23:Unit 00:
00:37:23:Unit 00:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
00:37:23:Unit 00:Build host: amoeba
00:37:23:Unit 00:Board Type: Nvidia
00:37:23:Unit 00:Core      : 
00:37:23:Unit 00:Preparing to commence simulation
00:37:23:Unit 00:- Looking at optimizations...
00:37:23:Unit 00:DeleteFrameFiles: successfully deleted file=00/wudata_01.ckp
00:37:23:Unit 00:- Created dyn
00:37:23:Unit 00:- Files status OK
00:37:23:Unit 00:- Expanded 73880 -> 383588 (decompressed 519.2 percent)
00:37:23:Unit 00:Called DecompressByteArray: compressed_data_size=73880 data_size=383588, decompressed_data_size=383588 diff=0
00:37:23:Unit 00:- Digital signature verified
00:37:23:Unit 00:
00:37:23:Unit 00:Project: 6605 (Run 6, Clone 266, Gen 845)
00:37:23:Unit 00:
00:37:23:Unit 00:Assembly optimizations on if available.
00:37:23:Unit 00:Entering M.D.
00:37:24:Server connection id=1 on from
00:37:24:Started thread 9 on PID 1400
00:37:29:Unit 00:Tpr hash 00/wudata_01.tpr:  1589591875 2638857951 2393670717 2413450958 1367645381
00:37:29:Unit 00:
00:37:29:Unit 00:Calling fah_main args: 14 usage=100
00:37:29:Unit 00:
00:37:29:Unit 00:Working on Protein
00:37:30:Unit 00:Client config unavailable.
00:37:31:Unit 01:- Looking at optimizations...
00:37:31:Unit 01:- Working with standard loops on this execution.
00:37:31:Unit 01:- Previous termination of core was improper.
00:37:31:Unit 01:- Files status OK
00:37:31:Unit 01:- Expanded 809277 -> 2075428 (decompressed 256.4 percent)
00:37:31:Unit 01:Called DecompressByteArray: compressed_data_size=809277 data_size=2075428, decompressed_data_size=2075428 diff=0
00:37:31:Unit 01:- Digital signature verified
00:37:31:Unit 01:
00:37:31:Unit 01:Project: 10121 (Run 2, Clone 392, Gen 0)
00:37:31:Unit 01:
00:37:31:Unit 01:Entering M.D.
00:37:31:Unit 00:Starting GUI Server
00:37:37:Unit 01:Using Gromacs checkpoints
00:37:37:Unit 01:Mapping NT from 4 to 4 
00:37:38:Unit 01:Resuming from checkpoint
00:37:38:Unit 01:Verified 01/wudata_01.log
00:37:38:Unit 01:Verified 01/wudata_01.trr
00:37:38:Unit 01:Verified 01/wudata_01.xtc
00:37:38:Unit 01:Verified 01/wudata_01.edr
00:37:38:Unit 01:Completed 30444 out of 2500000 steps  (1%)
01:06:38:Unit 01:Completed 50000 out of 2500000 steps  (2%)
01:43:00:Unit 01:Completed 75000 out of 2500000 steps  (3%)
02:18:31:Unit 01:Completed 100000 out of 2500000 steps  (4%)
02:54:50:Unit 01:Completed 125000 out of 2500000 steps  (5%)
03:31:38:Unit 01:Completed 150000 out of 2500000 steps  (6%)
04:07:56:Unit 01:Completed 175000 out of 2500000 steps  (7%)
04:44:39:Unit 01:Completed 200000 out of 2500000 steps  (8%)
05:20:37:Unit 01:Completed 225000 out of 2500000 steps  (9%)
09:30:48:FahCore, running Unit 01, returned: UNKNOWN_ENUM (-1073741819)
09:30:48:Starting Unit 01
09:30:48:Running core: C:/Users/Mike/AppData/Roaming/FAHClient/cores/ -dir 01 -suffix 01 -lifeline 1400 -version 701 -checkpoint 15 -np 4
09:30:48:Started core on PID 3924
09:30:48:FahCore 0xa3 started
09:30:48:Started thread 15 on PID 1400
09:30:48:Unit 01:
09:30:48:Unit 01:*------------------------------*
09:30:48:Unit 01:Folding@Home Gromacs SMP Core
09:30:48:Unit 01:Version 2.27 (Dec. 15, 2010)
09:30:48:Unit 01:
09:30:48:Unit 01:Preparing to commence simulation
09:30:48:Unit 01:- Ensuring status. Please wait.
09:30:54:Unit 02:Completed 57%
09:30:58:Unit 01:- Looking at optimizations...
09:30:58:Unit 01:- Working with standard loops on this execution.
09:30:58:Unit 01:- Previous termination of core was improper.
09:30:58:Unit 01:- Going to use standard loops.
09:30:58:Unit 01:- Files status OK
09:30:58:Unit 01:- Expanded 809277 -> 2075428 (decompressed 256.4 percent)
09:30:58:Unit 01:Called DecompressByteArray: compressed_data_size=809277 data_size=2075428, decompressed_data_size=2075428 diff=0
09:30:58:Unit 01:- Digital signature verified
09:30:58:Unit 01:
09:30:58:Unit 01:Project: 10121 (Run 2, Clone 392, Gen 0)
09:30:58:Unit 01:
09:30:58:Unit 01:Entering M.D.
09:31:04:Unit 01:Using Gromacs checkpoints
09:31:04:Unit 01:Mapping NT from 4 to 4 
09:31:04:Unit 01:Resuming from checkpoint
09:31:04:Unit 01:Verified 01/wudata_01.log
09:31:04:Unit 01:Verified 01/wudata_01.trr
09:31:04:Unit 01:Verified 01/wudata_01.xtc
09:31:04:Unit 01:Verified 01/wudata_01.edr
09:31:04:Unit 01:Completed 235928 out of 2500000 steps  (9%)
09:51:40:Unit 01:Completed 250000 out of 2500000 steps  (10%)
You should also notice that when I came back to my machine and closed the Windows error message, the core started back up on its own. So far it is still progressing. Here are the related Windows error logs:

Code:

Faulting application name: FahCore_a3.exe, version:, time stamp: 0x4d4720af
Faulting module name: FahCore_a3.exe, version:, time stamp: 0x4d4720af
Exception code: 0xc0000005
Fault offset: 0x0026efdb
Faulting process id: 0x2f8
Faulting application start time: 0x01cc1105ebaa938a
Faulting application path: C:\Users\Mike\AppData\Roaming\FAHClient\cores\\~pande\Win32\x86\Core_a3.fah\FahCore_a3.exe
Faulting module path: C:\Users\Mike\AppData\Roaming\FAHClient\cores\\~pande\Win32\x86\Core_a3.fah\FahCore_a3.exe
Report Id: 68fa72fb-7d24-11e0-b1bf-00241d27e46b

Code:

Fault bucket , type 0
Event Name: APPCRASH
Response: Not available
Cab Id: 0

Problem signature:
P1: FahCore_a3.exe
P3: 4d4720af
P4: FahCore_a3.exe
P6: 4d4720af
P7: c0000005
P8: 0026efdb

Attached files:

These files may be available here:

Analysis symbol: 
Rechecking for solution: 0
Report Id: 68fa72fb-7d24-11e0-b1bf-00241d27e46b
Report Status: 0
And here is the file it references above (The "WER" file):

Code:

Sig[0].Name=Application Name
Sig[1].Name=Application Version
Sig[2].Name=Application Timestamp
Sig[3].Name=Fault Module Name
Sig[4].Name=Fault Module Version
Sig[5].Name=Fault Module Timestamp
Sig[6].Name=Exception Code
Sig[7].Name=Exception Offset
DynamicSig[1].Name=OS Version
DynamicSig[2].Name=Locale ID
DynamicSig[22].Name=Additional Information 1
DynamicSig[23].Name=Additional Information 2
DynamicSig[24].Name=Additional Information 3
DynamicSig[25].Name=Additional Information 4
UI[3]=FahCore_a3.exe has stopped working
UI[4]=Windows can check online for a solution to the problem.
UI[5]=Check online for a solution and close the program
UI[6]=Check online for a solution later and close the program
UI[7]=Close the program
LoadedModule[4]=C:\Program Files\AVAST Software\Avast\snxhk.dll
FriendlyEventName=Stopped working
Let me know if you need anything else. Right now it is happily chugging away again... except my PPD is in the gutter :(
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project 10121 (Run 2, Clone 392, Gen 0)

Post by bruce »

Thanks. The stats database for Project: 10121 (Run 2, Clone 392, Gen 0) shows one error report, so it's probably a bad WU. We'll wait for one more report.

It bothers me slightly that Windows detected a crash and yet the WU resumed processing. But it did back up to the previous checkpoint and has apparently processed beyond the point of the error, so I guess it's OK.

You'll find a number of forum posts reporting Windows error c0000005, which is apparently a memory error (an access violation). I've pretty well convinced myself that they're some kind of hardware error. You should probably back off a tad on your overclocking and make sure the RAM isn't overheating.
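For anyone matching the client log against the Windows event log: the two report the same crash in different notations. The sketch below is only an illustration (the helper names are made up for this post, not part of any FAH tool). It reinterprets the signed exit code from the client log as an unsigned 32-bit value, and converts the hexadecimal "Faulting process id" to decimal.

```python
def exit_code_to_ntstatus(exit_code: int) -> str:
    """Reinterpret a signed 32-bit exit code as an unsigned hex status code."""
    # Masking with 0xFFFFFFFF maps the negative value onto its 32-bit
    # two's-complement bit pattern.
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


def hex_pid_to_decimal(hex_pid: str) -> int:
    """Convert a hex process id (as shown in the event log) to decimal."""
    return int(hex_pid, 16)


# Value from the client log line "returned: UNKNOWN_ENUM (-1073741819)"
print(exit_code_to_ntstatus(-1073741819))  # 0xC0000005

# Value from the event log line "Faulting process id: 0x2f8"
print(hex_pid_to_decimal("0x2f8"))  # 760
```

So the UNKNOWN_ENUM (-1073741819) in the client log is the same 0xc0000005 exception that Windows reported, and process id 0x2f8 is the core the log shows as "Started core on PID 760" at 00:37:21.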
Posts: 13
Joined: Tue May 10, 2011 2:53 pm

Re: Project 10121 (Run 2, Clone 392, Gen 0)

Post by AlphaWolf »

bruce wrote: You'll find a number of forum posts reporting Windows error c0000005, which is apparently a memory error (an access violation). I've pretty well convinced myself that they're some kind of hardware error. You should probably back off a tad on your overclocking and make sure the RAM isn't overheating.
The CPU is at stock clock. I'm not sure if the RAM can be considered "overclocked" -- it's DDR3-1333 6-6-6-20 @ 1.8v, which is Crucial's "factory specs" for this RAM. I don't believe these settings are valid from a JEDEC point of view, so perhaps that's considered an "overclock"? Thoughts? I don't know how I'd determine if the RAM is getting too hot -- the second temp sensor on my motherboard (don't know if it's supposed to be case temp or chipset or what) reads 51 deg. Celsius, max 52.

The GPU is factory "superclocked", but it shouldn't be relevant to this issue since it was an SMP core that crashed, correct?

Once these units stop I'll go ahead and run StressCPU/Memtest86/MemtestG80 just to make sure nothing is failing. Almost all my parts are 3yr or lifetime warrantied so at least I can get replacements if necessary :|
Posts: 13
Joined: Tue May 10, 2011 2:53 pm

Re: Project 10121 (Run 2, Clone 392, Gen 0)

Post by AlphaWolf »

I just noticed this project is no longer listed on the psummary page -- since the initial crash the WU has been processing fairly well for the last several hours. I've got an ETA of about 1.5 days -- if/when I finish this WU is it going to be accepted?
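For what it's worth, a rough ETA can also be worked out by hand from the "Completed ... steps" lines in the log. The snippet below is just a sketch under simple assumptions: it uses three progress lines copied from the log above, assumes all timestamps fall on the same day (no midnight wrap), and extrapolates linearly. The function names are my own, not anything from the client, and the client's own ETA may be computed differently.

```python
import re
from datetime import timedelta

# Three steady-state progress lines copied from the client log above.
LOG = """\
01:06:38:Unit 01:Completed 50000 out of 2500000 steps  (2%)
01:43:00:Unit 01:Completed 75000 out of 2500000 steps  (3%)
02:18:31:Unit 01:Completed 100000 out of 2500000 steps  (4%)
"""

PROGRESS = re.compile(
    r"^(\d\d):(\d\d):(\d\d):Unit \d+:Completed (\d+) out of (\d+) steps"
)


def parse_progress(log_text):
    """Return a list of (seconds_since_midnight, steps_done, total_steps)."""
    points = []
    for line in log_text.splitlines():
        match = PROGRESS.match(line)
        if match:
            hours, minutes, seconds, done, total = map(int, match.groups())
            points.append((hours * 3600 + minutes * 60 + seconds, done, total))
    return points


def estimate_remaining(points):
    """Linearly extrapolate the time left from the first and last data points."""
    (t_first, steps_first, total), (t_last, steps_last, _) = points[0], points[-1]
    steps_per_second = (steps_last - steps_first) / (t_last - t_first)
    return timedelta(seconds=(total - steps_last) / steps_per_second)


points = parse_progress(LOG)
print("Estimated time remaining:", estimate_remaining(points))
```

Feeding it more recent progress lines will give a better number, since the rate changes whenever the machine is doing other work.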
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona

Re: Project 10121 (Run 2, Clone 392, Gen 0)

Post by 7im »

Yes. Psummary is a list of WUs being actively assigned. The list is updated dynamically.

All assigned WUs will always be accepted, assuming no technical difficulties.