F@h Keeps crashing, Not sure where to start

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

Post Reply
DR650SE
Posts: 18
Joined: Thu Dec 08, 2011 4:30 am

F@h Keeps crashing, Not sure where to start

Post by DR650SE »

Man, my app keeps crashing with the following:

This recently started happening. I get the following crash. Constantly. Sometimes one of my GPUs, Sometimes my CPU.

Code: Select all

Problem signature:
  Problem Event Name:	APPCRASH
  Application Name:	FahCore_a4.exe
  Application Version:	0.0.0.0
  Application Timestamp:	4d23eafc
  Fault Module Name:	ntdll.dll
  Fault Module Version:	6.1.7601.17725
  Fault Module Timestamp:	4ec49b8f
  Exception Code:	c0000029
  Exception Offset:	00090812
  OS Version:	6.1.7601.2.1.0.256.1
  Locale ID:	1033
  Additional Information 1:	0a9e
  Additional Information 2:	0a9e372d3b4ad19135b953a78882e789
  Additional Information 3:	0a9e
  Additional Information 4:	0a9e372d3b4ad19135b953a78882e789
My log is below.

Code: Select all

*********************** Log Started 2012-02-02T21:28:06 ************************
21:28:06:************************* Folding@home Client *************************
21:28:06:      Website: http://folding.stanford.edu/
21:28:06:    Copyright: (c) 2009-2012 Stanford University
21:28:06:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:28:06:         Args: --lifeline 3000 --command-port=36330
21:28:06:       Config: C:/Users/D.Tillman/AppData/Roaming/FAHClient/config.xml
21:28:06:******************************** Build ********************************
21:28:06:      Version: 7.1.43
21:28:06:         Date: Jan 2 2012
21:28:06:         Time: 12:33:05
21:28:06:      SVN Rev: 3223
21:28:06:       Branch: fah/trunk/client
21:28:06:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
21:28:06:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
21:28:06:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
21:28:06:     Platform: win32 XP
21:28:06:         Bits: 32
21:28:06:         Mode: Release
21:28:06:******************************* System ********************************
21:28:06:          CPU: Intel(R) Core(TM) i7 CPU X 940 @ 2.13GHz
21:28:06:       CPU ID: GenuineIntel Family 6 Model 30 Stepping 5
21:28:06:         CPUs: 8
21:28:06:       Memory: 15.99GiB
21:28:06:  Free Memory: 14.02GiB
21:28:06:      Threads: WINDOWS_THREADS
21:28:06:   On Battery: false
21:28:06:   UTC offset: -5
21:28:06:          PID: 5664
21:28:06:          CWD: C:/Users/D.Tillman/AppData/Roaming/FAHClient
21:28:06:           OS: Windows 7 Ultimate
21:28:06:      OS Arch: AMD64
21:28:06:         GPUs: 2
21:28:06:        GPU 0: ATI:4 Whistler XT [AMD Radeon HD 6970M]
21:28:06:        GPU 1: ATI:4 Whistler XT [AMD Radeon HD 6970M]
21:28:06:         CUDA: Not detected
21:28:06:Win32 Service: false
21:28:06:***********************************************************************
21:28:07:<config>
21:28:07:  <!-- FahCore Control -->
21:28:07:  <checkpoint v='20'/>
21:28:07:
21:28:07:  <!-- Folding Slot Configuration -->
21:28:07:  <gpu v='true'/>
21:28:07:
21:28:07:  <!-- Network -->
21:28:07:  <proxy v=':8080'/>
21:28:07:
21:28:07:  <!-- User Information -->
21:28:07:  <passkey v='********************************'/>
21:28:07:  <team v='213698'/>
21:28:07:  <user v='DR650SE'/>
21:28:07:
21:28:07:  <!-- Folding Slots -->
21:28:07:  <slot id='0' type='GPU'>
21:28:07:    <next-unit-percentage v='90'/>
21:28:07:  </slot>
21:28:07:  <slot id='1' type='GPU'>
21:28:07:    <next-unit-percentage v='90'/>
21:28:07:  </slot>
21:28:07:  <slot id='2' type='SMP'>
21:28:07:    <cpus v='6'/>
21:28:07:    <next-unit-percentage v='90'/>
21:28:07:  </slot>
21:28:07:</config>
21:28:07:Trying to access database...
21:28:07:Successfully acquired database lock
21:28:07:Enabled folding slot 00: READY gpu:0:"Whistler XT [AMD Radeon HD 6970M]"
21:28:07:Enabled folding slot 01: READY gpu:1:"Whistler XT [AMD Radeon HD 6970M]"
21:28:07:Enabled folding slot 02: READY smp:6
21:28:07:WU03:FS01:Starting
21:28:07:WU03:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/D.Tillman/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe -dir 03 -suffix 01 -version 701 -checkpoint 20 -gpu 1
21:28:07:WU03:FS01:Started FahCore on PID 1692
21:28:07:WU03:FS01:Core PID:4016
21:28:07:WU03:FS01:FahCore 0x16 started
21:28:07:WU01:FS00:Starting
21:28:07:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/D.Tillman/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe -dir 01 -suffix 01 -version 701 -checkpoint 20 -gpu 0
21:28:07:WU01:FS00:Started FahCore on PID 1980
21:28:07:WU01:FS00:Core PID:4212
21:28:07:WU01:FS00:FahCore 0x16 started
21:28:07:WU02:FS02:Starting
21:28:07:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/D.Tillman/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 701 -checkpoint 20 -np 6
21:28:07:WU02:FS02:Started FahCore on PID 3940
21:28:07:WU02:FS02:Core PID:6856
21:28:07:WU02:FS02:FahCore 0xa4 started
21:28:07:WU03:FS01:0x16:
21:28:07:WU03:FS01:0x16:*------------------------------*
21:28:07:WU03:FS01:0x16:Folding@Home GPU Core
21:28:07:WU03:FS01:0x16:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
21:28:07:WU03:FS01:0x16:
21:28:07:WU01:FS00:0x16:
21:28:07:WU03:FS01:0x16:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
21:28:07:WU01:FS00:0x16:*------------------------------*
21:28:07:WU03:FS01:0x16:Build host: user-f6d030f24f
21:28:07:WU01:FS00:0x16:Folding@Home GPU Core
21:28:07:WU03:FS01:0x16:Board Type: AMD/OpenCL
21:28:07:WU01:FS00:0x16:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
21:28:07:WU03:FS01:0x16:Core      : x=16
21:28:07:WU02:FS02:0xa4:
21:28:07:WU01:FS00:0x16:
21:28:07:WU03:FS01:0x16: Window's signal control handler registered.
21:28:07:WU02:FS02:0xa4:*------------------------------*
21:28:07:WU01:FS00:0x16:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
21:28:07:WU03:FS01:0x16:Preparing to commence simulation
21:28:07:WU02:FS02:0xa4:Folding@Home Gromacs GB Core
21:28:07:WU01:FS00:0x16:Build host: user-f6d030f24f
21:28:07:WU03:FS01:0x16:- Looking at optimizations...
21:28:07:WU02:FS02:0xa4:Version 2.27 (Dec. 15, 2010)
21:28:07:WU01:FS00:0x16:Board Type: AMD/OpenCL
21:28:07:WU03:FS01:0x16:- Files status OK
21:28:07:WU02:FS02:0xa4:
21:28:07:WU01:FS00:0x16:Core      : x=16
21:28:07:WU03:FS01:0x16:sizeof(CORE_PACKET_HDR) = 512 file=<>
21:28:07:WU02:FS02:0xa4:Preparing to commence simulation
21:28:07:WU01:FS00:0x16: Window's signal control handler registered.
21:28:07:WU03:FS01:0x16:- Expanded 42495 -> 171163 (decompressed 402.7 percent)
21:28:07:WU02:FS02:0xa4:- Looking at optimizations...
21:28:07:WU01:FS00:0x16:Preparing to commence simulation
21:28:08:WU03:FS01:0x16:Called DecompressByteArray: compressed_data_size=42495 data_size=171163, decompressed_data_size=171163 diff=0
21:28:08:WU02:FS02:0xa4:- Files status OK
21:28:08:WU01:FS00:0x16:- Looking at optimizations...
21:28:08:WU03:FS01:0x16:- Digital signature verified
21:28:08:WU02:FS02:0xa4:- Expanded 2079520 -> 5386224 (decompressed 259.0 percent)
21:28:08:WU01:FS00:0x16:- Files status OK
21:28:08:WU03:FS01:0x16:
21:28:08:WU02:FS02:0xa4:Called DecompressByteArray: compressed_data_size=2079520 data_size=5386224, decompressed_data_size=5386224 diff=0
21:28:08:WU01:FS00:0x16:sizeof(CORE_PACKET_HDR) = 512 file=<>
21:28:08:WU03:FS01:0x16:Project: 11293 (Run 1, Clone 407, Gen 0)
21:28:08:WU02:FS02:0xa4:- Digital signature verified
21:28:08:WU01:FS00:0x16:- Expanded 44853 -> 171163 (decompressed 381.6 percent)
21:28:08:WU03:FS01:0x16:
21:28:08:WU02:FS02:0xa4:
21:28:08:WU01:FS00:0x16:Called DecompressByteArray: compressed_data_size=44853 data_size=171163, decompressed_data_size=171163 diff=0
21:28:08:WU03:FS01:0x16:Assembly optimizations on if available.
21:28:08:WU02:FS02:0xa4:Project: 7809 (Run 3, Clone 327, Gen 33)
21:28:08:WU01:FS00:0x16:- Digital signature verified
21:28:08:WU03:FS01:0x16:Entering M.D.
21:28:08:WU02:FS02:0xa4:
21:28:08:WU01:FS00:0x16:
21:28:08:WU02:FS02:0xa4:Assembly optimizations on if available.
21:28:08:WU01:FS00:0x16:Project: 11293 (Run 0, Clone 243, Gen 2)
21:28:08:WU02:FS02:0xa4:Entering M.D.
21:28:08:WU01:FS00:0x16:
21:28:08:WU01:FS00:0x16:Assembly optimizations on if available.
21:28:08:WU01:FS00:0x16:Entering M.D.
21:28:09:WU03:FS01:0x16:Will resume from checkpoint file 03/wudata_01.ckp
21:28:09:WU03:FS01:0x16:Tpr hash 03/wudata_01.tpr:  428829383 4171448299 3993183890 4073803069 1795910398
21:28:09:WU03:FS01:0x16:Working on ALZHEIMER DISEASE AMYLOID
21:28:09:WU03:FS01:0x16:Client config unavailable.
21:28:09:WU03:FS01:0x16:Starting GUI Server
21:28:09:WU01:FS00:0x16:Will resume from checkpoint file 01/wudata_01.ckp
21:28:09:WU01:FS00:0x16:Tpr hash 01/wudata_01.tpr:  537407621 3550239141 1616066183 4199144504 2505667728
21:28:09:WU01:FS00:0x16:Working on ALZHEIMER DISEASE AMYLOID
21:28:09:WU01:FS00:0x16:Client config unavailable.
21:28:09:WU01:FS00:0x16:Starting GUI Server
21:28:10:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
21:28:13:WU02:FS02:0xa4:Using Gromacs checkpoints
21:28:13:WU02:FS02:0xa4:Mapping NT from 6 to 6 
21:28:14:WU02:FS02:0xa4:Resuming from checkpoint
21:28:14:WU02:FS02:0xa4:Verified 02/wudata_01.log
21:28:14:WU02:FS02:0xa4:Verified 02/wudata_01.trr
21:28:14:WU02:FS02:0xa4:Verified 02/wudata_01.xtc
21:28:14:WU02:FS02:0xa4:Verified 02/wudata_01.edr
21:28:14:WU02:FS02:0xa4:Completed 775880 out of 1500000 steps  (51%)
21:28:16:WU03:FS01:0x16:Resuming from checkpoint
21:28:16:WU03:FS01:0x16:fcCheckPointResume: retreived and current tpr file hash:
21:28:16:WU03:FS01:0x16:   0    428829383    428829383
21:28:16:WU03:FS01:0x16:   1   4171448299   4171448299
21:28:16:WU03:FS01:0x16:   2   3993183890   3993183890
21:28:16:WU03:FS01:0x16:   3   4073803069   4073803069
21:28:16:WU03:FS01:0x16:   4   1795910398   1795910398
21:28:16:WU03:FS01:0x16:fcCheckPointResume: file hashes same.
21:28:16:WU03:FS01:0x16:fcCheckPointResume: state restored.
21:28:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.log Verified 03/wudata_01.log
21:28:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.trr Verified 03/wudata_01.trr
21:28:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.xtc Verified 03/wudata_01.xtc
21:28:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.edr Verified 03/wudata_01.edr
21:28:16:WU03:FS01:0x16:fcCheckPointResume: state restored 2
21:28:16:WU03:FS01:0x16:Resumed from checkpoint
21:28:16:WU03:FS01:0x16:Setting checkpoint frequency: 500000
21:28:16:WU03:FS01:0x16:Completed  41500001 out of 50000000 steps (83%).
21:28:16:WU01:FS00:0x16:Resuming from checkpoint
21:28:16:WU01:FS00:0x16:fcCheckPointResume: retreived and current tpr file hash:
21:28:16:WU01:FS00:0x16:   0    537407621    537407621
21:28:16:WU01:FS00:0x16:   1   3550239141   3550239141
21:28:16:WU01:FS00:0x16:   2   1616066183   1616066183
21:28:16:WU01:FS00:0x16:   3   4199144504   4199144504
21:28:16:WU01:FS00:0x16:   4   2505667728   2505667728
21:28:16:WU01:FS00:0x16:fcCheckPointResume: file hashes same.
21:28:16:WU01:FS00:0x16:fcCheckPointResume: state restored.
21:28:16:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.log Verified 01/wudata_01.log
21:28:16:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.trr Verified 01/wudata_01.trr
21:28:16:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.xtc Verified 01/wudata_01.xtc
21:28:16:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.edr Verified 01/wudata_01.edr
21:28:17:WU01:FS00:0x16:fcCheckPointResume: state restored 2
21:28:17:WU01:FS00:0x16:Resumed from checkpoint
21:28:17:WU01:FS00:0x16:Setting checkpoint frequency: 500000
21:28:17:WU01:FS00:0x16:Completed  37500001 out of 50000000 steps (75%).
21:32:28:WU03:FS01:0x16:Completed  42000000 out of 50000000 steps (84%).
21:32:29:WU01:FS00:0x16:Completed  38000000 out of 50000000 steps (76%).
21:33:27:WU02:FS02:0xa4:Completed 780000 out of 1500000 steps  (52%)
21:37:06:WU03:FS01:0x16:Completed  42500000 out of 50000000 steps (85%).
21:37:06:WU01:FS00:0x16:Completed  38500000 out of 50000000 steps (77%).
21:41:43:WU03:FS01:0x16:Completed  43000000 out of 50000000 steps (86%).
21:41:43:WU01:FS00:0x16:Completed  39000000 out of 50000000 steps (78%).
21:46:21:WU03:FS01:0x16:Completed  43500000 out of 50000000 steps (87%).
21:46:22:WU01:FS00:0x16:Completed  39500000 out of 50000000 steps (79%).
21:51:42:WU02:FS02:0xa4:Completed 795000 out of 1500000 steps  (53%)
22:08:26:WU02:FS02:0xa4:Completed 810000 out of 1500000 steps  (54%)
22:24:52:WU03:FS01:FahCore returned: UNKNOWN_ENUM (-1073741819 = 0xc0000005)
22:24:52:WARNING:WU03:FS01:FahCore returned an unknown error code which probably indicates that it crashed
22:24:52:WU03:FS01:Starting
22:24:52:WU03:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/D.Tillman/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe -dir 03 -suffix 01 -version 701 -checkpoint 20 -gpu 1
22:24:52:WU03:FS01:Started FahCore on PID 5624
22:24:52:WU03:FS01:Core PID:5680
22:24:52:WU03:FS01:FahCore 0x16 started
22:24:52:WU03:FS01:0x16:
22:24:52:WU03:FS01:0x16:*------------------------------*
22:24:52:WU03:FS01:0x16:Folding@Home GPU Core
22:24:52:WU03:FS01:0x16:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
22:24:52:WU03:FS01:0x16:
22:24:52:WU03:FS01:0x16:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
22:24:52:WU03:FS01:0x16:Build host: user-f6d030f24f
22:24:52:WU03:FS01:0x16:Board Type: AMD/OpenCL
22:24:52:WU03:FS01:0x16:Core      : x=16
22:24:52:WU03:FS01:0x16: Window's signal control handler registered.
22:24:52:WU03:FS01:0x16:Preparing to commence simulation
22:24:52:WU03:FS01:0x16:- Ensuring status. Please wait.
22:24:53:WU01:FS00:FahCore returned: UNKNOWN_ENUM (-1073741819 = 0xc0000005)
22:24:53:WARNING:WU01:FS00:FahCore returned an unknown error code which probably indicates that it crashed
22:24:53:WU01:FS00:Starting
22:24:53:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/D.Tillman/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe -dir 01 -suffix 01 -version 701 -checkpoint 20 -gpu 0
22:24:53:WU01:FS00:Started FahCore on PID 4040
22:24:53:WU01:FS00:Core PID:5512
22:24:53:WU01:FS00:FahCore 0x16 started
22:24:53:WU01:FS00:0x16:
22:24:53:WU01:FS00:0x16:*------------------------------*
22:24:53:WU01:FS00:0x16:Folding@Home GPU Core
22:24:53:WU01:FS00:0x16:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
22:24:53:WU01:FS00:0x16:
22:24:53:WU01:FS00:0x16:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
22:24:53:WU01:FS00:0x16:Build host: user-f6d030f24f
22:24:53:WU01:FS00:0x16:Board Type: AMD/OpenCL
22:24:53:WU01:FS00:0x16:Core      : x=16
22:24:53:WU01:FS00:0x16: Window's signal control handler registered.
22:24:53:WU01:FS00:0x16:Preparing to commence simulation
22:24:53:WU01:FS00:0x16:- Ensuring status. Please wait.
22:25:01:WU03:FS01:0x16:- Looking at optimizations...
22:25:01:WU03:FS01:0x16:- Working with standard loops on this execution.
22:25:01:WU03:FS01:0x16:- Previous termination of core was improper.
22:25:01:WU03:FS01:0x16:- Files status OK
22:25:01:WU03:FS01:0x16:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:25:01:WU03:FS01:0x16:- Expanded 42495 -> 171163 (decompressed 402.7 percent)
22:25:01:WU03:FS01:0x16:Called DecompressByteArray: compressed_data_size=42495 data_size=171163, decompressed_data_size=171163 diff=0
22:25:01:WU03:FS01:0x16:- Digital signature verified
22:25:01:WU03:FS01:0x16:
22:25:01:WU03:FS01:0x16:Project: 11293 (Run 1, Clone 407, Gen 0)
22:25:02:WU03:FS01:0x16:
22:25:02:WU03:FS01:0x16:Entering M.D.
22:25:02:WU01:FS00:0x16:- Looking at optimizations...
22:25:02:WU01:FS00:0x16:- Working with standard loops on this execution.
22:25:02:WU01:FS00:0x16:- Previous termination of core was improper.
22:25:02:WU01:FS00:0x16:- Files status OK
22:25:02:WU01:FS00:0x16:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:25:02:WU01:FS00:0x16:- Expanded 44853 -> 171163 (decompressed 381.6 percent)
22:25:02:WU01:FS00:0x16:Called DecompressByteArray: compressed_data_size=44853 data_size=171163, decompressed_data_size=171163 diff=0
22:25:02:WU01:FS00:0x16:- Digital signature verified
22:25:02:WU01:FS00:0x16:
22:25:02:WU01:FS00:0x16:Project: 11293 (Run 0, Clone 243, Gen 2)
22:25:02:WU01:FS00:0x16:
22:25:02:WU01:FS00:0x16:Entering M.D.
22:25:04:WU03:FS01:0x16:Will resume from checkpoint file 03/wudata_01.ckp
22:25:04:WU03:FS01:0x16:Tpr hash 03/wudata_01.tpr:  428829383 4171448299 3993183890 4073803069 1795910398
22:25:04:WU03:FS01:0x16:Working on ALZHEIMER DISEASE AMYLOID
22:25:04:WU03:FS01:0x16:Client config unavailable.
22:25:04:WU03:FS01:0x16:Starting GUI Server
22:25:04:WU01:FS00:0x16:Will resume from checkpoint file 01/wudata_01.ckp
22:25:04:WU01:FS00:0x16:Tpr hash 01/wudata_01.tpr:  537407621 3550239141 1616066183 4199144504 2505667728
22:25:04:WU01:FS00:0x16:Working on ALZHEIMER DISEASE AMYLOID
22:25:04:WU01:FS00:0x16:Client config unavailable.
22:25:05:WU01:FS00:0x16:Starting GUI Server
22:25:10:WU02:FS02:0xa4:Completed 825000 out of 1500000 steps  (55%)
22:25:16:WU03:FS01:0x16:Resuming from checkpoint
22:25:16:WU03:FS01:0x16:fcCheckPointResume: retreived and current tpr file hash:
22:25:16:WU03:FS01:0x16:   0    428829383    428829383
22:25:16:WU03:FS01:0x16:   1   4171448299   4171448299
22:25:16:WU03:FS01:0x16:   2   3993183890   3993183890
22:25:16:WU03:FS01:0x16:   3   4073803069   4073803069
22:25:16:WU03:FS01:0x16:   4   1795910398   1795910398
22:25:16:WU03:FS01:0x16:fcCheckPointResume: file hashes same.
22:25:16:WU03:FS01:0x16:fcCheckPointResume: state restored.
22:25:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.log Verified 03/wudata_01.log
22:25:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.trr Verified 03/wudata_01.trr
22:25:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.xtc Verified 03/wudata_01.xtc
22:25:16:WU03:FS01:0x16:fcCheckPointResume: name 03/wudata_01.edr Verified 03/wudata_01.edr
22:25:16:WU03:FS01:0x16:fcCheckPointResume: state restored 2
22:25:16:WU03:FS01:0x16:Resumed from checkpoint
22:25:16:WU03:FS01:0x16:Setting checkpoint frequency: 500000
22:25:16:WU03:FS01:0x16:Completed  43000001 out of 50000000 steps (86%).
22:25:19:WU01:FS00:0x16:Resuming from checkpoint
22:25:19:WU01:FS00:0x16:fcCheckPointResume: retreived and current tpr file hash:
22:25:19:WU01:FS00:0x16:   0    537407621    537407621
22:25:19:WU01:FS00:0x16:   1   3550239141   3550239141
22:25:19:WU01:FS00:0x16:   2   1616066183   1616066183
22:25:19:WU01:FS00:0x16:   3   4199144504   4199144504
22:25:19:WU01:FS00:0x16:   4   2505667728   2505667728
22:25:19:WU01:FS00:0x16:fcCheckPointResume: file hashes same.
22:25:19:WU01:FS00:0x16:fcCheckPointResume: state restored.
22:25:19:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.log Verified 01/wudata_01.log
22:25:19:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.trr Verified 01/wudata_01.trr
22:25:19:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.xtc Verified 01/wudata_01.xtc
22:25:19:WU01:FS00:0x16:fcCheckPointResume: name 01/wudata_01.edr Verified 01/wudata_01.edr
22:25:19:WU01:FS00:0x16:fcCheckPointResume: state restored 2
22:25:19:WU01:FS00:0x16:Resumed from checkpoint
22:25:19:WU01:FS00:0x16:Setting checkpoint frequency: 500000
22:25:19:WU01:FS00:0x16:Completed  39000001 out of 50000000 steps (78%).
22:29:27:WU01:FS00:0x16:Completed  39500000 out of 50000000 steps (79%).
22:29:33:WU03:FS01:0x16:Completed  43500000 out of 50000000 steps (87%).
22:33:57:WU01:FS00:0x16:Completed  40000000 out of 50000000 steps (80%).
22:34:07:WU03:FS01:0x16:Completed  44000000 out of 50000000 steps (88%).
It's on my M17x R2. I'm using the latest V7 beta. :(
Silver Alienware M17X R2 | i7 940XM | X-Fire AMD 6990M | BIOS A10 MOD | 320GB Intel X-25M RAID 0 | 1TB Internal HDD Drive | 16GB 1333MHz | RGB LED | Killer 1103 Wifi
Black Alienware M11X R1 | SU7300 | 1GB 335M | 256GB SSD | 8GB DDR3 | Win 7
bollix47
Posts: 2958
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: F@h Keeps crashing, Not sure where to start

Post by bollix47 »

You might be running into a heat problem.

Try setting one gpu to Finish and the other to Pause. When the one finishes then set the other to fold. If that stops the problem then delete the one that you set to finish and just use one GPU client and the smp:6. I have a laptop and only run the gpu or the smp but not both and I've never had a problem but since you're trying to run 2 GPUs and the smp:6 you are probably getting some very high temperatures even for a laptop. Another option might be a cooling pad but it would have to be a good one for the kind of work you're asking the laptop to do.

gl
DR650SE
Posts: 18
Joined: Thu Dec 08, 2011 4:30 am

Re: F@h Keeps crashing, Not sure where to start

Post by DR650SE »

Hmm. Can't be heat. I monitor it closely. My gpu's top out at 68* and cpu at 78*. :(
Silver Alienware M17X R2 | i7 940XM | X-Fire AMD 6990M | BIOS A10 MOD | 320GB Intel X-25M RAID 0 | 1TB Internal HDD Drive | 16GB 1333MHz | RGB LED | Killer 1103 Wifi
Black Alienware M11X R1 | SU7300 | 1GB 335M | 256GB SSD | 8GB DDR3 | Win 7
Jesse_V
Site Moderator
Posts: 2850
Joined: Mon Jul 18, 2011 4:44 am
Hardware configuration: OS: Windows 10, Kubuntu 19.04
CPU: i7-6700k
GPU: GTX 970, GTX 1080 TI
RAM: 24 GB DDR4
Location: Western Washington

Re: F@h Keeps crashing, Not sure where to start

Post by Jesse_V »

Memory issues on RAM or GPU? Overclocking? Those were the two things I noticed after I did a Google Search.
F@h is now the top computing platform on the planet and nothing unites people like a dedicated fight against a common enemy. This virus affects all of us. Lets end it together.
7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: F@h Keeps crashing, Not sure where to start

Post by 7im »

You might want to change that next unit percentage setting on the SMP client to 99 or 100. This won't fix the problem, but at least the SMP WU isn't sitting around losing bonus points while the current SMP WU folds the last 10 frames. ;)

If you overclock, might need to back that off a few MHz. Or give the memory a little extra voltage to make it fah stable at the current settings.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
DR650SE
Posts: 18
Joined: Thu Dec 08, 2011 4:30 am

Re: F@h Keeps crashing, Not sure where to start

Post by DR650SE »

Ok guys, thanks for the help. Changing slots to 99%. Also I think perhaps it needs some memory voltage. The system was designed for a max of 8GB (2x4GB) since at the time the system was made single 8GB stick were not available. So it may be a RAM issue. Currently it's operating at 1.5v so I will up that to 1.6v. If it still crashes I will try 1.7v. As far as overclocking, nothing is overclocked. All is running stock. But I will try again.
Silver Alienware M17X R2 | i7 940XM | X-Fire AMD 6990M | BIOS A10 MOD | 320GB Intel X-25M RAID 0 | 1TB Internal HDD Drive | 16GB 1333MHz | RGB LED | Killer 1103 Wifi
Black Alienware M11X R1 | SU7300 | 1GB 335M | 256GB SSD | 8GB DDR3 | Win 7
DR650SE
Posts: 18
Joined: Thu Dec 08, 2011 4:30 am

Re: F@h Keeps crashing, Not sure where to start

Post by DR650SE »

Well I tried upping the memory voltage and that didn't seem to help. I still crashed inside of 5 min at 1.6v on the memory.

I had flashed the GPUs with an M8900 vbios a few weeks back, but didn't seem to have many issues back then. I flashed the GPUs back to a 6990M vbios and that seems to have fixed the issue. Folding at stock clocks and undervolted from 1.1v down to 1.0v and no crashes overnight. So thing s seem to be working well. GPU temps max at 68°C-70°C. Fingers crossed this cleared the issue as I'll be out of town and would lilke to get some consistent folding done.

Thanks guys for helping me out!
Silver Alienware M17X R2 | i7 940XM | X-Fire AMD 6990M | BIOS A10 MOD | 320GB Intel X-25M RAID 0 | 1TB Internal HDD Drive | 16GB 1333MHz | RGB LED | Killer 1103 Wifi
Black Alienware M11X R1 | SU7300 | 1GB 335M | 256GB SSD | 8GB DDR3 | Win 7
Post Reply