Re: Core 17 has suddenly started crashing

Posted: Thu Jun 19, 2014 11:34 pm
by Eagle
If it's not too much of a hassle, then yes, please. And thanks for your additional assistance. :)

I simply clicked "Reboot now" on Windows Update's dialog box, so 2) for that.
Launching via 1), so start-up happens automatically upon log-in.

Re: Core 17 has suddenly started crashing

Posted: Fri Jun 20, 2014 4:30 am
by PantherX
Eagle wrote: If it's not too much of a hassle, then yes, please...
Sure, not an issue at all since if there is a bug, a fix should eventually be released.

Since I start FAHClient manually, I mimicked your automatic start after log-in by placing the shortcut in the start-up folder (C:\Users\PantherX\AppData\Roaming\Microsoft\Windows\Start Menu\Programs\Startup). I then tweaked my configuration to match your behavior and performed two tests.

1) Rebooting the system without exiting FAHClient.
Do note that for the reboot, I used a shortcut (shutdown.exe -r -t 0) and this caused the WU to fail since FahCore crashed:

Code:

*********************** Log Started 2014-06-18T21:06:56Z ***********************
21:06:56:************************* Folding@home Client *************************
21:06:56:      Website: http://folding.stanford.edu/
21:06:56:    Copyright: (c) 2009-2014 Stanford University
21:06:56:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:06:56:         Args: 
21:06:56:       Config: D:/FAH/V7/config.xml
21:06:56:******************************** Build ********************************
21:06:56:      Version: 7.4.4
21:06:56:         Date: Mar 4 2014
21:06:56:         Time: 20:26:54
21:06:56:      SVN Rev: 4130
21:06:56:       Branch: fah/trunk/client
21:06:56:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
21:06:56:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
21:06:56:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
21:06:56:     Platform: win32 XP
21:06:56:         Bits: 32
21:06:56:         Mode: Release
21:06:56:******************************* System ********************************
21:06:56:          CPU: Intel(R) Core(TM) i7-3840QM CPU @ 2.80GHz
21:06:56:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
21:06:56:         CPUs: 8
21:06:56:       Memory: 15.89GiB
21:06:56:  Free Memory: 11.29GiB
21:06:56:      Threads: WINDOWS_THREADS
21:06:56:   OS Version: 6.2
21:06:56:  Has Battery: true
21:06:56:   On Battery: false
21:06:56:   UTC Offset: 3
21:06:56:          PID: 10568
21:06:56:          CWD: D:/FAH/V7
21:06:56:           OS: Windows 8 Pro
21:06:56:      OS Arch: AMD64
21:06:56:         GPUs: 1
21:06:56:        GPU 0: NVIDIA:2 GF114 [GeForce GTX 675M]
21:06:56:         CUDA: 2.1
21:06:56:  CUDA Driver: 6000
21:06:56:Win32 Service: false
21:06:56:***********************************************************************
21:06:56:<config>
21:06:56:  <!-- Network -->
21:06:56:  <proxy v=':8080'/>
21:06:56:
21:06:56:  <!-- Remote Command Server -->
21:06:56:  <password v='*********'/>
21:06:56:
21:06:56:  <!-- Slot Control -->
21:06:56:  <power v='full'/>
21:06:56:
21:06:56:  <!-- User Information -->
21:06:56:  <passkey v='********************************'/>
21:06:56:  <team v='69411'/>
21:06:56:  <user v='PantherX'/>
21:06:56:
21:06:56:  <!-- Folding Slots -->
21:06:56:  <slot id='0' type='CPU'>
21:06:56:    <cpus v='7'/>
21:06:56:    <max-packet-size v='small'/>
21:06:56:    <max-slot-errors v='1'/>
21:06:56:    <max-unit-errors v='1'/>
21:06:56:    <next-unit-percentage v='100'/>
21:06:56:    <pause-on-start v='true'/>
21:06:56:  </slot>
21:06:56:  <slot id='1' type='GPU'>
21:06:56:    <max-slot-errors v='1'/>
21:06:56:    <max-unit-errors v='1'/>
21:06:56:    <next-unit-percentage v='100'/>
21:06:56:    <pause-on-start v='true'/>
21:06:56:  </slot>
21:06:56:</config>
21:06:56:Trying to access database...
21:06:56:Successfully acquired database lock
21:06:56:Enabled folding slot 00: PAUSED cpu:7 (by user)
21:06:56:Enabled folding slot 01: PAUSED gpu:0:GF114 [GeForce GTX 675M] (by user)
21:07:06:FS01:Unpaused
21:07:07:WU00:FS01:Connecting to 171.67.108.201:80
21:07:11:WU00:FS01:Assigned to work server 140.163.4.231
21:07:11:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GF114 [GeForce GTX 675M] from 140.163.4.231
21:07:11:WU00:FS01:Connecting to 140.163.4.231:8080
21:07:12:WU00:FS01:Downloading 4.84MiB
21:07:17:WU00:FS01:Download complete
21:07:17:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13001 run:316 clone:0 gen:16 core:0x17 unit:0x00000028538b3db75328a93e6b24aff3
21:07:18:WU00:FS01:Downloading core from http://web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah
21:07:18:WU00:FS01:Connecting to web.stanford.edu:80
21:07:32:WU00:FS01:FahCore 17: Downloading 2.55MiB
21:07:38:WU00:FS01:FahCore 17: 19.60%
21:07:44:WU00:FS01:FahCore 17: 46.56%
21:07:50:WU00:FS01:FahCore 17: 71.07%
21:07:56:WU00:FS01:FahCore 17: 98.02%
21:07:56:WU00:FS01:FahCore 17: Download complete
21:07:57:WU00:FS01:Valid core signature
21:07:57:WU00:FS01:Unpacked 8.60MiB to cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe
21:07:57:WU00:FS01:Starting
21:07:57:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" D:/FAH/V7/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 10568 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
21:07:57:WU00:FS01:Started FahCore on PID 4744
21:07:57:WU00:FS01:Core PID:10104
21:07:57:WU00:FS01:FahCore 0x17 started
21:07:58:WU00:FS01:0x17:*********************** Log Started 2014-06-18T21:07:58Z ***********************
21:07:58:WU00:FS01:0x17:Project: 13001 (Run 316, Clone 0, Gen 16)
21:07:58:WU00:FS01:0x17:Unit: 0x00000028538b3db75328a93e6b24aff3
21:07:58:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
21:07:58:WU00:FS01:0x17:Machine: 1
21:07:58:WU00:FS01:0x17:Reading tar file state.xml
21:08:00:WU00:FS01:0x17:Reading tar file system.xml
21:08:01:WU00:FS01:0x17:Reading tar file integrator.xml
21:08:01:WU00:FS01:0x17:Reading tar file core.xml
21:08:01:WU00:FS01:0x17:Digital signatures verified
21:08:01:WU00:FS01:0x17:Folding@home GPU core17
21:08:01:WU00:FS01:0x17:Version 0.0.52
21:11:13:WU00:FS01:0x17:Completed 0 out of 5000000 steps (0%)
21:11:13:WU00:FS01:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
21:13:41:FS01:Finishing
21:45:34:WU00:FS01:0x17:Completed 50000 out of 5000000 steps (1%)
SNIP
01:59:50:WU00:FS01:0x17:Completed 2500000 out of 5000000 steps (50%)
02:27:40:Saving configuration to config.xml
02:27:40:<config>
02:27:40:  <!-- Network -->
02:27:40:  <proxy v=':8080'/>
02:27:40:
02:27:40:  <!-- Remote Command Server -->
02:27:40:  <password v='*********'/>
02:27:40:
02:27:40:  <!-- Slot Control -->
02:27:40:  <power v='full'/>
02:27:40:
02:27:40:  <!-- User Information -->
02:27:40:  <passkey v='********************************'/>
02:27:40:  <team v='69411'/>
02:27:40:  <user v='PantherX'/>
02:27:40:
02:27:40:  <!-- Folding Slots -->
02:27:40:  <slot id='0' type='CPU'>
02:27:40:    <cpus v='7'/>
02:27:40:    <max-packet-size v='small'/>
02:27:40:    <max-slot-errors v='1'/>
02:27:40:    <max-unit-errors v='1'/>
02:27:40:    <next-unit-percentage v='100'/>
02:27:40:    <pause-on-start v='true'/>
02:27:40:  </slot>
02:27:40:  <slot id='1' type='GPU'>
02:27:40:    <max-slot-errors v='1'/>
02:27:40:    <max-unit-errors v='1'/>
02:27:40:    <next-unit-percentage v='100'/>
02:27:40:  </slot>
02:27:40:</config>
02:27:48:Removing old file 'configs/config-20140323-150815.xml'
02:27:48:Saving configuration to config.xml
02:27:48:<config>
02:27:48:  <!-- Network -->
02:27:48:  <proxy v=':8080'/>
02:27:48:
02:27:48:  <!-- Remote Command Server -->
02:27:48:  <password v='*********'/>
02:27:48:
02:27:48:  <!-- Slot Control -->
02:27:48:  <power v='full'/>
02:27:48:
02:27:48:  <!-- User Information -->
02:27:48:  <passkey v='********************************'/>
02:27:48:  <team v='69411'/>
02:27:48:  <user v='PantherX'/>
02:27:48:
02:27:48:  <!-- Folding Slots -->
02:27:48:  <slot id='0' type='CPU'>
02:27:48:    <cpus v='7'/>
02:27:48:    <max-packet-size v='small'/>
02:27:48:    <max-slot-errors v='1'/>
02:27:48:    <max-unit-errors v='1'/>
02:27:48:    <next-unit-percentage v='100'/>
02:27:48:    <pause-on-start v='true'/>
02:27:48:  </slot>
02:27:48:  <slot id='1' type='GPU'>
02:27:48:    <max-slot-errors v='1'/>
02:27:48:    <max-unit-errors v='1'/>
02:27:48:    <next-unit-percentage v='100'/>
02:27:48:  </slot>
02:27:48:</config>
02:30:26:WARNING:WU00:FS01:FahCore crashed with Windows unhandled exception code 0x40010004, searching for this code online may provide more information
02:30:26:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (1073807364 = 0x40010004)
02:30:26:WARNING:WU00:FS01:Too many errors, failing
02:30:26:WU00:FS01:Sending unit results: id:00 state:SEND error:FAILED project:13001 run:316 clone:0 gen:16 core:0x17 unit:0x00000028538b3db75328a93e6b24aff3
02:30:26:WU00:FS01:Connecting to 140.163.4.231:8080
02:30:27:WU00:FS01:Server responded WORK_ACK (400)
02:30:28:WU00:FS01:Cleaning up
02:30:28:ERROR:WU00:FS01:Exception: Failed to remove directory './work/00': boost::filesystem::remove: The process cannot access the file because it is being used by another process: ".\work\00\01\log.txt"
02:30:28:WU00:FS01:Cleaning up
02:30:28:ERROR:WU00:FS01:Exception: Failed to remove directory './work/00': boost::filesystem::remove: The process cannot access the file because it is being used by another process: ".\work\00\01\log.txt"
2) Windows Update reboot without exiting FAHClient.
After I rebooted the system, I checked for Windows updates and installed them. Some of them required a reboot, which I triggered from the Windows Update window. In this case too, the FahCore crashed, causing the WU to be dumped:

Code:

*********************** Log Started 2014-06-20T02:31:20Z ***********************
02:31:20:************************* Folding@home Client *************************
02:31:20:      Website: http://folding.stanford.edu/
02:31:20:    Copyright: (c) 2009-2014 Stanford University
02:31:20:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:31:20:         Args: 
02:31:20:       Config: D:/FAH/V7/config.xml
02:31:20:******************************** Build ********************************
02:31:20:      Version: 7.4.4
02:31:20:         Date: Mar 4 2014
02:31:20:         Time: 20:26:54
02:31:20:      SVN Rev: 4130
02:31:20:       Branch: fah/trunk/client
02:31:20:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
02:31:20:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
02:31:20:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
02:31:20:     Platform: win32 XP
02:31:20:         Bits: 32
02:31:20:         Mode: Release
02:31:20:******************************* System ********************************
02:31:20:          CPU: Intel(R) Core(TM) i7-3840QM CPU @ 2.80GHz
02:31:20:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
02:31:20:         CPUs: 8
02:31:20:       Memory: 15.89GiB
02:31:20:  Free Memory: 13.80GiB
02:31:20:      Threads: WINDOWS_THREADS
02:31:20:   OS Version: 6.2
02:31:20:  Has Battery: true
02:31:20:   On Battery: false
02:31:20:   UTC Offset: 3
02:31:20:          PID: 6952
02:31:20:          CWD: D:/FAH/V7
02:31:20:           OS: Windows 8 Pro
02:31:20:      OS Arch: AMD64
02:31:20:         GPUs: 1
02:31:20:        GPU 0: NVIDIA:2 GF114 [GeForce GTX 675M]
02:31:20:         CUDA: 2.1
02:31:20:  CUDA Driver: 6000
02:31:20:Win32 Service: false
02:31:20:***********************************************************************
02:31:20:<config>
02:31:20:  <!-- Network -->
02:31:20:  <proxy v=':8080'/>
02:31:20:
02:31:20:  <!-- Remote Command Server -->
02:31:20:  <password v='*********'/>
02:31:20:
02:31:20:  <!-- Slot Control -->
02:31:20:  <power v='full'/>
02:31:20:
02:31:20:  <!-- User Information -->
02:31:20:  <passkey v='********************************'/>
02:31:20:  <team v='69411'/>
02:31:20:  <user v='PantherX'/>
02:31:20:
02:31:20:  <!-- Folding Slots -->
02:31:20:  <slot id='0' type='CPU'>
02:31:20:    <cpus v='7'/>
02:31:20:    <max-packet-size v='small'/>
02:31:20:    <max-slot-errors v='1'/>
02:31:20:    <max-unit-errors v='1'/>
02:31:20:    <next-unit-percentage v='100'/>
02:31:20:    <pause-on-start v='true'/>
02:31:20:  </slot>
02:31:20:  <slot id='1' type='GPU'>
02:31:20:    <max-slot-errors v='1'/>
02:31:20:    <max-unit-errors v='1'/>
02:31:20:    <next-unit-percentage v='100'/>
02:31:20:  </slot>
02:31:20:</config>
02:31:20:Trying to access database...
02:31:20:Successfully acquired database lock
02:31:20:Enabled folding slot 00: PAUSED cpu:7 (by user)
02:31:20:Enabled folding slot 01: READY gpu:0:GF114 [GeForce GTX 675M]
02:31:20:WU00:FS01:Cleaning up
02:31:21:WU00:FS01:Connecting to 171.67.108.201:80
02:31:23:WU00:FS01:Assigned to work server 171.64.65.56
02:31:23:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GF114 [GeForce GTX 675M] from 171.64.65.56
02:31:23:WU00:FS01:Connecting to 171.64.65.56:8080
02:31:24:WU00:FS01:Downloading 5.24MiB
02:31:30:WU00:FS01:Download 23.86%
02:31:36:WU00:FS01:Download 42.95%
02:31:42:WU00:FS01:Download 60.85%
02:31:48:WU00:FS01:Download 96.65%
02:31:48:WU00:FS01:Download complete
02:31:48:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9406 run:80 clone:0 gen:72 core:0x17 unit:0x000000710a3b1e5c533e7381f3ef7605
02:31:48:WU00:FS01:Starting
02:31:48:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" D:/FAH/V7/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 6952 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
02:31:48:WU00:FS01:Started FahCore on PID 5716
02:31:49:WU00:FS01:Core PID:5188
02:31:49:WU00:FS01:FahCore 0x17 started
02:31:50:WU00:FS01:0x17:*********************** Log Started 2014-06-20T02:31:49Z ***********************
02:31:50:WU00:FS01:0x17:Project: 9406 (Run 80, Clone 0, Gen 72)
02:31:50:WU00:FS01:0x17:Unit: 0x000000710a3b1e5c533e7381f3ef7605
02:31:50:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
02:31:50:WU00:FS01:0x17:Machine: 1
02:31:50:WU00:FS01:0x17:Reading tar file state.xml
02:31:51:WU00:FS01:0x17:Reading tar file system.xml
02:31:51:WU00:FS01:0x17:Reading tar file integrator.xml
02:31:51:WU00:FS01:0x17:Reading tar file core.xml
02:31:51:WU00:FS01:0x17:Digital signatures verified
02:31:51:WU00:FS01:0x17:Folding@home GPU core17
02:31:51:WU00:FS01:0x17:Version 0.0.52
02:31:52:FS01:Paused
02:31:52:FS01:Shutting core down
02:31:52:WU00:FS01:0x17:WARNING:Console control signal 1 on PID 5188
02:31:52:WU00:FS01:0x17:Exiting, please wait. . .
02:32:21:Removing old file 'configs/config-20140323-150916.xml'
02:32:21:Saving configuration to config.xml
02:32:21:<config>
02:32:21:  <!-- Network -->
02:32:21:  <proxy v=':8080'/>
02:32:21:
02:32:21:  <!-- Remote Command Server -->
02:32:21:  <password v='*********'/>
02:32:21:
02:32:21:  <!-- Slot Control -->
02:32:21:  <power v='full'/>
02:32:21:
02:32:21:  <!-- User Information -->
02:32:21:  <passkey v='********************************'/>
02:32:21:  <team v='69411'/>
02:32:21:  <user v='PantherX'/>
02:32:21:
02:32:21:  <!-- Folding Slots -->
02:32:21:  <slot id='0' type='CPU'>
02:32:21:    <cpus v='7'/>
02:32:21:    <max-packet-size v='small'/>
02:32:21:    <max-slot-errors v='1'/>
02:32:21:    <max-unit-errors v='1'/>
02:32:21:    <next-unit-percentage v='100'/>
02:32:21:    <pause-on-start v='true'/>
02:32:21:  </slot>
02:32:21:  <slot id='1' type='GPU'>
02:32:21:    <max-slot-errors v='1'/>
02:32:21:    <max-unit-errors v='1'/>
02:32:21:    <next-unit-percentage v='100'/>
02:32:21:    <paused v='true'/>
02:32:21:  </slot>
02:32:21:</config>
02:32:53:WARNING:FS01:Killing WU00
02:32:53:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
02:33:05:FS01:Unpaused
02:33:05:WU00:FS01:Starting
02:33:05:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" D:/FAH/V7/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 6952 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
02:33:05:WU00:FS01:Started FahCore on PID 6824
02:33:05:WU00:FS01:Core PID:6796
02:33:05:WU00:FS01:FahCore 0x17 started
02:33:06:WU00:FS01:0x17:*********************** Log Started 2014-06-20T02:33:05Z ***********************
02:33:06:WU00:FS01:0x17:Project: 9406 (Run 80, Clone 0, Gen 72)
02:33:06:WU00:FS01:0x17:Unit: 0x000000710a3b1e5c533e7381f3ef7605
02:33:06:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
02:33:06:WU00:FS01:0x17:Machine: 1
02:33:06:WU00:FS01:0x17:Reading tar file state.xml
02:33:07:WU00:FS01:0x17:Reading tar file system.xml
02:33:07:WU00:FS01:0x17:Reading tar file integrator.xml
02:33:07:WU00:FS01:0x17:Reading tar file core.xml
02:33:07:WU00:FS01:0x17:Digital signatures verified
02:33:07:WU00:FS01:0x17:Folding@home GPU core17
02:33:07:WU00:FS01:0x17:Version 0.0.52
02:33:22:Removing old file 'configs/config-20140323-151023.xml'
02:33:22:Saving configuration to config.xml
02:33:22:<config>
02:33:22:  <!-- Network -->
02:33:22:  <proxy v=':8080'/>
02:33:22:
02:33:22:  <!-- Remote Command Server -->
02:33:22:  <password v='*********'/>
02:33:22:
02:33:22:  <!-- Slot Control -->
02:33:22:  <power v='full'/>
02:33:22:
02:33:22:  <!-- User Information -->
02:33:22:  <passkey v='********************************'/>
02:33:22:  <team v='69411'/>
02:33:22:  <user v='PantherX'/>
02:33:22:
02:33:22:  <!-- Folding Slots -->
02:33:22:  <slot id='0' type='CPU'>
02:33:22:    <cpus v='7'/>
02:33:22:    <max-packet-size v='small'/>
02:33:22:    <max-slot-errors v='1'/>
02:33:22:    <max-unit-errors v='1'/>
02:33:22:    <next-unit-percentage v='100'/>
02:33:22:    <pause-on-start v='true'/>
02:33:22:  </slot>
02:33:22:  <slot id='1' type='GPU'>
02:33:22:    <max-slot-errors v='1'/>
02:33:22:    <max-unit-errors v='1'/>
02:33:22:    <next-unit-percentage v='100'/>
02:33:22:  </slot>
02:33:22:</config>
02:36:03:WU00:FS01:0x17:Completed 0 out of 2000000 steps (0%)
02:36:03:WU00:FS01:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
02:52:20:WU00:FS01:0x17:Completed 20000 out of 2000000 steps (1%)
03:08:21:WU00:FS01:0x17:Completed 40000 out of 2000000 steps (2%)
03:24:37:WU00:FS01:0x17:Completed 60000 out of 2000000 steps (3%)
03:32:02:WARNING:WU00:FS01:FahCore crashed with Windows unhandled exception code 0x40010004, searching for this code online may provide more information
03:32:02:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (1073807364 = 0x40010004)
03:32:02:WARNING:WU00:FS01:Too many errors, failing
03:32:02:WU00:FS01:Sending unit results: id:00 state:SEND error:FAILED project:9406 run:80 clone:0 gen:72 core:0x17 unit:0x000000710a3b1e5c533e7381f3ef7605
03:32:02:WU00:FS01:Connecting to 171.64.65.56:8080
After the restart, the log continues as follows:

Code:

*********************** Log Started 2014-06-20T03:33:30Z ***********************
03:33:30:************************* Folding@home Client *************************
03:33:30:      Website: http://folding.stanford.edu/
03:33:30:    Copyright: (c) 2009-2014 Stanford University
03:33:30:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:33:30:         Args: 
03:33:30:       Config: D:/FAH/V7/config.xml
03:33:30:******************************** Build ********************************
03:33:30:      Version: 7.4.4
03:33:30:         Date: Mar 4 2014
03:33:30:         Time: 20:26:54
03:33:30:      SVN Rev: 4130
03:33:30:       Branch: fah/trunk/client
03:33:30:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
03:33:30:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
03:33:30:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
03:33:30:     Platform: win32 XP
03:33:30:         Bits: 32
03:33:30:         Mode: Release
03:33:30:******************************* System ********************************
03:33:30:          CPU: Intel(R) Core(TM) i7-3840QM CPU @ 2.80GHz
03:33:30:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
03:33:30:         CPUs: 8
03:33:30:       Memory: 15.89GiB
03:33:30:  Free Memory: 13.80GiB
03:33:30:      Threads: WINDOWS_THREADS
03:33:30:   OS Version: 6.2
03:33:30:  Has Battery: true
03:33:30:   On Battery: false
03:33:30:   UTC Offset: 3
03:33:30:          PID: 7104
03:33:30:          CWD: D:/FAH/V7
03:33:30:           OS: Windows 8 Pro
03:33:30:      OS Arch: AMD64
03:33:30:         GPUs: 1
03:33:30:        GPU 0: NVIDIA:2 GF114 [GeForce GTX 675M]
03:33:30:         CUDA: 2.1
03:33:30:  CUDA Driver: 6000
03:33:30:Win32 Service: false
03:33:30:***********************************************************************
03:33:30:<config>
03:33:30:  <!-- Network -->
03:33:30:  <proxy v=':8080'/>
03:33:30:
03:33:30:  <!-- Remote Command Server -->
03:33:30:  <password v='*********'/>
03:33:30:
03:33:30:  <!-- Slot Control -->
03:33:30:  <power v='full'/>
03:33:30:
03:33:30:  <!-- User Information -->
03:33:30:  <passkey v='********************************'/>
03:33:30:  <team v='69411'/>
03:33:30:  <user v='PantherX'/>
03:33:30:
03:33:30:  <!-- Folding Slots -->
03:33:30:  <slot id='0' type='CPU'>
03:33:30:    <cpus v='7'/>
03:33:30:    <max-packet-size v='small'/>
03:33:30:    <max-slot-errors v='1'/>
03:33:30:    <max-unit-errors v='1'/>
03:33:30:    <next-unit-percentage v='100'/>
03:33:30:    <pause-on-start v='true'/>
03:33:30:  </slot>
03:33:30:  <slot id='1' type='GPU'>
03:33:30:    <max-slot-errors v='1'/>
03:33:30:    <max-unit-errors v='1'/>
03:33:30:    <next-unit-percentage v='100'/>
03:33:30:  </slot>
03:33:30:</config>
03:33:30:Trying to access database...
03:33:31:Successfully acquired database lock
03:33:32:Enabled folding slot 00: PAUSED cpu:7 (by user)
03:33:32:Enabled folding slot 01: READY gpu:0:GF114 [GeForce GTX 675M]
03:33:32:WU00:FS01:Sending unit results: id:00 state:SEND error:FAILED project:9406 run:80 clone:0 gen:72 core:0x17 unit:0x000000710a3b1e5c533e7381f3ef7605
03:33:32:WU00:FS01:Connecting to 171.64.65.56:8080
03:33:32:WU01:FS01:Connecting to 171.67.108.201:80
03:33:33:WU00:FS01:Server responded WORK_ACK (400)
03:33:33:WU00:FS01:Cleaning up
Thus, it seems that the safest method is to exit FAHClient before restarting Windows. I normally exit all applications manually before a restart, so for me this is fine. However, for others who expect a set-and-forget client, this may cause issues, especially during a reboot prompted by Windows Update, if not during a normal reboot as well (I used a custom shortcut, which may behave differently from the default restart).
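
For anyone who wants to check their own log.txt for the same pattern, here is a minimal Python sketch that picks out the "FahCore returned" lines and confirms that the decimal and hex values agree. It is not part of FAHClient: the function name fahcore_returns and the regular expression are made up for illustration and only cover the line format seen in the logs above. As far as I know, 0x40010004 is DBG_TERMINATE_PROCESS in the Windows headers, which would fit the core being killed by the reboot rather than failing on its own, but treat that reading as an assumption.

```python
import re

# Matches client log lines such as:
# 03:32:02:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (1073807364 = 0x40010004)
# The pattern is an assumption based only on the lines quoted in this thread.
RETURN_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):(?:WARNING:)?(?P<wu>WU\d+):(?P<slot>FS\d+):"
    r"FahCore returned: (?P<name>\w+) \((?P<dec>\d+) = (?P<hex>0x[0-9a-fA-F]+)\)"
)

# Value of DBG_TERMINATE_PROCESS from the Windows headers. Linking it to the
# reboot is an interpretation, not something the client log states.
DBG_TERMINATE_PROCESS = 0x40010004


def fahcore_returns(lines):
    """Yield one dict per 'FahCore returned' line found in the client log."""
    for line in lines:
        m = RETURN_RE.match(line.strip())
        if not m:
            continue
        code = int(m.group("hex"), 16)
        yield {
            "time": m.group("time"),
            "wu": m.group("wu"),
            "slot": m.group("slot"),
            "name": m.group("name"),
            "code": code,
            # The log prints the same value twice; check that both forms agree.
            "consistent": code == int(m.group("dec")),
            "terminated_externally": code == DBG_TERMINATE_PROCESS,
        }


# Two lines copied from the logs in this thread.
sample = [
    "02:32:53:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)",
    "03:32:02:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (1073807364 = 0x40010004)",
]

for entry in fahcore_returns(sample):
    print(entry["time"], entry["name"], hex(entry["code"]),
          entry["terminated_externally"])
```

To run it against a real log, pass an open file instead of the sample list, for example fahcore_returns(open("log.txt", encoding="utf-8", errors="replace")); the file name and location depend on your data directory.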

Re: Core 17 has suddenly started crashing

Posted: Fri Jun 20, 2014 1:16 pm
by Eagle
PantherX wrote: Sure, not an issue at all since if there is a bug, a fix should eventually be released.
Thanks a lot! :)
PantherX wrote: Since I manually start FAHClient, in order to mimic your automatic start after log-in, I placed the shortcut in the start-up folder (C:\Users\PantherX\AppData\Roaming\Microsoft\Windows\Start Menu\Programs\Startup), tweaked my configuration to match your behavior and performed two tests.
My start-up folder path is localized (ends with "Startmenü\Programme\Autostart"), but I believe that shouldn't be a problem.
PantherX wrote: 1) Rebooting the system without exiting FAHClient.
Do note that for the reboot, I used a shortcut (shutdown.exe -r -t 0) and this caused the WU to fail since FahCore crashed:

For curiosity's sake: why didn't you opt for clicking Start -> Restart?
PantherX wrote:2) Windows Update reboot without exiting FAHClient
So, after I rebooted the system, I checked for Windows updates and installed them. Some of them required a reboot, which I did from the Windows Update window. In this case too, the FahCore crashed, causing the WU to be dumped:

Code: Select all

*********************** Log Started 2014-06-20T02:31:20Z ***********************
02:31:20:************************* Folding@home Client *************************
02:31:20:      Website: http://folding.stanford.edu/
02:31:20:    Copyright: (c) 2009-2014 Stanford University
02:31:20:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:31:20:         Args: 
02:31:20:       Config: D:/FAH/V7/config.xml
02:31:20:******************************** Build ********************************
02:31:20:      Version: 7.4.4
02:31:20:         Date: Mar 4 2014
02:31:20:         Time: 20:26:54
02:31:20:      SVN Rev: 4130
02:31:20:       Branch: fah/trunk/client
02:31:20:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
02:31:20:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
02:31:20:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
02:31:20:     Platform: win32 XP
02:31:20:         Bits: 32
02:31:20:         Mode: Release
02:31:20:******************************* System ********************************
02:31:20:          CPU: Intel(R) Core(TM) i7-3840QM CPU @ 2.80GHz
02:31:20:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
02:31:20:         CPUs: 8
02:31:20:       Memory: 15.89GiB
02:31:20:  Free Memory: 13.80GiB
02:31:20:      Threads: WINDOWS_THREADS
02:31:20:   OS Version: 6.2
02:31:20:  Has Battery: true
02:31:20:   On Battery: false
02:31:20:   UTC Offset: 3
02:31:20:          PID: 6952
02:31:20:          CWD: D:/FAH/V7
02:31:20:           OS: Windows 8 Pro
02:31:20:      OS Arch: AMD64
02:31:20:         GPUs: 1
02:31:20:        GPU 0: NVIDIA:2 GF114 [GeForce GTX 675M]
02:31:20:         CUDA: 2.1
02:31:20:  CUDA Driver: 6000
02:31:20:Win32 Service: false
02:31:20:***********************************************************************
02:31:20:<config>
02:31:20:  <!-- Network -->
02:31:20:  <proxy v=':8080'/>
02:31:20:
02:31:20:  <!-- Remote Command Server -->
02:31:20:  <password v='*********'/>
02:31:20:
02:31:20:  <!-- Slot Control -->
02:31:20:  <power v='full'/>
02:31:20:
02:31:20:  <!-- User Information -->
02:31:20:  <passkey v='********************************'/>
02:31:20:  <team v='69411'/>
02:31:20:  <user v='PantherX'/>
02:31:20:
02:31:20:  <!-- Folding Slots -->
02:31:20:  <slot id='0' type='CPU'>
02:31:20:    <cpus v='7'/>
02:31:20:    <max-packet-size v='small'/>
02:31:20:    <max-slot-errors v='1'/>
02:31:20:    <max-unit-errors v='1'/>
02:31:20:    <next-unit-percentage v='100'/>
02:31:20:    <pause-on-start v='true'/>
02:31:20:  </slot>
02:31:20:  <slot id='1' type='GPU'>
02:31:20:    <max-slot-errors v='1'/>
02:31:20:    <max-unit-errors v='1'/>
02:31:20:    <next-unit-percentage v='100'/>
02:31:20:  </slot>
02:31:20:</config>
02:31:20:Trying to access database...
02:31:20:Successfully acquired database lock
02:31:20:Enabled folding slot 00: PAUSED cpu:7 (by user)
02:31:20:Enabled folding slot 01: READY gpu:0:GF114 [GeForce GTX 675M]
02:31:20:WU00:FS01:Cleaning up
02:31:21:WU00:FS01:Connecting to 171.67.108.201:80
02:31:23:WU00:FS01:Assigned to work server 171.64.65.56
02:31:23:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GF114 [GeForce GTX 675M] from 171.64.65.56
02:31:23:WU00:FS01:Connecting to 171.64.65.56:8080
02:31:24:WU00:FS01:Downloading 5.24MiB
02:31:30:WU00:FS01:Download 23.86%
02:31:36:WU00:FS01:Download 42.95%
02:31:42:WU00:FS01:Download 60.85%
02:31:48:WU00:FS01:Download 96.65%
02:31:48:WU00:FS01:Download complete
02:31:48:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9406 run:80 clone:0 gen:72 core:0x17 unit:0x000000710a3b1e5c533e7381f3ef7605
02:31:48:WU00:FS01:Starting
02:31:48:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" D:/FAH/V7/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 6952 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
02:31:48:WU00:FS01:Started FahCore on PID 5716
02:31:49:WU00:FS01:Core PID:5188
02:31:49:WU00:FS01:FahCore 0x17 started
02:31:50:WU00:FS01:0x17:*********************** Log Started 2014-06-20T02:31:49Z ***********************
02:31:50:WU00:FS01:0x17:Project: 9406 (Run 80, Clone 0, Gen 72)
02:31:50:WU00:FS01:0x17:Unit: 0x000000710a3b1e5c533e7381f3ef7605
02:31:50:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
02:31:50:WU00:FS01:0x17:Machine: 1
02:31:50:WU00:FS01:0x17:Reading tar file state.xml
02:31:51:WU00:FS01:0x17:Reading tar file system.xml
02:31:51:WU00:FS01:0x17:Reading tar file integrator.xml
02:31:51:WU00:FS01:0x17:Reading tar file core.xml
02:31:51:WU00:FS01:0x17:Digital signatures verified
02:31:51:WU00:FS01:0x17:Folding@home GPU core17
02:31:51:WU00:FS01:0x17:Version 0.0.52
02:31:52:FS01:Paused
02:31:52:FS01:Shutting core down
02:31:52:WU00:FS01:0x17:WARNING:Console control signal 1 on PID 5188
02:31:52:WU00:FS01:0x17:Exiting, please wait. . .
02:32:21:Removing old file 'configs/config-20140323-150916.xml'
02:32:21:Saving configuration to config.xml
02:32:21:<config>
02:32:21:  <!-- Network -->
02:32:21:  <proxy v=':8080'/>
02:32:21:
02:32:21:  <!-- Remote Command Server -->
02:32:21:  <password v='*********'/>
02:32:21:
02:32:21:  <!-- Slot Control -->
02:32:21:  <power v='full'/>
02:32:21:
02:32:21:  <!-- User Information -->
02:32:21:  <passkey v='********************************'/>
02:32:21:  <team v='69411'/>
02:32:21:  <user v='PantherX'/>
02:32:21:
02:32:21:  <!-- Folding Slots -->
02:32:21:  <slot id='0' type='CPU'>
02:32:21:    <cpus v='7'/>
02:32:21:    <max-packet-size v='small'/>
02:32:21:    <max-slot-errors v='1'/>
02:32:21:    <max-unit-errors v='1'/>
02:32:21:    <next-unit-percentage v='100'/>
02:32:21:    <pause-on-start v='true'/>
02:32:21:  </slot>
02:32:21:  <slot id='1' type='GPU'>
02:32:21:    <max-slot-errors v='1'/>
02:32:21:    <max-unit-errors v='1'/>
02:32:21:    <next-unit-percentage v='100'/>
02:32:21:    <paused v='true'/>
02:32:21:  </slot>
02:32:21:</config>
02:32:53:WARNING:FS01:Killing WU00
02:32:53:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
02:33:05:FS01:Unpaused
02:33:05:WU00:FS01:Starting
02:33:05:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" D:/FAH/V7/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 6952 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
02:33:05:WU00:FS01:Started FahCore on PID 6824
02:33:05:WU00:FS01:Core PID:6796
02:33:05:WU00:FS01:FahCore 0x17 started
02:33:06:WU00:FS01:0x17:*********************** Log Started 2014-06-20T02:33:05Z ***********************
02:33:06:WU00:FS01:0x17:Project: 9406 (Run 80, Clone 0, Gen 72)
02:33:06:WU00:FS01:0x17:Unit: 0x000000710a3b1e5c533e7381f3ef7605
02:33:06:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
02:33:06:WU00:FS01:0x17:Machine: 1
02:33:06:WU00:FS01:0x17:Reading tar file state.xml
02:33:07:WU00:FS01:0x17:Reading tar file system.xml
02:33:07:WU00:FS01:0x17:Reading tar file integrator.xml
02:33:07:WU00:FS01:0x17:Reading tar file core.xml
02:33:07:WU00:FS01:0x17:Digital signatures verified
02:33:07:WU00:FS01:0x17:Folding@home GPU core17
02:33:07:WU00:FS01:0x17:Version 0.0.52
02:33:22:Removing old file 'configs/config-20140323-151023.xml'
02:33:22:Saving configuration to config.xml
02:33:22:<config>
02:33:22:  <!-- Network -->
02:33:22:  <proxy v=':8080'/>
02:33:22:
02:33:22:  <!-- Remote Command Server -->
02:33:22:  <password v='*********'/>
02:33:22:
02:33:22:  <!-- Slot Control -->
02:33:22:  <power v='full'/>
02:33:22:
02:33:22:  <!-- User Information -->
02:33:22:  <passkey v='********************************'/>
02:33:22:  <team v='69411'/>
02:33:22:  <user v='PantherX'/>
02:33:22:
02:33:22:  <!-- Folding Slots -->
02:33:22:  <slot id='0' type='CPU'>
02:33:22:    <cpus v='7'/>
02:33:22:    <max-packet-size v='small'/>
02:33:22:    <max-slot-errors v='1'/>
02:33:22:    <max-unit-errors v='1'/>
02:33:22:    <next-unit-percentage v='100'/>
02:33:22:    <pause-on-start v='true'/>
02:33:22:  </slot>
02:33:22:  <slot id='1' type='GPU'>
02:33:22:    <max-slot-errors v='1'/>
02:33:22:    <max-unit-errors v='1'/>
02:33:22:    <next-unit-percentage v='100'/>
02:33:22:  </slot>
02:33:22:</config>
02:36:03:WU00:FS01:0x17:Completed 0 out of 2000000 steps (0%)
02:36:03:WU00:FS01:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
02:52:20:WU00:FS01:0x17:Completed 20000 out of 2000000 steps (1%)
03:08:21:WU00:FS01:0x17:Completed 40000 out of 2000000 steps (2%)
03:24:37:WU00:FS01:0x17:Completed 60000 out of 2000000 steps (3%)
03:32:02:WARNING:WU00:FS01:FahCore crashed with Windows unhandled exception code 0x40010004, searching for this code online may provide more information
03:32:02:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (1073807364 = 0x40010004)
03:32:02:WARNING:WU00:FS01:Too many errors, failing
03:32:02:WU00:FS01:Sending unit results: id:00 state:SEND error:FAILED project:9406 run:80 clone:0 gen:72 core:0x17 unit:0x000000710a3b1e5c533e7381f3ef7605
03:32:02:WU00:FS01:Connecting to 171.64.65.56:8080
After restart, here is the continuation of the log:

Code: Select all

*********************** Log Started 2014-06-20T03:33:30Z ***********************
03:33:30:************************* Folding@home Client *************************
03:33:30:      Website: http://folding.stanford.edu/
03:33:30:    Copyright: (c) 2009-2014 Stanford University
03:33:30:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:33:30:         Args: 
03:33:30:       Config: D:/FAH/V7/config.xml
03:33:30:******************************** Build ********************************
03:33:30:      Version: 7.4.4
03:33:30:         Date: Mar 4 2014
03:33:30:         Time: 20:26:54
03:33:30:      SVN Rev: 4130
03:33:30:       Branch: fah/trunk/client
03:33:30:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
03:33:30:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
03:33:30:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
03:33:30:     Platform: win32 XP
03:33:30:         Bits: 32
03:33:30:         Mode: Release
03:33:30:******************************* System ********************************
03:33:30:          CPU: Intel(R) Core(TM) i7-3840QM CPU @ 2.80GHz
03:33:30:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
03:33:30:         CPUs: 8
03:33:30:       Memory: 15.89GiB
03:33:30:  Free Memory: 13.80GiB
03:33:30:      Threads: WINDOWS_THREADS
03:33:30:   OS Version: 6.2
03:33:30:  Has Battery: true
03:33:30:   On Battery: false
03:33:30:   UTC Offset: 3
03:33:30:          PID: 7104
03:33:30:          CWD: D:/FAH/V7
03:33:30:           OS: Windows 8 Pro
03:33:30:      OS Arch: AMD64
03:33:30:         GPUs: 1
03:33:30:        GPU 0: NVIDIA:2 GF114 [GeForce GTX 675M]
03:33:30:         CUDA: 2.1
03:33:30:  CUDA Driver: 6000
03:33:30:Win32 Service: false
03:33:30:***********************************************************************
03:33:30:<config>
03:33:30:  <!-- Network -->
03:33:30:  <proxy v=':8080'/>
03:33:30:
03:33:30:  <!-- Remote Command Server -->
03:33:30:  <password v='*********'/>
03:33:30:
03:33:30:  <!-- Slot Control -->
03:33:30:  <power v='full'/>
03:33:30:
03:33:30:  <!-- User Information -->
03:33:30:  <passkey v='********************************'/>
03:33:30:  <team v='69411'/>
03:33:30:  <user v='PantherX'/>
03:33:30:
03:33:30:  <!-- Folding Slots -->
03:33:30:  <slot id='0' type='CPU'>
03:33:30:    <cpus v='7'/>
03:33:30:    <max-packet-size v='small'/>
03:33:30:    <max-slot-errors v='1'/>
03:33:30:    <max-unit-errors v='1'/>
03:33:30:    <next-unit-percentage v='100'/>
03:33:30:    <pause-on-start v='true'/>
03:33:30:  </slot>
03:33:30:  <slot id='1' type='GPU'>
03:33:30:    <max-slot-errors v='1'/>
03:33:30:    <max-unit-errors v='1'/>
03:33:30:    <next-unit-percentage v='100'/>
03:33:30:  </slot>
03:33:30:</config>
03:33:30:Trying to access database...
03:33:31:Successfully acquired database lock
03:33:32:Enabled folding slot 00: PAUSED cpu:7 (by user)
03:33:32:Enabled folding slot 01: READY gpu:0:GF114 [GeForce GTX 675M]
03:33:32:WU00:FS01:Sending unit results: id:00 state:SEND error:FAILED project:9406 run:80 clone:0 gen:72 core:0x17 unit:0x000000710a3b1e5c533e7381f3ef7605
03:33:32:WU00:FS01:Connecting to 171.64.65.56:8080
03:33:32:WU01:FS01:Connecting to 171.67.108.201:80
03:33:33:WU00:FS01:Server responded WORK_ACK (400)
03:33:33:WU00:FS01:Cleaning up
Thus, it seems that the safest method is to exit FAHClient before restarting Windows. I normally exit all applications manually before a restart, so for me this is fine. However, for others who expect a set-and-forget client, this may cause issues, especially during a reboot prompted by Windows Update and possibly during a normal reboot as well (I can't be sure, since I used a custom shortcut, which may behave differently from the default restart).
First of all, I'm confused as to why it results in a crash/dump on your end, but none of this happens on my side - FAH stopped and, right after the restart, continued folding just to die immediately.

Regarding your suggestion, I'll stick to that next time, thanks. However, both the Windows Update scenario and normal reboots/shutdowns via Start are IMHO typical user scenarios where users expect the FAH client to handle things properly, especially because Windows sends a signal to running processes when you restart or shut down and even waits (the darkened "Waiting for other processes" overlay) for them to complete their exit.
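
As a rough illustration of the signals discussed here, the following minimal Python sketch annotates the relevant lines from the logs posted in this thread. The event numbers are the standard Windows console control events. Reading 0x40010004 as DBG_TERMINATE_PROCESS is taken from the Windows NTSTATUS list and is an interpretation, not something the FAH logs state. The `annotate` helper and both regular expressions are hypothetical and only cover the line shapes seen above.

```python
import re

# Standard Windows console control event numbers (Windows API).
CTRL_EVENTS = {
    0: "CTRL_C_EVENT",
    1: "CTRL_BREAK_EVENT",
    2: "CTRL_CLOSE_EVENT",
    5: "CTRL_LOGOFF_EVENT",
    6: "CTRL_SHUTDOWN_EVENT",
}

# Assumption: name taken from the Windows NTSTATUS list, not from FAH itself.
STATUS_NAMES = {
    0x40010004: "DBG_TERMINATE_PROCESS",
}

SIGNAL_RE = re.compile(r"Console control signal (\d+) on PID (\d+)")
RETURN_RE = re.compile(r"FahCore returned: (\w+) \((\d+) = 0x[0-9a-fA-F]+\)")


def annotate(line):
    """Return a short note for a signal or exit-code log line, else None."""
    m = SIGNAL_RE.search(line)
    if m:
        number, pid = int(m.group(1)), int(m.group(2))
        name = CTRL_EVENTS.get(number, "unknown event")
        return f"PID {pid} received {name} ({number})"
    m = RETURN_RE.search(line)
    if m:
        label, code = m.group(1), int(m.group(2))
        # Fall back to the label the client printed if the code is not in our table.
        name = STATUS_NAMES.get(code, label)
        return f"FahCore exit code 0x{code:08X} ({name})"
    return None


if __name__ == "__main__":
    sample = [
        "02:31:52:WU00:FS01:0x17:WARNING:Console control signal 1 on PID 5188",
        "19:28:24:WARNING:Console control signal 6 on PID 1732",
        "03:32:02:WARNING:WU00:FS01:FahCore returned: UNKNOWN_ENUM (1073807364 = 0x40010004)",
        "02:32:53:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)",
    ]
    for line in sample:
        print(annotate(line))
```

Read this way, the service log shows a shutdown event (6) followed by a clean exit, while the GPU core in the crash case ends with a process-termination status instead of INTERRUPTED.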

Re: Core 17 has suddenly started crashing

Posted: Fri Jun 20, 2014 1:52 pm
by 7im
Yes, we do expect the FAHClient to shut down gracefully with Windows. It does with XP, but not on anything newer. I have had a bug ticket open about this "Unhandled exception" error for a long, long time. :(

Re: Core 17 has suddenly started crashing

Posted: Sat Jun 21, 2014 3:12 am
by bruce
Eagle wrote:I'm an optimist - I'm waiting for 7.4.5 / 0.0.56 then and if that doesn't work, well, Maxwell is coming..)
You don't understand. If somebody fixes a bug in the client, it will be distributed as 7.4.5 or some higher number. If a bug in the FahCore is fixed, it will be distributed in 0.0.56 or higher. A specific bug will only be found in one or the other and both numbers will not change to get that fix.

The Revision number of the FahCore has changed several times since v7.4.4 was released and it has been automatically updated whenever there is a critical change.

Other bugs are potentially fixed whenever NVidia or AMD/ATI release new drivers.
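
To make the point about independent version numbers concrete, here is a minimal Python sketch, assuming log lines shaped like the ones posted in this thread. It pulls the client version and the FahCore version out of a log and compares each against its own target. The helper names are made up for illustration, and 7.4.5 and 0.0.56 are only the hypothetical future releases mentioned above.

```python
import re

# Client banner line, e.g. "02:31:20:      Version: 7.4.4"
CLIENT_RE = re.compile(r"^\d\d:\d\d:\d\d:\s+Version: (\d+(?:\.\d+)*)\s*$")
# Core banner line, e.g. "02:31:51:WU00:FS01:0x17:Version 0.0.52"
CORE_RE = re.compile(r":0x([0-9a-fA-F]+):Version (\d+(?:\.\d+)*)")


def parse_version(text):
    """Turn '7.4.4' into (7, 4, 4) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def component_versions(lines):
    """Collect the client version and each core's version from log lines."""
    versions = {}
    for line in lines:
        m = CLIENT_RE.match(line)
        if m:
            versions["client"] = parse_version(m.group(1))
            continue
        m = CORE_RE.search(line)
        if m:
            versions["core 0x" + m.group(1).lower()] = parse_version(m.group(2))
    return versions


if __name__ == "__main__":
    log = [
        "02:31:20:      Version: 7.4.4",
        "02:31:51:WU00:FS01:0x17:Folding@home GPU core17",
        "02:31:51:WU00:FS01:0x17:Version 0.0.52",
    ]
    found = component_versions(log)
    # Each component is checked against its own number; they move independently.
    print("client older than 7.4.5:", found["client"] < (7, 4, 5))
    print("core older than 0.0.56:", found["core 0x17"] < (0, 0, 56))
```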

Re: Core 17 has suddenly started crashing

Posted: Sat Jun 21, 2014 3:53 am
by PantherX
Eagle wrote:...For curiosity's sake: why didn't you opt for clicking Start -> Restart?...
In Windows 8, it would be:
Charms Bar -> Settings -> Power -> Restart
That's a lot of clicks and takes too much time. Hence, having a shortcut available on the desktop is ideal for me (I don't really use third-party applications to revive the Start Menu since I rarely use it in the first place).
Eagle wrote:...First of all, I'm confused as to why it results in a crash/dump on your end, but none of this happens on my side - FAH stopped and, right after the restart, continued folding just to die immediately...
I too would like to know why that is, especially for the Windows Update reboot. Unfortunately, I don't know. However, this might be related to the bug that 7im mentioned earlier (https://fah.stanford.edu/projects/FAHClient/ticket/1048).

Re: Core 17 has suddenly started crashing

Posted: Sat Jun 21, 2014 11:44 pm
by Eagle
7im wrote:Yes, we do expect the FAHClient to shut down gracefully with Windows. It does with XP, but not on anything newer. I have had a bug ticket open about this "Unhandled exception" error for a long, long time. :(
Very sad to hear such a bug still exists. I believe several WUs are dumped just because of this. Very sad for the science.. :(
bruce wrote:You don't understand. If somebody fixes a bug in the client, it will be distributed as 7.4.5 or some higher number. If a bug in the FahCore is fixed, it will be distributed in 0.0.56 or higher. A specific bug will only be found in one or the other and both numbers will not change to get that fix.

The Revision number of the FahCore has changed several times since v7.4.4 was released and it has been automatically updated whenever there is a critical change.

Other bugs are potentially fixed whenever NVidia or AMD/ATI release new drivers.
I'm sorry, but it seems like you got me wrong a second time - or I expressed myself ambiguously. To put it simply: it's all the same to me whether it's the FAH client, the core or even the GPU driver. If it gets fixed, an update (with a higher version number of that component) will be distributed.
If it doesn't get fixed, maybe a brand new FAH core (to support whatever is required to be supported) or a new GPU driver (due to a new hardware architecture, i.e. Maxwell, Volta, etc.) might solve it, as the new code works around it, fixes it or behaves completely differently. It still wouldn't bother me as long as this - at least from my point of view - very annoying bug finally gets resolved. ;)

Additionally, I hope this makes it clear to you that I do know a) what different versions of different components mean and b) that they don't have to be incremented synchronously. :)
PantherX wrote:In Windows 8, it would be:
Charms Bar -> Settings -> Power -> Restart
That's a lot of clicks and takes too much time. Hence, having a shortcut available on the desktop is ideal for me (I don't really use third-party applications to revive the Start Menu since I rarely use it in the first place).
Thanks for that hint - I'm still sticking with 7, as I strongly resist abandoning that awesome and beautiful Aero glass style for such an - IMHO - lame and ugly Metro GUI. I simply don't get why I have to settle for something graphically so mindless when a) I've got the hardware to render an animated film within hours and b) its predecessor was so much more beautiful. Not to mention that a touch-focused design isn't useful for a mouse user, so Microsoft really lost me there. I'm hoping for the best regarding 9 - they should bring that stuff back to the desktop.

Though, I'm connected to a Server 2012 R2 right now and I've found that there's this way: Start -> Poweroff (upper right) -> Restart. Maybe that's available on 8, too?
PantherX wrote:I too would like to know why that is, especially for the Windows Update reboot. Unfortunately, I don't know. However, this might be related to the bug that 7im mentioned earlier (https://fah.stanford.edu/projects/FAHClient/ticket/1048).
As stated above, I'm looking forward to seeing this fixed, as I do believe several or even many WUs get dumped because of this. Maybe the Pande Group can compare dump peaks to Microsoft patch days?
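
A minimal Python sketch of what such a comparison could look like, assuming "patch day" means Patch Tuesday (the second Tuesday of each month) and assuming a list of WU failure dates is available from somewhere. The function names and the sample dates are hypothetical; only 2014-06-20 comes from the log in this thread.

```python
from collections import Counter
from datetime import date, timedelta


def patch_tuesday(year, month):
    """Return the second Tuesday of the given month."""
    first = date(year, month, 1)
    # date.weekday(): Monday is 0, Tuesday is 1.
    days_to_first_tuesday = (1 - first.weekday()) % 7
    return first + timedelta(days=days_to_first_tuesday + 7)


def days_since_patch_tuesday(day):
    """Days elapsed since the most recent Patch Tuesday on or before `day`."""
    latest = patch_tuesday(day.year, day.month)
    if day < latest:
        # Before this month's patch day: fall back to last month's.
        if day.month == 1:
            latest = patch_tuesday(day.year - 1, 12)
        else:
            latest = patch_tuesday(day.year, day.month - 1)
    return (day - latest).days


def failure_histogram(failure_dates):
    """Count failures by their distance (in days) from the last patch day."""
    return Counter(days_since_patch_tuesday(d) for d in failure_dates)


if __name__ == "__main__":
    # Hypothetical input; a real analysis would need server-side failure data.
    failures = [date(2014, 6, 20), date(2014, 6, 11), date(2014, 6, 11)]
    for offset, count in sorted(failure_histogram(failures).items()):
        print(f"{offset} day(s) after patch day: {count} failure(s)")
```

A peak at small offsets would hint at reboots triggered by updates, although machines install updates at different times, so any such peak would probably be smeared over several days.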

Re: Core 17 has suddenly started crashing

Posted: Sun Jun 22, 2014 4:00 am
by bruce
My point is also that different people maintain different components and because of those responsibilities, bugs have to be reported to different people. Fortunately 2 out of 3 are associated with Stanford.

Re: Core 17 has suddenly started crashing

Posted: Sun Jun 22, 2014 11:39 am
by PantherX
Eagle wrote:...I'm connected to a Server 2012 R2 right now and I've found that there's this way: Start -> Poweroff (upper right) -> Restart. Maybe that's available on 8, too?...
It isn't present in Windows 8. I believe that it is present in Windows 8.1 (or one of its update packages), but currently I am in no mood to test what works and what doesn't with Windows 8.1, so I will be leaving it as is. If rumors are true, a new Windows version (hopefully, a real desktop version) will be available in 2015, so I can live with it until then.
Eagle wrote:...I'm looking forward to seeing this fixed, as I do believe several or even many WUs get dumped because of this. Maybe the Pande Group can compare dump peaks to Microsoft patch days?
Considering that it would have to be a coordinated effort across all the server/project owners, I highly doubt that it will happen anytime soon. However, in my case, I (thankfully) don't see this issue on my Windows 7 64-bit system using V7.4.4 installed as a service. Maybe this issue only happens in regular mode and/or with GPU slots? Here are two log snippets showing a system shutdown and a system reboot:
System shutdown:

Code: Select all

19:47:58:WU00:FS00:FahCore 0xa3 started
19:47:59:WU00:FS00:0xa3:
19:47:59:WU00:FS00:0xa3:*------------------------------*
19:47:59:WU00:FS00:0xa3:Folding@Home Gromacs SMP Core
19:47:59:WU00:FS00:0xa3:Version 2.27 (Dec. 15, 2010)
19:47:59:WU00:FS00:0xa3:
19:47:59:WU00:FS00:0xa3:Preparing to commence simulation
19:47:59:WU00:FS00:0xa3:- Looking at optimizations...
19:47:59:WU00:FS00:0xa3:- Created dyn
19:47:59:WU00:FS00:0xa3:- Files status OK
19:47:59:WU00:FS00:0xa3:- Expanded 3849253 -> 4392432 (decompressed 114.1 percent)
19:47:59:WU00:FS00:0xa3:Called DecompressByteArray: compressed_data_size=3849253 data_size=4392432, decompressed_data_size=4392432 diff=0
19:47:59:WU00:FS00:0xa3:- Digital signature verified
19:47:59:WU00:FS00:0xa3:
19:47:59:WU00:FS00:0xa3:Project: 8581 (Run 1, Clone 1, Gen 352)
19:47:59:WU00:FS00:0xa3:
19:47:59:WU00:FS00:0xa3:Assembly optimizations on if available.
19:47:59:WU00:FS00:0xa3:Entering M.D.
19:48:04:WU01:FS00:Upload 0.87%
19:48:05:WU00:FS00:0xa3:Mapping NT from 4 to 4 
19:48:05:WU00:FS00:0xa3:Completed 0 out of 500000 steps  (0%)
SNIP
******************************* Date: 2014-04-21 *******************************
18:53:40:WU00:FS00:0xa3:Completed 375000 out of 500000 steps  (75%)
19:12:51:WU00:FS00:0xa3:Completed 380000 out of 500000 steps  (76%)
19:28:24:WARNING:Console control signal 6 on PID 1732
19:28:24:Exiting, please wait. . .
19:28:24:WU00:FS00:0xa3:Received EVENT= from OS -- shutting down w/ INTERRUPTED signal; events=[CtrlC= Close= Break= Shutdown= LogOff=]
19:28:24:WU00:FS00:0xa3:Folding@home Core Shutdown: INTERRUPTED
19:28:24:Service shutdown requested
19:28:25:Clean exit

*********************** Log Started 2014-04-22T15:44:21Z ***********************
15:44:21:************************* Folding@home Client *************************
15:44:21:      Website: http://folding.stanford.edu/
15:44:21:    Copyright: (c) 2009-2014 Stanford University
15:44:21:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
15:44:21:         Args: 
15:44:21:       Config: C:/ProgramData/FAHClient/config.xml
15:44:21:******************************** Build ********************************
15:44:21:      Version: 7.4.4
15:44:21:         Date: Mar 4 2014
15:44:21:         Time: 20:26:54
15:44:21:      SVN Rev: 4130
15:44:21:       Branch: fah/trunk/client
15:44:21:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
15:44:21:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
15:44:21:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
15:44:21:     Platform: win32 XP
15:44:21:         Bits: 32
15:44:21:         Mode: Release
15:44:21:******************************* System ********************************
15:44:21:          CPU: Intel(R) Core(TM) i3-3240 CPU @ 3.40GHz
15:44:21:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
15:44:21:         CPUs: 4
15:44:21:       Memory: 2.74GiB
15:44:21:  Free Memory: 2.09GiB
15:44:21:      Threads: WINDOWS_THREADS
15:44:21:   OS Version: 6.1
15:44:21:  Has Battery: false
15:44:21:   On Battery: false
15:44:21:   UTC Offset: 3
15:44:21:          PID: 348
15:44:21:          CWD: C:/Windows/system32
15:44:21:           OS: Windows 7 Ultimate
15:44:21:      OS Arch: AMD64
15:44:21:         GPUs: 0
15:44:21:         CUDA: Not detected
15:44:21:Win32 Service: true
15:44:21:***********************************************************************
15:44:21:<config>
15:44:21:  <!-- HTTP Server -->
15:44:21:  <allow v='127.0.0.1, 192.168.1.1-192.168.1.10'/>
15:44:21:
15:44:21:  <!-- Network -->
15:44:21:  <proxy v=':8080'/>
15:44:21:
15:44:21:  <!-- Remote Command Server -->
15:44:21:  <password v='*********'/>
15:44:21:
15:44:21:  <!-- Slot Control -->
15:44:21:  <power v='full'/>
15:44:21:
15:44:21:  <!-- User Information -->
15:44:21:  <passkey v='********************************'/>
15:44:21:  <team v='69411'/>
15:44:21:  <user v='PantherX'/>
15:44:21:
15:44:21:  <!-- Folding Slots -->
15:44:21:  <slot id='0' type='CPU'>
15:44:21:    <cpus v='4'/>
15:44:21:    <next-unit-percentage v='100'/>
15:44:21:  </slot>
15:44:21:</config>
15:44:21:Trying to access database...
15:44:22:Successfully acquired database lock
15:44:22:Enabled folding slot 00: READY cpu:4
15:44:23:WU00:FS00:Starting
15:44:23:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 704 -lifeline 348 -checkpoint 15 -np 4 -service
15:44:23:WU00:FS00:Started FahCore on PID 2420
15:44:25:WU00:FS00:Core PID:2632
15:44:25:WU00:FS00:FahCore 0xa3 started
15:44:26:WU00:FS00:0xa3:
15:44:26:WU00:FS00:0xa3:*------------------------------*
15:44:26:WU00:FS00:0xa3:Folding@Home Gromacs SMP Core
15:44:26:WU00:FS00:0xa3:Version 2.27 (Dec. 15, 2010)
15:44:26:WU00:FS00:0xa3:
15:44:26:WU00:FS00:0xa3:Preparing to commence simulation
15:44:26:WU00:FS00:0xa3:- Looking at optimizations...
15:44:26:WU00:FS00:0xa3:- Files status OK
15:44:27:WU00:FS00:0xa3:- Expanded 3849253 -> 4392432 (decompressed 114.1 percent)
15:44:27:WU00:FS00:0xa3:Called DecompressByteArray: compressed_data_size=3849253 data_size=4392432, decompressed_data_size=4392432 diff=0
15:44:27:WU00:FS00:0xa3:- Digital signature verified
15:44:27:WU00:FS00:0xa3:
15:44:27:WU00:FS00:0xa3:Project: 8581 (Run 1, Clone 1, Gen 352)
15:44:27:WU00:FS00:0xa3:
15:44:27:WU00:FS00:0xa3:Assembly optimizations on if available.
15:44:27:WU00:FS00:0xa3:Entering M.D.
15:44:33:WU00:FS00:0xa3:Using Gromacs checkpoints
15:44:34:WU00:FS00:0xa3:Mapping NT from 4 to 4 
15:44:36:WU00:FS00:0xa3:Resuming from checkpoint
15:44:36:WU00:FS00:0xa3:Verified 00/wudata_01.log
15:44:37:WU00:FS00:0xa3:Verified 00/wudata_01.trr
15:44:37:WU00:FS00:0xa3:Verified 00/wudata_01.edr
15:44:37:WU00:FS00:0xa3:Completed 381415 out of 500000 steps  (76%)
15:58:54:WU00:FS00:0xa3:Completed 385000 out of 500000 steps  (77%)
16:18:12:WU00:FS00:0xa3:Completed 390000 out of 500000 steps  (78%)
System reboot:

Code: Select all

20:48:11:WU01:FS00:FahCore 0xa3 started
20:48:12:WU01:FS00:0xa3:
20:48:12:WU01:FS00:0xa3:*------------------------------*
20:48:12:WU01:FS00:0xa3:Folding@Home Gromacs SMP Core
20:48:12:WU01:FS00:0xa3:Version 2.27 (Dec. 15, 2010)
20:48:12:WU01:FS00:0xa3:
20:48:12:WU01:FS00:0xa3:Preparing to commence simulation
20:48:12:WU01:FS00:0xa3:- Looking at optimizations...
20:48:12:WU01:FS00:0xa3:- Created dyn
20:48:12:WU01:FS00:0xa3:- Files status OK
20:48:12:WU01:FS00:0xa3:- Expanded 3851534 -> 4394468 (decompressed 114.0 percent)
20:48:12:WU01:FS00:0xa3:Called DecompressByteArray: compressed_data_size=3851534 data_size=4394468, decompressed_data_size=4394468 diff=0
20:48:12:WU01:FS00:0xa3:- Digital signature verified
20:48:12:WU01:FS00:0xa3:
20:48:12:WU01:FS00:0xa3:Project: 8578 (Run 1, Clone 9, Gen 397)
20:48:12:WU01:FS00:0xa3:
20:48:12:WU01:FS00:0xa3:Assembly optimizations on if available.
20:48:12:WU01:FS00:0xa3:Entering M.D.
20:48:18:WU01:FS00:0xa3:Mapping NT from 4 to 4 
20:48:19:WU01:FS00:0xa3:Completed 0 out of 500000 steps  (0%)
SNIP
21:06:54:WU01:FS00:0xa3:Completed 5000 out of 500000 steps  (1%)
21:25:21:WU01:FS00:0xa3:Completed 10000 out of 500000 steps  (2%)
21:43:50:WU01:FS00:0xa3:Completed 15000 out of 500000 steps  (3%)

*********************** Log Started 2014-06-01T22:43:36Z ***********************
22:43:36:************************* Folding@home Client *************************
22:43:36:      Website: http://folding.stanford.edu/
22:43:36:    Copyright: (c) 2009-2014 Stanford University
22:43:36:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
22:43:36:         Args: 
22:43:36:       Config: C:/ProgramData/FAHClient/config.xml
22:43:36:******************************** Build ********************************
22:43:36:      Version: 7.4.4
22:43:36:         Date: Mar 4 2014
22:43:36:         Time: 20:26:54
22:43:36:      SVN Rev: 4130
22:43:36:       Branch: fah/trunk/client
22:43:36:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
22:43:36:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
22:43:36:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
22:43:36:     Platform: win32 XP
22:43:36:         Bits: 32
22:43:36:         Mode: Release
22:43:36:******************************* System ********************************
22:43:36:          CPU: Intel(R) Core(TM) i3-3240 CPU @ 3.40GHz
22:43:36:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
22:43:36:         CPUs: 4
22:43:36:       Memory: 2.74GiB
22:43:36:  Free Memory: 2.05GiB
22:43:36:      Threads: WINDOWS_THREADS
22:43:36:   OS Version: 6.1
22:43:36:  Has Battery: false
22:43:36:   On Battery: false
22:43:36:   UTC Offset: 3
22:43:36:          PID: 1936
22:43:36:          CWD: C:/Windows/system32
22:43:36:           OS: Windows 7 Ultimate
22:43:36:      OS Arch: AMD64
22:43:36:         GPUs: 0
22:43:36:         CUDA: Not detected
22:43:36:Win32 Service: true
22:43:36:***********************************************************************
22:43:36:<config>
22:43:36:  <!-- HTTP Server -->
22:43:36:  <allow v='127.0.0.1, 192.168.1.1-192.168.1.10'/>
22:43:36:
22:43:36:  <!-- Network -->
22:43:36:  <proxy v=':8080'/>
22:43:36:
22:43:36:  <!-- Remote Command Server -->
22:43:36:  <password v='*********'/>
22:43:36:
22:43:36:  <!-- Slot Control -->
22:43:36:  <power v='full'/>
22:43:36:
22:43:36:  <!-- User Information -->
22:43:36:  <passkey v='********************************'/>
22:43:36:  <team v='69411'/>
22:43:36:  <user v='PantherX'/>
22:43:36:
22:43:36:  <!-- Folding Slots -->
22:43:36:  <slot id='0' type='CPU'>
22:43:36:    <cpus v='4'/>
22:43:36:    <next-unit-percentage v='100'/>
22:43:36:    <pause-on-start v='true'/>
22:43:36:  </slot>
22:43:36:</config>
22:43:36:Trying to access database...
22:43:36:Successfully acquired database lock
22:43:36:Enabled folding slot 00: PAUSED cpu:4 (by user)
22:57:49:Saving configuration to config.xml
22:57:49:<config>
22:57:49:  <!-- HTTP Server -->
22:57:49:  <allow v='127.0.0.1, 192.168.1.1-192.168.1.10'/>
22:57:49:
22:57:49:  <!-- Network -->
22:57:49:  <proxy v=':8080'/>
22:57:49:
22:57:49:  <!-- Remote Command Server -->
22:57:49:  <password v='*********'/>
22:57:49:
22:57:49:  <!-- Slot Control -->
22:57:49:  <power v='full'/>
22:57:49:
22:57:49:  <!-- User Information -->
22:57:49:  <passkey v='********************************'/>
22:57:49:  <team v='69411'/>
22:57:49:  <user v='PantherX'/>
22:57:49:
22:57:49:  <!-- Folding Slots -->
22:57:49:  <slot id='0' type='CPU'>
22:57:49:    <cpus v='4'/>
22:57:49:    <next-unit-percentage v='100'/>
22:57:49:  </slot>
22:57:49:</config>
22:57:50:Saving configuration to config.xml
22:57:50:<config>
22:57:50:  <!-- HTTP Server -->
22:57:50:  <allow v='127.0.0.1, 192.168.1.1-192.168.1.10'/>
22:57:50:
22:57:50:  <!-- Network -->
22:57:50:  <proxy v=':8080'/>
22:57:50:
22:57:50:  <!-- Remote Command Server -->
22:57:50:  <password v='*********'/>
22:57:50:
22:57:50:  <!-- Slot Control -->
22:57:50:  <power v='full'/>
22:57:50:
22:57:50:  <!-- User Information -->
22:57:50:  <passkey v='********************************'/>
22:57:50:  <team v='69411'/>
22:57:50:  <user v='PantherX'/>
22:57:50:
22:57:50:  <!-- Folding Slots -->
22:57:50:  <slot id='0' type='CPU'>
22:57:50:    <cpus v='4'/>
22:57:50:    <next-unit-percentage v='100'/>
22:57:50:  </slot>
22:57:50:</config>
22:57:51:FS00:Unpaused
22:57:51:WU01:FS00:Starting
22:57:51:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 01 -suffix 01 -version 704 -lifeline 1936 -checkpoint 15 -np 4 -service
22:57:52:WU01:FS00:Started FahCore on PID 2528
22:57:52:WU01:FS00:Core PID:2584
22:57:52:WU01:FS00:FahCore 0xa3 started
22:57:52:WU01:FS00:0xa3:
22:57:52:WU01:FS00:0xa3:*------------------------------*
22:57:52:WU01:FS00:0xa3:Folding@Home Gromacs SMP Core
22:57:52:WU01:FS00:0xa3:Version 2.27 (Dec. 15, 2010)
22:57:52:WU01:FS00:0xa3:
22:57:52:WU01:FS00:0xa3:Preparing to commence simulation
22:57:52:WU01:FS00:0xa3:- Ensuring status. Please wait.
22:58:01:WU01:FS00:0xa3:- Looking at optimizations...
22:58:01:WU01:FS00:0xa3:- Working with standard loops on this execution.
22:58:01:WU01:FS00:0xa3:- Previous termination of core was improper.
22:58:01:WU01:FS00:0xa3:- Files status OK
22:58:02:WU01:FS00:0xa3:- Expanded 3851534 -> 4394468 (decompressed 114.0 percent)
22:58:02:WU01:FS00:0xa3:Called DecompressByteArray: compressed_data_size=3851534 data_size=4394468, decompressed_data_size=4394468 diff=0
22:58:02:WU01:FS00:0xa3:- Digital signature verified
22:58:02:WU01:FS00:0xa3:
22:58:02:WU01:FS00:0xa3:Project: 8578 (Run 1, Clone 9, Gen 397)
22:58:02:WU01:FS00:0xa3:
22:58:02:WU01:FS00:0xa3:Entering M.D.
22:58:08:WU01:FS00:0xa3:Using Gromacs checkpoints
22:58:09:WU01:FS00:0xa3:Mapping NT from 4 to 4 
22:58:09:WU01:FS00:0xa3:Resuming from checkpoint
22:58:09:WU01:FS00:0xa3:Verified 01/wudata_01.log
22:58:10:WU01:FS00:0xa3:Verified 01/wudata_01.trr
22:58:10:WU01:FS00:0xa3:Verified 01/wudata_01.edr
22:58:10:WU01:FS00:0xa3:Completed 16215 out of 500000 steps  (3%)
23:12:05:WU01:FS00:0xa3:Completed 20000 out of 500000 steps  (4%)
23:31:18:WU01:FS00:0xa3:Completed 25000 out of 500000 steps  (5%)

Re: Core 17 has suddenly started crashing

Posted: Mon Jun 23, 2014 4:18 pm
by bruce
Eagle wrote:Maybe the Pande Group can compare dump peaks to Microsoft patch days?
Windows Update still has settings for "Check for updates but let me choose whether to download and install them" as well as "Download updates but let me choose whether to install them". Although Microsoft nags you that your settings are "wrong" (in their opinion!), they work fine. You're in charge of your computer, not MS.

Re: Core 17 has suddenly started crashing

Posted: Mon Jun 23, 2014 10:00 pm
by Eagle
@ Bruce: I'm sorry, but you seem to misunderstand each and every attempt, so I give up on any further explanations.

@ PantherX: that's a good point regarding 8/8.1. ;)
However, I do notice that within your reboot log there's no core shutdown message as before?! It's much like it was in my scenario. Also, you start the slots paused, and maybe that prevents you from losing the lifeline?
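To make the "is there a core shutdown message?" check repeatable, here is a minimal Python sketch that scans a log for it. The marker text is an assumption taken from the core output format that shows up in logs posted in this thread ("Folding@home Core Shutdown: INTERRUPTED"); the function name and the sample lines are only for illustration, not part of FAHClient.

```python
import re

# Assumed marker text, copied from core output seen in this thread's logs.
SHUTDOWN_RE = re.compile(r"Folding@home Core Shutdown: (\w+)")


def find_core_shutdowns(log_text):
    """Return (timestamp, status) pairs for every core shutdown line."""
    results = []
    for line in log_text.splitlines():
        match = SHUTDOWN_RE.search(line)
        if match:
            # Log lines in this thread start with an HH:MM:SS timestamp.
            results.append((line[:8], match.group(1)))
    return results


sample = """21:12:16:WU01:FS00:0xa4:Completed 355000 out of 500000 steps  (71%)
21:12:27:WARNING:Console control signal 6 on PID 1720
21:12:27:Exiting, please wait. . .
21:12:27:WU01:FS00:0xa4:Folding@home Core Shutdown: INTERRUPTED
21:12:28:Clean exit"""

print(find_core_shutdowns(sample))  # [('21:12:27', 'INTERRUPTED')]
```

An empty list for the part of the log just before a reboot would mean the core never reported a shutdown there, which is the situation described above.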

Re: Core 17 has suddenly started crashing

Posted: Tue Jun 24, 2014 2:17 am
by PantherX
Eagle wrote:...However, I do notice that within your reboot log there's no core shutdown message as before?! It's much like it was in my scenario. Also, you start the slots paused, and maybe that prevents you from losing the lifeline?
Hmm, that's a good point. I will simply reboot the system and see how that goes once I ensure that there aren't any pause flags set. Since the system uses a service installation, I set pause-on-start, reboot, install the updates, reboot, remove pause-on-start, reboot again, and then leave the system alone for the next ~30 days.
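For checking that no pause flag is left behind, a small Python sketch along these lines could be used. It assumes the flag appears as a pause-on-start element inside a slot, as in the config dumps printed in the logs in this thread; the function name is made up, and the sample config is a trimmed copy of such a dump rather than a complete config.xml.

```python
import xml.etree.ElementTree as ET


def slots_paused_on_start(config_xml):
    """Return the ids of slots that have <pause-on-start v='true'/> set."""
    root = ET.fromstring(config_xml)
    paused = []
    for slot in root.findall("slot"):
        flag = slot.find("pause-on-start")
        if flag is not None and flag.get("v", "").lower() == "true":
            paused.append(slot.get("id"))
    return paused


# Trimmed sample in the same shape as the config dumps in the logs.
config = """<config>
  <power v='full'/>
  <slot id='0' type='CPU'>
    <cpus v='4'/>
    <next-unit-percentage v='100'/>
    <pause-on-start v='true'/>
  </slot>
</config>"""

print(slots_paused_on_start(config))  # ['0']
```

An empty result means no slot in that config is set to start paused, so a reboot test would exercise a running core.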

Re: Core 17 has suddenly started crashing

Posted: Sat Jun 28, 2014 9:38 pm
by PantherX
Okay, I finally got around to rebooting the system and here are the logs:
Before reboot:

Code: Select all

15:27:41:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 1720 -checkpoint 15 -np 4 -service
15:27:41:WU01:FS00:Started FahCore on PID 2152
15:27:41:WU01:FS00:Core PID:5364
15:27:41:WU01:FS00:FahCore 0xa4 started
15:27:41:WU01:FS00:0xa4:
15:27:41:WU01:FS00:0xa4:*------------------------------*
15:27:41:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
15:27:41:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
15:27:41:WU01:FS00:0xa4:
15:27:41:WU01:FS00:0xa4:Preparing to commence simulation
15:27:41:WU01:FS00:0xa4:- Looking at optimizations...
15:27:41:WU01:FS00:0xa4:- Created dyn
15:27:41:WU01:FS00:0xa4:- Files status OK
15:27:41:WU01:FS00:0xa4:- Expanded 979409 -> 2731536 (decompressed 278.8 percent)
15:27:41:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=979409 data_size=2731536, decompressed_data_size=2731536 diff=0
15:27:41:WU01:FS00:0xa4:- Digital signature verified
15:27:41:WU01:FS00:0xa4:
15:27:41:WU01:FS00:0xa4:Project: 10179 (Run 332, Clone 2, Gen 0)
15:27:41:WU01:FS00:0xa4:
15:27:41:WU01:FS00:0xa4:Assembly optimizations on if available.
15:27:41:WU01:FS00:0xa4:Entering M.D.
15:27:47:WU01:FS00:0xa4:Mapping NT from 4 to 4 
15:27:47:WU01:FS00:0xa4:Completed 0 out of 500000 steps  (0%)
SNIP
21:02:32:WU01:FS00:0xa4:Completed 345000 out of 500000 steps  (69%)
21:07:23:WU01:FS00:0xa4:Completed 350000 out of 500000 steps  (70%)
21:12:16:WU01:FS00:0xa4:Completed 355000 out of 500000 steps  (71%)
21:12:27:WARNING:Console control signal 6 on PID 1720
21:12:27:Exiting, please wait. . .
21:12:27:WU01:FS00:0xa4:Received EVENT= from OS -- shutting down w/ INTERRUPTED signal; events=[CtrlC= Close= Break= Shutdown= LogOff=]
21:12:27:WU01:FS00:0xa4:Folding@home Core Shutdown: INTERRUPTED
21:12:27:Service shutdown requested
21:12:28:Clean exit
After reboot:

Code: Select all

*********************** Log Started 2014-06-28T21:13:07Z ***********************
21:13:07:************************* Folding@home Client *************************
21:13:07:      Website: http://folding.stanford.edu/
21:13:07:    Copyright: (c) 2009-2014 Stanford University
21:13:07:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:13:07:         Args: 
21:13:07:       Config: C:/ProgramData/FAHClient/config.xml
21:13:07:******************************** Build ********************************
21:13:07:      Version: 7.4.4
21:13:07:         Date: Mar 4 2014
21:13:07:         Time: 20:26:54
21:13:07:      SVN Rev: 4130
21:13:07:       Branch: fah/trunk/client
21:13:07:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
21:13:07:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
21:13:07:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
21:13:07:     Platform: win32 XP
21:13:07:         Bits: 32
21:13:07:         Mode: Release
21:13:07:******************************* System ********************************
21:13:07:          CPU: Intel(R) Core(TM) i3-3240 CPU @ 3.40GHz
21:13:07:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
21:13:07:         CPUs: 4
21:13:07:       Memory: 2.74GiB
21:13:07:  Free Memory: 2.05GiB
21:13:07:      Threads: WINDOWS_THREADS
21:13:07:   OS Version: 6.1
21:13:07:  Has Battery: false
21:13:07:   On Battery: false
21:13:07:   UTC Offset: 3
21:13:07:          PID: 1828
21:13:07:          CWD: C:/Windows/system32
21:13:07:           OS: Windows 7 Ultimate
21:13:07:      OS Arch: AMD64
21:13:07:         GPUs: 0
21:13:07:         CUDA: Not detected
21:13:07:Win32 Service: true
21:13:07:***********************************************************************
21:13:07:<config>
21:13:07:  <!-- HTTP Server -->
21:13:07:  <allow v='127.0.0.1, 192.168.1.1-192.168.1.10'/>
21:13:07:
21:13:07:  <!-- Network -->
21:13:07:  <proxy v=':8080'/>
21:13:07:
21:13:07:  <!-- Remote Command Server -->
21:13:07:  <password v='*********'/>
21:13:07:
21:13:07:  <!-- Slot Control -->
21:13:07:  <power v='full'/>
21:13:07:
21:13:07:  <!-- User Information -->
21:13:07:  <passkey v='********************************'/>
21:13:07:  <team v='69411'/>
21:13:07:  <user v='PantherX'/>
21:13:07:
21:13:07:  <!-- Folding Slots -->
21:13:07:  <slot id='0' type='CPU'>
21:13:07:    <cpus v='4'/>
21:13:07:    <next-unit-percentage v='100'/>
21:13:07:  </slot>
21:13:07:</config>
21:13:07:Trying to access database...
21:13:07:Successfully acquired database lock
21:13:07:Enabled folding slot 00: READY cpu:4
21:13:07:WU01:FS00:Starting
21:13:07:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 1828 -checkpoint 15 -np 4 -service
21:13:07:WU01:FS00:Started FahCore on PID 1396
21:13:09:WU01:FS00:Core PID:2100
21:13:09:WU01:FS00:FahCore 0xa4 started
21:13:11:WU01:FS00:0xa4:
21:13:11:WU01:FS00:0xa4:*------------------------------*
21:13:11:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
21:13:11:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:13:11:WU01:FS00:0xa4:
21:13:11:WU01:FS00:0xa4:Preparing to commence simulation
21:13:11:WU01:FS00:0xa4:- Looking at optimizations...
21:13:11:WU01:FS00:0xa4:- Files status OK
21:13:11:WU01:FS00:0xa4:- Expanded 979409 -> 2731536 (decompressed 278.8 percent)
21:13:11:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=979409 data_size=2731536, decompressed_data_size=2731536 diff=0
21:13:11:WU01:FS00:0xa4:- Digital signature verified
21:13:11:WU01:FS00:0xa4:
21:13:11:WU01:FS00:0xa4:Project: 10179 (Run 332, Clone 2, Gen 0)
21:13:11:WU01:FS00:0xa4:
21:13:11:WU01:FS00:0xa4:Assembly optimizations on if available.
21:13:11:WU01:FS00:0xa4:Entering M.D.
21:13:17:WU01:FS00:0xa4:Using Gromacs checkpoints
21:13:17:WU01:FS00:0xa4:Mapping NT from 4 to 4 
21:13:17:WU01:FS00:0xa4:Resuming from checkpoint
21:13:18:WU01:FS00:0xa4:Verified 01/wudata_01.log
21:13:18:WU01:FS00:0xa4:Verified 01/wudata_01.trr
21:13:18:WU01:FS00:0xa4:Verified 01/wudata_01.xtc
21:13:18:WU01:FS00:0xa4:Verified 01/wudata_01.edr
21:13:18:WU01:FS00:0xa4:Completed 340120 out of 500000 steps  (68%)
21:18:39:WU01:FS00:0xa4:Completed 345000 out of 500000 steps  (69%)
21:23:30:WU01:FS00:0xa4:Completed 350000 out of 500000 steps  (70%)
21:28:42:WU01:FS00:0xa4:Completed 355000 out of 500000 steps  (71%)
21:33:29:WU01:FS00:0xa4:Completed 360000 out of 500000 steps  (72%)
Not sure why the core shutdown message wasn't logged previously.
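Two things can be read off the logs above with a short script: whether the -lifeline argument passed to the core matches the client PID printed in the log header, and how many steps had to be redone after resuming from the checkpoint. The Python below is only a sketch under the assumption that the log lines look exactly like the ones quoted here; the function names are made up for illustration.

```python
import re

STEP_RE = re.compile(r"Completed (\d+) out of (\d+) steps")
PID_RE = re.compile(r"^\d{2}:\d{2}:\d{2}:\s+PID: (\d+)\s*$", re.MULTILINE)
LIFELINE_RE = re.compile(r"-lifeline (\d+)")


def lifeline_matches_client(log_text):
    """True if the -lifeline value equals the client PID from the header."""
    pid = PID_RE.search(log_text)
    lifeline = LIFELINE_RE.search(log_text)
    if pid is None or lifeline is None:
        return None  # not enough information in this excerpt
    return pid.group(1) == lifeline.group(1)


def steps_redone(before_log, after_log):
    """Last step reported before the reboot minus the first step after it."""
    before = [int(m.group(1)) for m in STEP_RE.finditer(before_log)]
    after = [int(m.group(1)) for m in STEP_RE.finditer(after_log)]
    if not before or not after:
        return None
    return before[-1] - after[0]


# Excerpts copied from the logs above (raw string because of the backslashes).
before_log = r'''21:07:23:WU01:FS00:0xa4:Completed 350000 out of 500000 steps  (70%)
21:12:16:WU01:FS00:0xa4:Completed 355000 out of 500000 steps  (71%)
21:12:27:WU01:FS00:0xa4:Folding@home Core Shutdown: INTERRUPTED'''

after_log = r'''21:13:07:          PID: 1828
21:13:07:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 1828 -checkpoint 15 -np 4 -service
21:13:07:WU01:FS00:Started FahCore on PID 1396
21:13:09:WU01:FS00:Core PID:2100
21:13:18:WU01:FS00:0xa4:Completed 340120 out of 500000 steps  (68%)
21:18:39:WU01:FS00:0xa4:Completed 345000 out of 500000 steps  (69%)'''

print(lifeline_matches_client(after_log))   # True
print(steps_redone(before_log, after_log))  # 14880
```

For this reboot the core was started with the client's own PID as its lifeline, and the resume point (340120) is 14880 steps behind the last progress line before shutdown (355000), so only the work since the last checkpoint was repeated.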