Client/core dumping GPU WU after pause/reboot

Moderators: Site Moderators, FAHC Science Team

Post Reply
belloq
Posts: 48
Joined: Thu Sep 24, 2020 12:58 pm

Client/core dumping GPU WU after pause/reboot

Post by belloq »

I didn't see this in the bugs list, so posting here to see if anyone else has experienced this. I have noticed that my computer production has started going down lately, and part of that might be related to what I've found. Upon shutdown and boot (laptop moving around), I would check the status of the folding client, and found that quite often it was starting up a fresh WU. At first I thought it was coincidence, but recently it seems to be somewhat repeatable.

So I started a new process where I would gracefully Pause the WU. Then shutdown. Boot back up. Unpause folding, and it again would dump the WU and download another one. Last night the one I had paused at >85% so I lost a good amount of folding time.

Is this something that is a known issue or been seen by anyone else? Does it look like a bug? Or is this just how things go?

Client is Win11, i7-12800H, NVIDIA RTX A1000 Laptop GPU, CUDA supported Compute 8.6 Driver 12.6.

In the first log snippet, you can see that I paused the machine, but then it also writes that there appears to be a crash.

Code: Select all

*********************** Log Started 2025-02-24T23:59:58Z ***********************
00:05:12:I1:Machine state pause
00:05:12:I1:WU337:WARNING:Console control signal 1 on PID 75724
00:05:12:I1:WU337:Exiting, please wait. . .
00:05:13:I1:WU337:Folding@home Core Shutdown: INTERRUPTED
00:05:55:I1:WU337:Core returned INTERRUPTED (102)
01:14:10:I1:Account websocket closed: PROTOCOL msg=Failed to read header start
01:14:10:I1:OUT86:> GET https://api.foldingathome.org/machine/0QARVvoMtxjXEYOqpfdMtYIOM3e_z6-d9PVbIJ-9FVs HTTP/1.1
01:14:12:I1:OUT86:< HTTP/1.1 200 HTTP_OK
01:14:12:I1:OUT5:> GET wss://node1.foldingathome.org/ws/client HTTP/1.1
01:14:13:I1:OUT5:< HTTP/1.1 101 HTTP_SWITCHING_PROTOCOLS
01:14:13:I1:Logging into node account
01:15:32:I1:Machine state fold
01:15:33:I3:Running FahCore: C:\ProgramData\FAHClient\cores/openmm-core-24/windows-10-64bit/release/fahcore-24-windows-10-64bit-release-8.1.4/FahCore_24.exe -dir ggAuesWUbrttSijsH1rPwm1weIke7zd01rdMlD24jkE -suffix 01 -version 8.4.9 -lifeline 30432 -gpu-uuid 86ddd4a4-1b26-9e6d-258c-f20d90412433 -gpu-platform cuda -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-platform 0 -cuda-device 0 -gpu 0
01:15:33:I3:WU337:Started FahCore on PID 68296
01:15:33:I1:WU337:*********************** Log Started 2025-02-25T01:15:33Z ***********************
01:15:33:I1:WU337:*************************** Core24 Folding@home Core ***************************
01:15:33:I1:WU337:       Core: Core24
01:15:33:I1:WU337:       Type: 0x24
01:15:33:I1:WU337:    Version: 8.1.4
01:15:33:I1:WU337:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
01:15:33:I1:WU337:  Copyright: 2022 foldingathome.org
01:15:33:I1:WU337:   Homepage: https://foldingathome.org/
01:15:33:I1:WU337:       Date: Jul 25 2024
01:15:33:I1:WU337:       Time: 05:42:49
01:15:33:I1:WU337:   Revision: cf9f0139862b8945a2091772770e4631aac37792
01:15:33:I1:WU337:     Branch: HEAD
01:15:33:I1:WU337:   Compiler: Visual C++
01:15:33:I1:WU337:    Options: $( /TP $) /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2
01:15:33:I1:WU337:             /Zc:throwingNew /MT -DOPENMM_VERSION="\"8.1.1\"" /Ox /std:c++14
01:15:33:I1:WU337:   Platform: win32 10
01:15:33:I1:WU337:       Bits: 64
01:15:33:I1:WU337:       Mode: Release
01:15:33:I1:WU337:Maintainers: John Chodera <john.chodera@choderalab.org> and Peter Eastman
01:15:33:I1:WU337:             <peastman@stanford.edu>
01:15:33:I1:WU337:       Args: -dir ggAuesWUbrttSijsH1rPwm1weIke7zd01rdMlD24jkE -suffix 01
01:15:33:I1:WU337:             -version 8.4.9 -lifeline 30432 -gpu-uuid
01:15:33:I1:WU337:             86ddd4a4-1b26-9e6d-258c-f20d90412433 -gpu-platform cuda -gpu-vendor
01:15:33:I1:WU337:             nvidia -opencl-platform 0 -opencl-device 0 -cuda-platform 0
01:15:33:I1:WU337:             -cuda-device 0 -gpu 0
01:15:33:I1:WU337:************************************ libFAH ************************************
01:15:33:I1:WU337:       Date: Jul 25 2024
01:15:33:I1:WU337:       Time: 05:23:50
01:15:33:I1:WU337:   Revision: c7d2824a47eb025fa8cda8968c7a5e971585d90c
01:15:33:I1:WU337:     Branch: HEAD
01:15:33:I1:WU337:   Compiler: Visual C++
01:15:33:I1:WU337:    Options: $( /TP $) /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:15:33:I1:WU337:   Platform: win32 10
01:15:33:I1:WU337:       Bits: 64
01:15:33:I1:WU337:       Mode: Release
01:15:33:I1:WU337:************************************ CBang *************************************
01:15:33:I1:WU337:    Version: 1.7.2
01:15:33:I1:WU337:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
01:15:33:I1:WU337:        Org: Cauldron Development LLC
01:15:33:I1:WU337:  Copyright: Cauldron Development LLC, 2003-2024
01:15:33:I1:WU337:   Homepage: https://cauldrondevelopment.com/
01:15:33:I1:WU337:    License: LGPL-2.1-or-later
01:15:33:I1:WU337:       Date: Jul 25 2024
01:15:33:I1:WU337:       Time: 05:22:43
01:15:33:I1:WU337:   Revision: f1cd4c791e8c40a35dcfeab3ab85d910949cc0cb
01:15:33:I1:WU337:     Branch: HEAD
01:15:33:I1:WU337:   Compiler: Visual C++
01:15:33:I1:WU337:    Options: $( /TP $) /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:15:33:I1:WU337:   Platform: win32 10
01:15:33:I1:WU337:       Bits: 64
01:15:33:I1:WU337:       Mode: Release
01:15:33:I1:WU337:************************************ System ************************************
01:15:33:I1:WU337:        CPU: 12th Gen Intel(R) Core(TM) i7-12800H
01:15:33:I1:WU337:     CPU ID: GenuineIntel Family 6 Model 154 Stepping 3
01:15:33:I1:WU337:       CPUs: 20
01:15:33:I1:WU337:     Memory: 31.68GiB
01:15:33:I1:WU337:Free Memory: 11.89GiB
01:15:33:I1:WU337: OS Version: 10.0
01:15:33:I1:WU337:Has Battery: true
01:15:33:I1:WU337: On Battery: false
01:15:33:I1:WU337:   Hostname: DL073RV3F97BC9
01:15:33:I1:WU337: UTC Offset: -8
01:15:33:I1:WU337:        PID: 68296
01:15:33:I1:WU337:        CWD: C:\ProgramData\FAHClient\work
01:15:33:I1:WU337:       Exec: C:\ProgramData\FAHClient\cores\openmm-core-24\windows-10-64bit\release\fahcore-24-windows-10-64bit-release-8.1.4\FahCore_24.exe
01:15:33:I1:WU337:************************************ OpenMM ************************************
01:15:33:I1:WU337:    Version: 8.1.1
01:15:33:I1:WU337:********************************************************************************
01:15:33:I1:WU337:Project: 18251 (Run 227, Clone 4, Gen 56)
01:15:33:I1:WU337:Digital signatures verified
01:15:33:I1:WU337:Folding@home GPU Core24 Folding@home Core
01:15:33:I1:WU337:Version 8.1.4
01:15:33:I1:WU337:  Checkpoint write interval: 12500 steps (5%) [20 total]
01:15:33:I1:WU337:  JSON viewer frame write interval: 2500 steps (1%) [100 total]
01:15:33:I1:WU337:  XTC frame write interval: 5000 steps (2%) [50 total]
01:15:33:I1:WU337:  TRR frame write interval: disabled
01:15:33:I1:WU337:  Global context and integrator variables write interval: disabled
01:15:33:I1:WU337:There are 4 platforms available.
01:15:33:I1:WU337:Platform 0: Reference
01:15:33:I1:WU337:Platform 1: CPU
01:15:33:I1:WU337:Platform 2: OpenCL
01:15:33:I1:WU337:  opencl-device 0 specified
01:15:33:I1:WU337:Platform 3: CUDA
01:15:33:I1:WU337:  cuda-device 0 specified
01:21:51:I1:WU337:Attempting to create CUDA context:
01:21:51:I1:WU337:  Configuring platform CUDA
01:25:42:I1:WU337:  Using CUDA on CUDA Platform and gpu 0
01:25:42:I1:WU337:  GPU info: Platform: CUDA
01:25:42:I1:WU337:  GPU info: PlatformIndex: 0
01:25:42:I1:WU337:  GPU info: Device: NVIDIA RTX A1000 Laptop GPU
01:25:42:I1:WU337:  GPU info: DeviceIndex: 0
01:25:42:I1:WU337:  GPU info: Vendor: 0x10de
01:25:42:I1:WU337:  GPU info: PCI: 01:00:00
01:25:42:I1:WU337:  GPU info: Compute: 8.6
01:25:42:I1:WU337:  GPU info: Driver: 12.6
01:25:42:I1:WU337:  GPU info: GPU: true
01:25:46:I1:WU337:Completed 150000 out of 250000 steps (60%)
01:35:20:I1:WU337:Completed 152500 out of 250000 steps (61%)
01:44:52:I1:WU337:Completed 155000 out of 250000 steps (62%)
01:54:20:I1:WU337:Completed 157500 out of 250000 steps (63%)
02:11:44:I1:WU337:Completed 160000 out of 250000 steps (64%)
02:24:50:I1:WU337:Completed 162500 out of 250000 steps (65%)
02:26:22:I1:WU337:Checkpoint completed at step 162500
02:40:48:I1:WU337:Completed 165000 out of 250000 steps (66%)
02:53:51:I1:WU337:Completed 167500 out of 250000 steps (67%)
03:03:02:I1:WU337:Completed 170000 out of 250000 steps (68%)
03:12:13:I1:WU337:Completed 172500 out of 250000 steps (69%)
03:26:32:I1:WU337:Completed 175000 out of 250000 steps (70%)
03:28:05:I1:WU337:Checkpoint completed at step 175000
03:37:16:I1:WU337:Completed 177500 out of 250000 steps (71%)
03:46:34:I1:WU337:Completed 180000 out of 250000 steps (72%)
03:55:54:I1:WU337:Completed 182500 out of 250000 steps (73%)
04:09:30:I1:WU337:Completed 185000 out of 250000 steps (74%)
04:27:18:I1:WU337:Completed 187500 out of 250000 steps (75%)
04:28:47:I1:WU337:Checkpoint completed at step 187500
04:37:56:I1:WU337:Completed 190000 out of 250000 steps (76%)
04:47:11:I1:WU337:Completed 192500 out of 250000 steps (77%)
04:56:24:I1:WU337:Completed 195000 out of 250000 steps (78%)
05:05:45:I1:WU337:Completed 197500 out of 250000 steps (79%)
05:15:18:I1:WU337:Completed 200000 out of 250000 steps (80%)
05:16:55:I1:WU337:Checkpoint completed at step 200000
05:42:27:I1:WU337:Completed 202500 out of 250000 steps (81%)
06:15:35:I1:WU337:Completed 205000 out of 250000 steps (82%)
06:48:46:I1:WU337:Completed 207500 out of 250000 steps (83%)
07:14:40:I1:WU337:Completed 210000 out of 250000 steps (84%)
07:25:12:I1:WU337:Completed 212500 out of 250000 steps (85%)
07:26:47:I1:WU337:Checkpoint completed at step 212500
07:30:19:I1:Machine state pause
07:30:19:I1:WU337:WARNING:Console control signal 1 on PID 68296
07:30:19:I1:WU337:Exiting, please wait. . .
07:30:20:I1:WU337:Folding@home Core Shutdown: INTERRUPTED
07:30:36:E :WU337:Core returned UNKNOWN_ENUM (1073807364)
07:30:36:E :WU337:Core exited with an unknown error code 1073807364 which probably indicates that it crashed. Dumping WU
07:30:36:I1:WU337:Sending dump report
07:30:36:I1:OUT91:> POST https://highland3.seas.upenn.edu/api/results HTTP/1.1
It shows that I paused, but then something about crashing. Then the new log starts up after booting up which shows it goes directly to dumping.

Code: Select all

*********************** Log Started 2025-02-25T07:33:22Z ***********************
07:33:22:I1:*********************** Folding@home Client ***********************
07:33:22:I1:    Version: 8.4.9
07:33:22:I1:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
07:33:22:I1:        Org: foldingathome.org
07:33:22:I1:  Copyright: 2023-2024, foldingathome.org
07:33:22:I1:   Homepage: https://foldingathome.org/
07:33:22:I1:    License: GPL-3.0-or-later
07:33:22:I1:        URL: https://v8-4.foldingathome.org/
07:33:22:I1:       Date: Nov 20 2024
07:33:22:I1:       Time: 14:47:46
07:33:22:I1:   Revision: 360fe71b1bd05bb89814bfb97b73a5bda84802d6
07:33:22:I1:     Branch: master
07:33:22:I1:   Compiler: Visual C++
07:33:22:I1:    Options: $( /TP $) /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2
07:33:22:I1:             /Zc:throwingNew /MT
07:33:22:I1:   Platform: win32 10
07:33:22:I1:       Bits: 64
07:33:22:I1:       Mode: Release
07:33:22:I1:     Config: C:\ProgramData\FAHClient\config.xml
07:33:22:I1:****************************** CBang ******************************
07:33:22:I1:    Version: 1.7.2
07:33:22:I1:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
07:33:22:I1:        Org: Cauldron Development
07:33:22:I1:  Copyright: Cauldron Development, 2003-2024
07:33:22:I1:   Homepage: https://cauldrondevelopment.com/
07:33:22:I1:    License: LGPL-2.1-or-later
07:33:22:I1:       Date: Nov 20 2024
07:33:22:I1:       Time: 09:04:40
07:33:22:I1:   Revision: 443c54e909eb8d8994405a18fb328b5b05a623a5
07:33:22:I1:     Branch: master
07:33:22:I1:   Compiler: Visual C++
07:33:22:I1:    Options: $( /TP $) /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2
07:33:22:I1:             /Zc:throwingNew /MT
07:33:22:I1:   Platform: win32 10
07:33:22:I1:       Bits: 64
07:33:22:I1:       Mode: Release
07:33:22:I1:***************************** System ******************************
07:33:22:I1:        CPU: 12th Gen Intel(R) Core(TM) i7-12800H
07:33:22:I1:     CPU ID: GenuineIntel Family 6 Model 154 Stepping 3
07:33:22:I1:       CPUs: 20
07:33:22:I1:     Memory: 31.68GiB
07:33:22:I1:Free Memory: 19.89GiB
07:33:22:I1: OS Version: 6.2
07:33:22:I1:Has Battery: true
07:33:22:I1: On Battery: false
07:33:22:I1:   Hostname: DL073RV3F97BC9
07:33:22:I1: UTC Offset: -8
07:33:22:I1:        PID: 22228
07:33:22:I1:        CWD: C:\ProgramData\FAHClient
07:33:22:I1:       Exec: C:\Program Files\FAHClient\FAHClient.exe
07:33:22:I1:*******************************************************************
07:33:22:I2:<config>
07:33:22:I2:  <!-- Resource Settings -->
07:33:22:I2:  <cpus v='12'/>
07:33:22:I2:
07:33:22:I2:  <!-- User Information -->
07:33:22:I2:  <passkey v='*****'/>
07:33:22:I2:  <team v='259998'/>
07:33:22:I2:  <user v='belloq'/>
07:33:22:I2:</config>
07:33:22:I1:Opening Database
07:33:22:I1:F@H ID = 0QARVvoMtxjXEYOqpfdMtYIOM3e_z6-d9PVbIJ-9FVs
07:33:22:I3:Loading default resource group
07:33:22:I1:Listening for HTTP on 127.0.0.1:7396
07:33:22:I3:WU337:Loading work unit 337 with ID ggAuesWUbrttSijsH1rPwm1weIke7zd01rdMlD24jkE
07:33:22:I3:Loaded 1 wus.
07:33:22:I1:Started Windows systray control
07:33:23:I3:gpus = {
07:33:23:I3:  "gpu:00:02:00": {
07:33:23:I3:    "vendor": 32902,
07:33:23:I3:    "type": "intel",
07:33:23:I3:    "description": "Intel(R) Iris(R) Xe Graphics",
07:33:23:I3:    "uuid": "8680a646-0c00-0000-0002-000000000000",
07:33:23:I3:    "opencl": {"platform": 1, "device": 0, "compute": "3.0", "driver": "32.0"},
07:33:23:I3:    "device": 18086,
07:33:23:I3:    "supported": false
07:33:23:I3:  },
07:33:23:I3:  "gpu:01:00:00": {
07:33:23:I3:    "vendor": 4318,
07:33:23:I3:    "type": "nvidia",
07:33:23:I3:    "description": "NVIDIA RTX A1000 Laptop GPU",
07:33:23:I3:    "uuid": "86ddd4a4-1b26-9e6d-258c-f20d90412433",
07:33:23:I3:    "opencl": {"platform": 0, "device": 0, "compute": "3.0", "driver": "561.3"},
07:33:23:I3:    "cuda": {"platform": 0, "device": 0, "compute": "8.6", "driver": "12.6"},
07:33:23:I3:    "device": 9657,
07:33:23:I3:    "supported": true
07:33:23:I3:  }
07:33:23:I3:}
07:33:23:I1:WU337:Sending dump report
07:33:23:I1:OUT1:> GET https://api.foldingathome.org/machine/0QARVvoMtxjXEYOqpfdMtYIOM3e_z6-d9PVbIJ-9FVs HTTP/1.1
07:33:23:I1:OUT2:> POST https://highland3.seas.upenn.edu/api/results HTTP/1.1
07:33:23:I1:OUT2:< HTTP/1.1 200 HTTP_OK
07:33:23:I1:WU337:Dumped
07:33:24:I1:OUT1:< HTTP/1.1 200 HTTP_OK
07:33:24:I1:OUT3:> GET wss://node1.foldingathome.org/ws/client HTTP/1.1
07:33:24:I1:OUT3:< HTTP/1.1 101 HTTP_SWITCHING_PROTOCOLS
07:33:24:I1:Logging into node account
07:34:13:I1:Machine state fold
07:34:13:I1:Default:Added new work unit: cpus:0 gpus:gpu:01:00:00
07:34:14:I1:WU338:Requesting WU assignment for user belloq team 259998
07:34:14:W :CON5:DNS lookup failed for assign1.foldingathome.org
07:34:14:E :OUT5:Failed response: CONNECT
07:34:14:I1:WU338:Retry #1 in 2 secs
07:34:16:I1:WU338:Requesting WU assignment for user belloq team 259998
07:34:16:W :CON6:DNS lookup failed for assign2.foldingathome.org
07:34:16:E :OUT6:Failed response: CONNECT
07:34:16:I1:WU338:Retry #2 in 4 secs
07:34:20:I1:WU338:Requesting WU assignment for user belloq team 259998
07:34:21:W :CON7:DNS lookup failed for assign3.foldingathome.org
07:34:21:E :OUT7:Failed response: CONNECT
07:34:21:I1:WU338:Retry #3 in 8 secs
07:34:29:I1:WU338:Requesting WU assignment for user belloq team 259998
07:34:29:W :CON8:DNS lookup failed for assign4.foldingathome.org
07:34:29:E :OUT8:Failed response: CONNECT
07:34:29:I1:WU338:Retry #4 in 16 secs
07:34:45:I1:WU338:Requesting WU assignment for user belloq team 259998
07:34:45:W :CON9:DNS lookup failed for assign5.foldingathome.org
07:34:45:E :OUT9:Failed response: CONNECT
07:34:45:I1:WU338:Retry #5 in 32 secs
07:35:17:I1:WU338:Requesting WU assignment for user belloq team 259998
07:35:17:W :CON10:DNS lookup failed for assign6.foldingathome.org
07:35:17:E :OUT10:Failed response: CONNECT
07:35:17:I1:WU338:Retry #6 in 1 min 4 secs
07:36:21:I1:WU338:Requesting WU assignment for user belloq team 259998
07:36:21:I1:OUT11:> POST https://assign1.foldingathome.org/api/assign HTTP/1.1
07:36:22:I1:OUT11:< HTTP/1.1 200 HTTP_OK
07:36:22:I1:WU338:Received WU assignment szUL0muwK1pNdLp_ZtwuJRqkFtD-J-9BUNggnlvgXIM
07:36:22:I1:WU338:Downloading WU
07:36:22:I1:OUT12:> POST https://highland1.seas.upenn.edu/api/assign HTTP/1.1
07:36:24:I1:OUT12:< HTTP/1.1 200 HTTP_OK
07:36:24:I1:WU338:Received WU P18238 R831 C2 G94
07:36:24:I1:Loaded cores/openmm-core-24/windows-10-64bit/release/fahcore-24-windows-10-64bit-release-8.1.4/FahCore_24.exe
07:36:24:I3:Running FahCore: C:\ProgramData\FAHClient\cores/openmm-core-24/windows-10-64bit/release/fahcore-24-windows-10-64bit-release-8.1.4/FahCore_24.exe -dir szUL0muwK1pNdLp_ZtwuJRqkFtD-J-9BUNggnlvgXIM -suffix 01 -version 8.4.9 -lifeline 22228 -gpu-uuid 86ddd4a4-1b26-9e6d-258c-f20d90412433 -gpu-platform cuda -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-platform 0 -cuda-device 0 -gpu 0
07:36:25:I3:WU338:Started FahCore on PID 9492
07:36:25:I1:WU338:*********************** Log Started 2025-02-25T07:36:25Z ***********************
07:36:25:I1:WU338:*************************** Core24 Folding@home Core ***************************
07:36:25:I1:WU338:       Core: Core24
07:36:25:I1:WU338:       Type: 0x24
07:36:25:I1:WU338:    Version: 8.1.4
07:36:25:I1:WU338:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
07:36:25:I1:WU338:  Copyright: 2022 foldingathome.org
07:36:25:I1:WU338:   Homepage: https://foldingathome.org/
07:36:25:I1:WU338:       Date: Jul 25 2024
07:36:25:I1:WU338:       Time: 05:42:49
07:36:25:I1:WU338:   Revision: cf9f0139862b8945a2091772770e4631aac37792
07:36:25:I1:WU338:     Branch: HEAD
07:36:25:I1:WU338:   Compiler: Visual C++
07:36:25:I1:WU338:    Options: $( /TP $) /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2
07:36:25:I1:WU338:             /Zc:throwingNew /MT -DOPENMM_VERSION="\"8.1.1\"" /Ox /std:c++14
07:36:25:I1:WU338:   Platform: win32 10
07:36:25:I1:WU338:       Bits: 64
07:36:25:I1:WU338:       Mode: Release
07:36:25:I1:WU338:Maintainers: John Chodera <john.chodera@choderalab.org> and Peter Eastman
07:36:25:I1:WU338:             <peastman@stanford.edu>
07:36:25:I1:WU338:       Args: -dir szUL0muwK1pNdLp_ZtwuJRqkFtD-J-9BUNggnlvgXIM -suffix 01
07:36:25:I1:WU338:             -version 8.4.9 -lifeline 22228 -gpu-uuid
07:36:25:I1:WU338:             86ddd4a4-1b26-9e6d-258c-f20d90412433 -gpu-platform cuda -gpu-vendor
07:36:25:I1:WU338:             nvidia -opencl-platform 0 -opencl-device 0 -cuda-platform 0
07:36:25:I1:WU338:             -cuda-device 0 -gpu 0
07:36:25:I1:WU338:************************************ libFAH ************************************
07:36:25:I1:WU338:       Date: Jul 25 2024
07:36:25:I1:WU338:       Time: 05:23:50
07:36:25:I1:WU338:   Revision: c7d2824a47eb025fa8cda8968c7a5e971585d90c
07:36:25:I1:WU338:     Branch: HEAD
07:36:25:I1:WU338:   Compiler: Visual C++
07:36:25:I1:WU338:    Options: $( /TP $) /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
07:36:25:I1:WU338:   Platform: win32 10
07:36:25:I1:WU338:       Bits: 64
07:36:25:I1:WU338:       Mode: Release
07:36:25:I1:WU338:************************************ CBang *************************************
07:36:25:I1:WU338:    Version: 1.7.2
07:36:25:I1:WU338:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
07:36:25:I1:WU338:        Org: Cauldron Development LLC
07:36:25:I1:WU338:  Copyright: Cauldron Development LLC, 2003-2024
07:36:25:I1:WU338:   Homepage: https://cauldrondevelopment.com/
07:36:25:I1:WU338:    License: LGPL-2.1-or-later
07:36:25:I1:WU338:       Date: Jul 25 2024
07:36:25:I1:WU338:       Time: 05:22:43
07:36:25:I1:WU338:   Revision: f1cd4c791e8c40a35dcfeab3ab85d910949cc0cb
07:36:25:I1:WU338:     Branch: HEAD
07:36:25:I1:WU338:   Compiler: Visual C++
07:36:25:I1:WU338:    Options: $( /TP $) /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
07:36:25:I1:WU338:   Platform: win32 10
07:36:25:I1:WU338:       Bits: 64
07:36:25:I1:WU338:       Mode: Release
07:36:25:I1:WU338:************************************ System ************************************
07:36:25:I1:WU338:        CPU: 12th Gen Intel(R) Core(TM) i7-12800H
07:36:25:I1:WU338:     CPU ID: GenuineIntel Family 6 Model 154 Stepping 3
07:36:25:I1:WU338:       CPUs: 20
07:36:25:I1:WU338:     Memory: 31.68GiB
07:36:25:I1:WU338:Free Memory: 15.67GiB
07:36:25:I1:WU338: OS Version: 10.0
07:36:25:I1:WU338:Has Battery: true
07:36:25:I1:WU338: On Battery: false
07:36:25:I1:WU338:   Hostname: DL073RV3F97BC9
07:36:25:I1:WU338: UTC Offset: -8
07:36:25:I1:WU338:        PID: 9492
07:36:25:I1:WU338:        CWD: C:\ProgramData\FAHClient\work
07:36:25:I1:WU338:       Exec: C:\ProgramData\FAHClient\cores\openmm-core-24\windows-10-64bit\release\fahcore-24-windows-10-64bit-release-8.1.4\FahCore_24.exe
07:36:25:I1:WU338:************************************ OpenMM ************************************
07:36:25:I1:WU338:    Version: 8.1.1
07:36:25:I1:WU338:********************************************************************************
07:36:25:I1:WU338:Project: 18238 (Run 831, Clone 2, Gen 94)
07:36:25:I1:WU338:Reading tar file core.xml
07:36:25:I1:WU338:Reading tar file integrator.xml
07:36:25:I1:WU338:Reading tar file state.xml.bz2
07:36:25:I1:WU338:Reading tar file system.xml.bz2
07:36:25:I1:WU338:Digital signatures verified
07:36:25:I1:WU338:Folding@home GPU Core24 Folding@home Core
07:36:25:I1:WU338:Version 8.1.4
07:36:25:I1:WU338:  Checkpoint write interval: 50000 steps (2%) [50 total]
07:36:25:I1:WU338:  JSON viewer frame write interval: 25000 steps (1%) [100 total]
07:36:25:I1:WU338:  XTC frame write interval: 10000 steps (0.4%) [250 total]
07:36:25:I1:WU338:  TRR frame write interval: disabled
07:36:25:I1:WU338:  Global context and integrator variables write interval: disabled
07:36:25:I1:WU338:There are 4 platforms available.
07:36:25:I1:WU338:Platform 0: Reference
07:36:25:I1:WU338:Platform 1: CPU
07:36:25:I1:WU338:Platform 2: OpenCL
07:36:25:I1:WU338:  opencl-device 0 specified
07:36:25:I1:WU338:Platform 3: CUDA
07:36:25:I1:WU338:  cuda-device 0 specified
07:36:41:I1:WU338:Attempting to create CUDA context:
07:36:41:I1:WU338:  Configuring platform CUDA
07:37:03:I1:WU338:  Using CUDA on CUDA Platform and gpu 0
07:37:03:I1:WU338:  GPU info: Platform: CUDA
07:37:03:I1:WU338:  GPU info: PlatformIndex: 0
07:37:03:I1:WU338:  GPU info: Device: NVIDIA RTX A1000 Laptop GPU
07:37:03:I1:WU338:  GPU info: DeviceIndex: 0
07:37:03:I1:WU338:  GPU info: Vendor: 0x10de
07:37:03:I1:WU338:  GPU info: PCI: 01:00:00
07:37:03:I1:WU338:  GPU info: Compute: 8.6
07:37:03:I1:WU338:  GPU info: Driver: 12.6
07:37:03:I1:WU338:  GPU info: GPU: true
07:37:04:I1:WU338:Completed 0 out of 2500000 steps (0%)
07:37:09:I1:WU338:Checkpoint completed at step 0
07:57:54:I1:WU338:Completed 25000 out of 2500000 steps (1%)
08:13:41:I1:WU338:Completed 50000 out of 2500000 steps (2%)
muziqaz
Posts: 1205
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 9950x, 7950x3D, 5950x, 5800x3D
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: Client/core dumping GPU WU after pause/reboot

Post by muziqaz »

In Windows Pause the work, give it a minute, before rebooting, please.
If it is still dumping things then there might be something suspicious with your windows 11.
Looking at your logs it seems you might have issues with your SSD.
FAH Omega tester
belloq
Posts: 48
Joined: Thu Sep 24, 2020 12:58 pm

Re: Client/core dumping GPU WU after pause/reboot

Post by belloq »

muziqaz wrote: Tue Feb 25, 2025 4:30 pm In Windows Pause the work, give it a minute, before rebooting, please.
If it is still dumping things then there might be something suspicious with your windows 11.
Looking at your logs it seems you might have issues with your SSD.
Oh, I don't shut down immediately. It's usually 2-3 minutes. Do I need to wait longer? What in the logs suggests an SSD or Win11 issue? Could you maybe elaborate on what "suspicious" means? :)
muziqaz
Posts: 1205
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 9950x, 7950x3D, 5950x, 5800x3D
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: Client/core dumping GPU WU after pause/reboot

Post by muziqaz »

Normally you don't need to wait at all. If your system is responsive enough, you pause folding, and then normally proceed to Start and shutdown/reboot. This has worked for me very well on Win 11 and Win10 systems.
Your first log snippet actually shows that your WU crashes on Pause command, which tells me that there is something unstable with your system, since Pause has not been known to crash WUs, like ever. When you Pause, that pause state is being written into client.db(?) file, plus fahcore is being shutdown during that, which means stuff which was loaded into RAM while folding is now being written on to SSD, so it is either SSD write issue, or RAM is unstable. To be fair I'm just guessing because this issue has never been seen
FAH Omega tester
belloq
Posts: 48
Joined: Thu Sep 24, 2020 12:58 pm

Re: Client/core dumping GPU WU after pause/reboot

Post by belloq »

muziqaz wrote: Tue Feb 25, 2025 11:46 pm To be fair I'm just guessing because this issue has never been seen
8-) Cool.

The shutdown and startup had no issue this afternoon, as I imagine it probably usually is. I'll keep an eye on it.

ty
Post Reply