BSODs on GPU jobs
Moderators: Site Moderators, FAHC Science Team
-
- Posts: 71
- Joined: Sun Dec 02, 2007 4:29 pm
- Hardware configuration: Gigabyte Aorus Z590 Pro AX, Intel i9-10850K, 32GB Crucial Ballistix DDR4-2600, Samsung NVMe EVO 980 Pro 256GB, CoolerMaster liquid cooler ML360, Nvidia Titan X (Pascal), Dell Nvidia RTX 3080 10GB 4Y12V, Pop!_OS.
- Location: Fair Play, SC
BSODs on GPU jobs
I have had three BSODs on GPU jobs in the last two days related to fah.exe. I have re-imaged my HD to one week ago, removed F@H, and tested my computer and found no problems on my end. I am running an HPE-h9-1135 Phoenix with AMD FX-8350 processor, AMD HD 7790 video card, 16 GB RAM, and Windows 7 Home Premium 64-bit. I will reinstall F@H in a few minutes and see how it goes.
[edit]Driver Packaging Version 14.10.1006-140417a-171099C
Catalyst Version 14.4
Provider Advanced Micro Devices, Inc.
2D Driver Version 8.01.01.1390
Direct3D Version 9.14.10.01029
OpenGL Version 6.14.10.12874
AMD Catalyst Control Center Version 2014.0417.2226.38446
AMD Audio Driver Version 7.12.0.7718[/edit]
Re: BSODs on GPU jobs
My experience with BSODs usually implicates hardware problems, which can include over-heating. Are you overclocking either the GPU or the CPU? Warmer weather could bring out latent problems. Less likely, but also possible, are memory errors (either in the DDR3 main memory, or the video card memory).
Re: BSODs on GPU jobs
(Even without overclocking,) when you added the GPU card, did you do anything to make sure that the fans would be able to handle the increased heat without an increase in internal temperatures?
Posting FAH's log:
How to provide enough info to get helpful support.
-
- Site Moderator
- Posts: 6986
- Joined: Wed Dec 23, 2009 9:33 am
- Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB
Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
- Location: Land Of The Long White Cloud
Re: BSODs on GPU jobs
I do hope that your PSU is able to power the system once loaded. BTW, have you considered running other stability benchmarks to see if this is a F@H only issue or a stability issue in general? Finally, did you choose the updated version of 14.4 WHQL drivers or the initial release?
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time
Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
-
- Posts: 71
- Joined: Sun Dec 02, 2007 4:29 pm
- Hardware configuration: Gigabyte Aorus Z590 Pro AX, Intel i9-10850K, 32GB Crucial Ballistix DDR4-2600, Samsung NVMe EVO 980 Pro 256GB, CoolerMaster liquid cooler ML360, Nvidia Titan X (Pascal), Dell Nvidia RTX 3080 10GB 4Y12V, Pop!_OS.
- Location: Fair Play, SC
Re: BSODs on GPU jobs
I have an 800 watt power supply. This CPU has never been overclocked. CPU temp (liquid cooler) is 52C and GPU temp is 60C. I run without the side cover. This morning I uninstalled the video drivers and Catalyst Control Center and reinstalled using a new download of 14-4-win7-win8-win8.1-64-dd-ccc-whql.exe. It has been running satisfactorily since. This is an error I found in Event Viewer | Applications:
After the BSOD early today, I restarted (before computer testing/reinstalling) and had a popup when the desktop loaded that said Fah had faulted. We'll see how it goes. I was a moderator on another project and I often said, "The client won't break your computer but it will reveal computer problems." I've been sorting this computer for 20 months. Thanks.
The message could be related to the recovery using a six day old image of the drive.
Faulting application name: FahCore_a3.exe, version: 0.0.0.0, time stamp: 0x4d4720af
Faulting module name: FahCore_a3.exe, version: 0.0.0.0, time stamp: 0x4d4720af
Exception code: 0xc0000005
Fault offset: 0x00261bf6
Faulting process id: 0xb2c
Faulting application start time: 0x01cf8db6bf51d914
Faulting application path: C:\Users\Vester\AppData\Roaming\FAHClient\cores\web.stanford.edu\~pande\Win32\AMD64\Core_a3.fah\FahCore_a3.exe
Faulting module path: C:\Users\Vester\AppData\Roaming\FAHClient\cores\web.stanford.edu\~pande\Win32\AMD64\Core_a3.fah\FahCore_a3.exe
Report Id: ca1dcce0-fa2b-11e3-a5d1-9cb70d9c72c4
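For anyone reading a record like the one above, here is a minimal Python sketch of how its fields could be decoded. The helper names (`parse_fault_record`, `filetime_to_datetime`) and the small status table are illustrative assumptions, not part of FAH or Windows tooling. It assumes the start time is a Windows FILETIME value (100-nanosecond ticks since 1601-01-01 UTC).

```python
from datetime import datetime, timedelta, timezone

# A few well-known NTSTATUS values; this table is deliberately not exhaustive.
NTSTATUS_NAMES = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC000001D: "STATUS_ILLEGAL_INSTRUCTION",
    0xC0000094: "STATUS_INTEGER_DIVIDE_BY_ZERO",
    0xC00000FD: "STATUS_STACK_OVERFLOW",
}

# FILETIME counts 100 ns ticks from this instant.
FILETIME_EPOCH = datetime(1601, 1, 1, tzinfo=timezone.utc)


def parse_fault_record(text):
    """Split 'Key: value' lines of an Application Error record into a dict."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(": ")
        if sep:  # skip lines that do not look like 'Key: value'
            fields[key.strip()] = value.strip()
    return fields


def filetime_to_datetime(hex_value):
    """Convert a hexadecimal FILETIME string to a UTC datetime."""
    ticks = int(hex_value, 16)
    return FILETIME_EPOCH + timedelta(microseconds=ticks // 10)


record = parse_fault_record(
    "Exception code: 0xc0000005\n"
    "Faulting application start time: 0x01cf8db6bf51d914"
)
code = int(record["Exception code"], 16)
print(NTSTATUS_NAMES.get(code, "unknown status"))
print(filetime_to_datetime(record["Faulting application start time"]))
```

The status name only says that the process touched memory it was not allowed to touch. It does not by itself say whether the cause was bad RAM, a driver, or the core.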
Re: BSODs on GPU jobs
I don't run CPU jobs, but that error could be a result of the crash rather than the cause. I practically never see errors in the GPU jobs and don't even think of them as a possible cause, though it could happen of course. Let us know if you discover any hardware problems (not due to temperature though!).
-
- Posts: 2948
- Joined: Sun Dec 02, 2007 4:36 am
- Hardware configuration: Machine #1:
Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).
Machine #2:
Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.
Machine 3:
Dell Dimension 8400, 3.2GHz P4 4x512MB Ram, Video card GTX 460, Windows 7 X32
I am currently folding just on the 5x GTX 460's for approx. 70K PPD
- Location: Salem. OR USA
Re: BSODs on GPU jobs
The 0xc0000005 code is a conventional memory access violation, and FahCore_a3 is a CPU folding core, so I'm not sure why the BSOD would have anything to do with the GPU, unless we are dealing with a power issue where the video card pulls so much current that it drops the voltage for the CPU/MB/RAM. May I suggest that you check your PS rails to make sure that the video card is on a separate rail from everything else?
Re: BSODs on GPU jobs
Vester wrote: I have had three BSODs on GPU jobs in the last two days related to fah.exe. I have re-imaged my HD to one week ago, removed F@H, and tested my computer and found no problems on my end.
You do realize that there's a false assumption there. You didn't test the computer under identical conditions. Put simply, a computer running FAH is drawing more power and generating more heat and undergoing more double-checking of results than a computer that's not running FAH.
Posting FAH's log:
How to provide enough info to get helpful support.
-
- Posts: 1094
- Joined: Wed Nov 05, 2008 3:19 pm
- Location: Cambridge, UK
Re: BSODs on GPU jobs
Why do you think the BSODs were related to GPU jobs? The one Win log extract relates to a CPU job (Core_A3). Do you have any other error logs?
Memory access errors are often the result of failing RAM, or sometimes the motherboard circuitry. Run a memory check, preferably for several hours. Check memory is correctly seated. If errors persist, remove or swap memory. I have seen machines fail with two identical sticks, but work with either one alone. Otherwise, check power distribution and voltages on load as P5-133XL suggests above. Could also be a local hotspot near the memory, but this is unlikely with the temp you noted.
However, if the machine is now working properly after driver update, and if that was the only change since faults were appearing, there could have been a problem in the previous driver version that FAH revealed. I don't think the Win log records enough information to determine the actual faulty software module -- at least not in a form mere mortals can interpret.
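As a rough aid for the "voltages on load" check mentioned above, here is a small Python sketch that flags rail readings outside a tolerance band. It assumes the usual ATX figure of ±5% on the +12 V, +5 V and +3.3 V rails. The function names and the sample readings are hypothetical; real values would come from a multimeter or a hardware monitor while FAH is running.

```python
# Assumed tolerance: +/-5% on the positive ATX rails.
TOLERANCE = 0.05


def rail_in_spec(nominal, measured, tolerance=TOLERANCE):
    """Return True if a measured voltage is within tolerance of nominal."""
    return abs(measured - nominal) <= nominal * tolerance


def out_of_spec(readings, tolerance=TOLERANCE):
    """Return (nominal, measured) pairs that fall outside the tolerance band."""
    return [
        (nominal, measured)
        for nominal, measured in readings.items()
        if not rail_in_spec(nominal, measured, tolerance)
    ]


# Hypothetical readings taken under full load, keyed by nominal voltage.
sample = {12.0: 11.3, 5.0: 4.95, 3.3: 3.31}
for nominal, measured in out_of_spec(sample):
    print(f"+{nominal} V rail reads {measured} V, outside the assumed band")
```

Readings taken at idle say little here. The sag this is meant to catch only shows up while the CPU and GPU are both under load.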
-
- Posts: 71
- Joined: Sun Dec 02, 2007 4:29 pm
- Hardware configuration: Gigabyte Aorus Z590 Pro AX, Intel i9-10850K, 32GB Crucial Ballistix DDR4-2600, Samsung NVMe EVO 980 Pro 256GB, CoolerMaster liquid cooler ML360, Nvidia Titan X (Pascal), Dell Nvidia RTX 3080 10GB 4Y12V, Pop!_OS.
- Location: Fair Play, SC
Re: BSODs on GPU jobs
OK, accept this statement: "This computer crashed three times while crunching seven CPU jobs and one GPU job. I am done. I don't have time for it anymore. I have a new boat. Bye."
-
- Posts: 61
- Joined: Thu Oct 02, 2008 1:15 pm
- Hardware configuration: Asus Crosshair Hero VIII, AMD Ryzen 3950x, 2x8GB 3600MHz DDR4, Radeon VII, Win10
Asus Crosshair Hero VII, Amd Ryzen 3900x, 2x16GB 3200MHz DDR4, GeForce 980 TI, Kubuntu 19.10
- Location: Finland
Re: BSODs on GPU jobs
Although this is a bit late, all of my BSODs have happened on 14.xx series drivers. Even the newest 14.6 drivers cause a BSOD together with folding and Firefox/Flash. The 13.12 drivers have been bluescreen-free for me, so I'll probably be going back to them.
Re: BSODs on GPU jobs
I have been Folding on two HD 7790s on the same motherboard with the 14.4 drivers (Win7 64-bit) since they came out with no problems. A couple of days ago I switched the cards from a Haswell machine to an Ivy Bridge machine. These are the Power Color cards, and they run hotter than most cards (typically 70 to 80 C), but with a side case fan on them that has not been a problem thus far.