Lots of problems with random FAH crashes on GPU (RX 5600 XT)
Posted: Mon Apr 06, 2020 4:03 pm
I don't know if this is a driver or a BIOS issue, or if F@H Core v22 just doesn't like the 5600 XT, or if my card is just butts
Note: This is (primarily) concerning the FAH core crashing, becoming unstable, or otherwise just not working. At this time, while I have encountered one blue-screen, I have not noticed full system or GPU crashes as a result of running FAH, but I haven't been able to successfully fold on my GPU.
I've had seemingly every issue in the book:
- clCreateCommandQueue (-6) errors getting the program started
- Reaching bad (NaN) states and being unable to load from a checkpoint
- Random crashes in the F@H core
- Core refuses to boot up and no log
Some sample log files can be found here: viewtopic.php?f=61&t=33972
I'm running a Sapphire Pulse graphics card at factory-stock settings, and it was able to run multiple consecutive runs of FurMark (max temp on the last run was 75 C) without any system errors, benchmark shutdowns, or other evidence of instability. I have downloaded the latest AMD drivers, and GPU-Z is detecting that OpenCL support is present on the card.
I'm sort of at wit's end; right now, I have disabled GPU folding since I have yet to successfully fold a protein and constantly submitting bad WU's doesn't help advance the science, but I'd very much like to be able to use this card for the betterment of kicking CoVID in the teeth.
GPU-Z Profile: https://i.imgur.com/AEZoBdO.gif
From what I've seen, there are some issues being reported with GCN-based cards but this is a Navi-based card which is having the issues, so... Not sure where to go from here. Maybe a BIOS flash would fix this?
Note: This is (primarily) concerning the FAH core crashing, becoming unstable, or otherwise just not working. At this time, while I have encountered one blue-screen, I have not noticed full system or GPU crashes as a result of running FAH, but I haven't been able to successfully fold on my GPU.
I've had seemingly every issue in the book:
- clCreateCommandQueue (-6) errors getting the program started
- Reaching bad (NaN) states and being unable to load from a checkpoint
- Random crashes in the F@H core
- Core refuses to boot up and no log
Some sample log files can be found here: viewtopic.php?f=61&t=33972
I'm running a Sapphire Pulse graphics card at factory-stock settings, and it was able to run multiple consecutive runs of FurMark (max temp on the last run was 75 C) without any system errors, benchmark shutdowns, or other evidence of instability. I have downloaded the latest AMD drivers, and GPU-Z is detecting that OpenCL support is present on the card.
I'm sort of at wit's end; right now, I have disabled GPU folding since I have yet to successfully fold a protein and constantly submitting bad WU's doesn't help advance the science, but I'd very much like to be able to use this card for the betterment of kicking CoVID in the teeth.
GPU-Z Profile: https://i.imgur.com/AEZoBdO.gif
From what I've seen, there are some issues being reported with GCN-based cards but this is a Navi-based card which is having the issues, so... Not sure where to go from here. Maybe a BIOS flash would fix this?