Page 1 of 1

RX580 stops after 5 mins

Posted: Fri Mar 06, 2020 4:59 pm
by joshj71
New to F@H, After some wrangling with finding opencl for amd I have had some minor success in getting GPU loads running. For a whopping 5 or so minutes. If I restart the FAHclient GPU load will go up to 100% for around 5mins or so then stop, for good, no spikes later on or anything. Nothing in the log either. Any suggestions?

FYI I'm using GPU-Z watch load on the GPU. And I have power slider at full, while I'm working.

Code: Select all

*********************** Log Started 2020-03-06T16:34:58Z ***********************
16:34:58:WU02:FS01:Starting
16:34:58:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Josh\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 02 -suffix 01 -version 705 -lifeline 7684 -checkpoint 15 -gpu-vendor amd -opencl-platform 0 -opencl-device 0 -gpu 0
16:34:58:WU02:FS01:Started FahCore on PID 11024
16:34:58:WU02:FS01:Core PID:9264
16:34:58:WU02:FS01:FahCore 0x22 started
16:34:58:WU02:FS01:0x22:*********************** Log Started 2020-03-06T16:34:58Z ***********************
16:34:58:WU02:FS01:0x22:*************************** Core22 Folding@home Core ***************************
16:34:58:WU02:FS01:0x22:       Type: 0x22
16:34:58:WU02:FS01:0x22:       Core: Core22
16:34:58:WU02:FS01:0x22:    Website: https://foldingathome.org/
16:34:58:WU02:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
16:34:58:WU02:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
16:34:58:WU02:FS01:0x22:             <rafal.wiewiora@choderalab.org>
16:34:58:WU02:FS01:0x22:       Args: -dir 02 -suffix 01 -version 705 -lifeline 11024 -checkpoint 15
16:34:58:WU02:FS01:0x22:             -gpu-vendor amd -opencl-platform 0 -opencl-device 0 -gpu 0
16:34:58:WU02:FS01:0x22:     Config: <none>
16:34:58:WU02:FS01:0x22:************************************ Build *************************************
16:34:58:WU02:FS01:0x22:    Version: 0.0.2
16:34:58:WU02:FS01:0x22:       Date: Dec 6 2019
16:34:58:WU02:FS01:0x22:       Time: 21:30:31
16:34:58:WU02:FS01:0x22: Repository: Git
16:34:58:WU02:FS01:0x22:   Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
16:34:59:WU02:FS01:0x22:     Branch: HEAD
16:34:59:WU02:FS01:0x22:   Compiler: Visual C++ 2008
16:34:59:WU02:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
16:34:59:WU02:FS01:0x22:   Platform: win32 10
16:34:59:WU02:FS01:0x22:       Bits: 64
16:34:59:WU02:FS01:0x22:       Mode: Release
16:34:59:WU02:FS01:0x22:************************************ System ************************************
16:34:59:WU02:FS01:0x22:        CPU: AMD Ryzen 7 1700X Eight-Core Processor
16:34:59:WU02:FS01:0x22:     CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
16:34:59:WU02:FS01:0x22:       CPUs: 16
16:34:59:WU02:FS01:0x22:     Memory: 63.95GiB
16:34:59:WU02:FS01:0x22:Free Memory: 55.42GiB
16:34:59:WU02:FS01:0x22:    Threads: WINDOWS_THREADS
16:34:59:WU02:FS01:0x22: OS Version: 6.2
16:34:59:WU02:FS01:0x22:Has Battery: false
16:34:59:WU02:FS01:0x22: On Battery: false
16:34:59:WU02:FS01:0x22: UTC Offset: -8
16:34:59:WU02:FS01:0x22:        PID: 9264
16:34:59:WU02:FS01:0x22:        CWD: C:\Users\Josh\AppData\Roaming\FAHClient\work
16:34:59:WU02:FS01:0x22:         OS: Windows 10 Pro
16:34:59:WU02:FS01:0x22:    OS Arch: AMD64
16:34:59:WU02:FS01:0x22:********************************************************************************
16:34:59:WU02:FS01:0x22:Project: 11737 (Run 0, Clone 3132, Gen 96)
16:34:59:WU02:FS01:0x22:Unit: 0x000000888ca304f15e34fa3d497b6aa1
16:34:59:WU02:FS01:0x22:Reading tar file core.xml
16:34:59:WU02:FS01:0x22:Reading tar file integrator.xml
16:34:59:WU02:FS01:0x22:Reading tar file state.xml
16:34:59:WU02:FS01:0x22:Reading tar file system.xml
16:34:59:WU02:FS01:0x22:Digital signatures verified
16:34:59:WU02:FS01:0x22:Folding@home GPU Core22 Folding@home Core
16:34:59:WU02:FS01:0x22:Version 0.0.2

Re: RX580 stops after 5 mins

Posted: Fri Mar 06, 2020 5:05 pm
by bruce
Have you modified any of FAH's settings? When you post the log, we really need the second page from the beginning to understand about your system. Where is the slider set? Have you configured FAH to run on IDLE? (etc.)

Is there enough cooling air to keep your hardware from overheating? Most modern hardware will protect itself by shutting down if it's getting too hot.

Re: RX580 stops after 5 mins

Posted: Fri Mar 06, 2020 5:23 pm
by foldy
FAH is very demanding on the hardware. Could it be a hardware issue?

Re: RX580 stops after 5 mins

Posted: Sat Mar 28, 2020 9:53 pm
by Demandzm
I have the same issue. I get a few gpu WU every day, but after a few minutes it stops. The logs show a bad WU error. Next time that happens i will post the log.

Re: RX580 stops after 5 mins

Posted: Sat Mar 28, 2020 11:33 pm
by BJMcGee
I am running a rig with multiple RX580s ... and can confirm, they will overheat after 5~10 minutes without increasing the airflow. The stock fan settings allow things to get too hot before it reacts. You can override your fan settings using a multitude of tools. I'm on Ubuntu, so my solution is certainly different. Find an AMD, or third party, GPU fan controller. Here is one example from AMD: https://www.amd.com/en/support/kb/faq/dh-020

One thing to keep in mind with temperatures on graphics cards: The temperature readings and limits are typically focused on the GPU itself, while the memory may be your actual heat source. So if you are seeing steady 75C GPU temps and think you have ~20C margin, you may have less margin on the memory chips.

Once you have your GPU fan operating a bit better, make sure you have enough air moving through your case. Check your case fans and air filters.

Re: RX580 stops after 5 mins

Posted: Sun Mar 29, 2020 9:00 am
by Demandzm
Using hardware monitor no temperature reaches 70 degrees before it stops folding. I might need to start a new post as I think my issue is different.

Re: RX580 stops after 5 mins

Posted: Sun Mar 29, 2020 4:01 pm
by Joe_H
BJMcGee wrote:One thing to keep in mind with temperatures on graphics cards: The temperature readings and limits are typically focused on the GPU itself, while the memory may be your actual heat source. So if you are seeing steady 75C GPU temps and think you have ~20C margin, you may have less margin on the memory chips.
One note about memory and folding, the RAM speed is not as important for the speed of folding calculations on a GPU. So reducing the memory clock will have little impact on folding speed, and can reduce the heat and temperature created.