RX580 stops after 5 mins

It seems that a lot of GPU problems revolve around specific versions of drivers. Though AMD has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

Post Reply
joshj71
Posts: 1
Joined: Fri Mar 06, 2020 4:42 pm

RX580 stops after 5 mins

Post by joshj71 »

New to F@H, After some wrangling with finding opencl for amd I have had some minor success in getting GPU loads running. For a whopping 5 or so minutes. If I restart the FAHclient GPU load will go up to 100% for around 5mins or so then stop, for good, no spikes later on or anything. Nothing in the log either. Any suggestions?

FYI I'm using GPU-Z watch load on the GPU. And I have power slider at full, while I'm working.

Code: Select all

*********************** Log Started 2020-03-06T16:34:58Z ***********************
16:34:58:WU02:FS01:Starting
16:34:58:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Josh\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 02 -suffix 01 -version 705 -lifeline 7684 -checkpoint 15 -gpu-vendor amd -opencl-platform 0 -opencl-device 0 -gpu 0
16:34:58:WU02:FS01:Started FahCore on PID 11024
16:34:58:WU02:FS01:Core PID:9264
16:34:58:WU02:FS01:FahCore 0x22 started
16:34:58:WU02:FS01:0x22:*********************** Log Started 2020-03-06T16:34:58Z ***********************
16:34:58:WU02:FS01:0x22:*************************** Core22 Folding@home Core ***************************
16:34:58:WU02:FS01:0x22:       Type: 0x22
16:34:58:WU02:FS01:0x22:       Core: Core22
16:34:58:WU02:FS01:0x22:    Website: https://foldingathome.org/
16:34:58:WU02:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
16:34:58:WU02:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
16:34:58:WU02:FS01:0x22:             <rafal.wiewiora@choderalab.org>
16:34:58:WU02:FS01:0x22:       Args: -dir 02 -suffix 01 -version 705 -lifeline 11024 -checkpoint 15
16:34:58:WU02:FS01:0x22:             -gpu-vendor amd -opencl-platform 0 -opencl-device 0 -gpu 0
16:34:58:WU02:FS01:0x22:     Config: <none>
16:34:58:WU02:FS01:0x22:************************************ Build *************************************
16:34:58:WU02:FS01:0x22:    Version: 0.0.2
16:34:58:WU02:FS01:0x22:       Date: Dec 6 2019
16:34:58:WU02:FS01:0x22:       Time: 21:30:31
16:34:58:WU02:FS01:0x22: Repository: Git
16:34:58:WU02:FS01:0x22:   Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
16:34:59:WU02:FS01:0x22:     Branch: HEAD
16:34:59:WU02:FS01:0x22:   Compiler: Visual C++ 2008
16:34:59:WU02:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
16:34:59:WU02:FS01:0x22:   Platform: win32 10
16:34:59:WU02:FS01:0x22:       Bits: 64
16:34:59:WU02:FS01:0x22:       Mode: Release
16:34:59:WU02:FS01:0x22:************************************ System ************************************
16:34:59:WU02:FS01:0x22:        CPU: AMD Ryzen 7 1700X Eight-Core Processor
16:34:59:WU02:FS01:0x22:     CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
16:34:59:WU02:FS01:0x22:       CPUs: 16
16:34:59:WU02:FS01:0x22:     Memory: 63.95GiB
16:34:59:WU02:FS01:0x22:Free Memory: 55.42GiB
16:34:59:WU02:FS01:0x22:    Threads: WINDOWS_THREADS
16:34:59:WU02:FS01:0x22: OS Version: 6.2
16:34:59:WU02:FS01:0x22:Has Battery: false
16:34:59:WU02:FS01:0x22: On Battery: false
16:34:59:WU02:FS01:0x22: UTC Offset: -8
16:34:59:WU02:FS01:0x22:        PID: 9264
16:34:59:WU02:FS01:0x22:        CWD: C:\Users\Josh\AppData\Roaming\FAHClient\work
16:34:59:WU02:FS01:0x22:         OS: Windows 10 Pro
16:34:59:WU02:FS01:0x22:    OS Arch: AMD64
16:34:59:WU02:FS01:0x22:********************************************************************************
16:34:59:WU02:FS01:0x22:Project: 11737 (Run 0, Clone 3132, Gen 96)
16:34:59:WU02:FS01:0x22:Unit: 0x000000888ca304f15e34fa3d497b6aa1
16:34:59:WU02:FS01:0x22:Reading tar file core.xml
16:34:59:WU02:FS01:0x22:Reading tar file integrator.xml
16:34:59:WU02:FS01:0x22:Reading tar file state.xml
16:34:59:WU02:FS01:0x22:Reading tar file system.xml
16:34:59:WU02:FS01:0x22:Digital signatures verified
16:34:59:WU02:FS01:0x22:Folding@home GPU Core22 Folding@home Core
16:34:59:WU02:FS01:0x22:Version 0.0.2
Last edited by joshj71 on Fri Mar 06, 2020 5:06 pm, edited 1 time in total.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: RX580 stops after 5 mins

Post by bruce »

Have you modified any of FAH's settings? When you post the log, we really need the second page from the beginning to understand about your system. Where is the slider set? Have you configured FAH to run on IDLE? (etc.)

Is there enough cooling air to keep your hardware from overheating? Most modern hardware will protect itself by shutting down if it's getting too hot.
foldy
Posts: 2040
Joined: Sat Dec 01, 2012 3:43 pm
Hardware configuration: Folding@Home Client 7.6.13 (1 GPU slots)
Windows 7 64bit
Intel Core i5 2500k@4Ghz
Nvidia gtx 1080ti driver 441

Re: RX580 stops after 5 mins

Post by foldy »

FAH is very demanding on the hardware. Could it be a hardware issue?
Demandzm
Posts: 13
Joined: Sat Mar 28, 2020 9:45 pm

Re: RX580 stops after 5 mins

Post by Demandzm »

I have the same issue. I get a few gpu WU every day, but after a few minutes it stops. The logs show a bad WU error. Next time that happens i will post the log.
BJMcGee
Posts: 5
Joined: Sun Mar 22, 2020 6:31 pm

Re: RX580 stops after 5 mins

Post by BJMcGee »

I am running a rig with multiple RX580s ... and can confirm, they will overheat after 5~10 minutes without increasing the airflow. The stock fan settings allow things to get too hot before it reacts. You can override your fan settings using a multitude of tools. I'm on Ubuntu, so my solution is certainly different. Find an AMD, or third party, GPU fan controller. Here is one example from AMD: https://www.amd.com/en/support/kb/faq/dh-020

One thing to keep in mind with temperatures on graphics cards: The temperature readings and limits are typically focused on the GPU itself, while the memory may be your actual heat source. So if you are seeing steady 75C GPU temps and think you have ~20C margin, you may have less margin on the memory chips.

Once you have your GPU fan operating a bit better, make sure you have enough air moving through your case. Check your case fans and air filters.
Demandzm
Posts: 13
Joined: Sat Mar 28, 2020 9:45 pm

Re: RX580 stops after 5 mins

Post by Demandzm »

Using hardware monitor no temperature reaches 70 degrees before it stops folding. I might need to start a new post as I think my issue is different.
Joe_H
Site Admin
Posts: 7927
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: RX580 stops after 5 mins

Post by Joe_H »

BJMcGee wrote:One thing to keep in mind with temperatures on graphics cards: The temperature readings and limits are typically focused on the GPU itself, while the memory may be your actual heat source. So if you are seeing steady 75C GPU temps and think you have ~20C margin, you may have less margin on the memory chips.
One note about memory and folding, the RAM speed is not as important for the speed of folding calculations on a GPU. So reducing the memory clock will have little impact on folding speed, and can reduce the heat and temperature created.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Post Reply