Page 1 of 1

Error with Radeon Vega on Arch Linux

Posted: Sun Mar 15, 2020 8:45 pm
by an00bis
Hi,

I am running an AMD Radeon Vega 56 with amdgpu driver and rocm-opencl-runtime on Manjaro Linux Kernel 5.4.24-1 and get the following error in the Log:

Code: Select all

20:32:58:WU02:FS01:0x22:*********************** Log Started 2020-03-15T20:32:58Z ***********************
20:32:58:WU02:FS01:0x22:*************************** Core22 Folding@home Core ***************************
20:32:58:WU02:FS01:0x22:       Type: 0x22
20:32:58:WU02:FS01:0x22:       Core: Core22
20:32:58:WU02:FS01:0x22:    Website: https://foldingathome.org/
20:32:58:WU02:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
20:32:58:WU02:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
20:32:58:WU02:FS01:0x22:             <rafal.wiewiora@choderalab.org>
20:32:58:WU02:FS01:0x22:       Args: -dir 02 -suffix 01 -version 705 -lifeline 10581 -checkpoint 15
20:32:58:WU02:FS01:0x22:             -gpu-vendor amd -opencl-platform 0 -opencl-device 0 -gpu 0
20:32:58:WU02:FS01:0x22:     Config: <none>
20:32:58:WU02:FS01:0x22:************************************ Build *************************************
20:32:58:WU02:FS01:0x22:    Version: 0.0.2
20:32:58:WU02:FS01:0x22:       Date: Dec 6 2019
20:32:58:WU02:FS01:0x22:       Time: 21:20:17
20:32:58:WU02:FS01:0x22: Repository: Git
20:32:58:WU02:FS01:0x22:   Revision: f87d92b58abdf7e6bf2e173cfbc4dc3e837c7042
20:32:58:WU02:FS01:0x22:     Branch: core22
20:32:58:WU02:FS01:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
20:32:58:WU02:FS01:0x22:    Options: -std=gnu++98 -O3 -funroll-loops
20:32:58:WU02:FS01:0x22:   Platform: linux2 4.9.87-linuxkit-aufs
20:32:58:WU02:FS01:0x22:       Bits: 64
20:32:58:WU02:FS01:0x22:       Mode: Release
20:32:58:WU02:FS01:0x22:************************************ System ************************************
20:32:58:WU02:FS01:0x22:        CPU: AMD Ryzen 5 3600 6-Core Processor
20:32:58:WU02:FS01:0x22:     CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
20:32:58:WU02:FS01:0x22:       CPUs: 12
20:32:58:WU02:FS01:0x22:     Memory: 15.58GiB
20:32:58:WU02:FS01:0x22:Free Memory: 11.78GiB
20:32:58:WU02:FS01:0x22:    Threads: POSIX_THREADS
20:32:58:WU02:FS01:0x22: OS Version: 5.4
20:32:58:WU02:FS01:0x22:Has Battery: false
20:32:58:WU02:FS01:0x22: On Battery: false
20:32:58:WU02:FS01:0x22: UTC Offset: 1
20:32:58:WU02:FS01:0x22:        PID: 10585
20:32:58:WU02:FS01:0x22:        CWD: /opt/fah/work
20:32:58:WU02:FS01:0x22:         OS: Linux 5.4.24-1-MANJARO x86_64
20:32:58:WU02:FS01:0x22:    OS Arch: AMD64
20:32:58:WU02:FS01:0x22:********************************************************************************
20:32:58:WU02:FS01:0x22:Project: 11746 (Run 0, Clone 3415, Gen 2)
20:32:58:WU02:FS01:0x22:Unit: 0x000000058ca304f15e6aa875e6fe39fb
20:32:58:WU02:FS01:0x22:Reading tar file core.xml
20:32:58:WU02:FS01:0x22:Reading tar file integrator.xml
20:32:58:WU02:FS01:0x22:Reading tar file state.xml
20:32:58:WU02:FS01:0x22:Reading tar file system.xml
20:32:58:WU02:FS01:0x22:Digital signatures verified
20:32:58:WU02:FS01:0x22:Folding@home GPU Core22 Folding@home Core
20:32:58:WU02:FS01:0x22:Version 0.0.2
20:33:02:WU02:FS01:0x22:ERROR:exception: Error initializing context: clCreateCommandQueue (-6)
20:33:02:WU02:FS01:0x22:Saving result file ../logfile_01.txt
20:33:02:WU02:FS01:0x22:Saving result file science.log
20:33:02:WU02:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
20:33:03:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:33:03:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:11746 run:0 clone:3415 gen:2 core:0x22 unit:0x000000058ca304f15e6aa875e6fe39fb
20:33:03:WU02:FS01:Uploading 8.00KiB to 140.163.4.241
20:33:03:WU02:FS01:Connecting to 140.163.4.241:8080
20:33:03:WU03:FS01:Connecting to 65.254.110.245:8080
20:33:03:WU03:FS01:Assigned to work server 140.163.4.241
20:33:03:WU03:FS01:Requesting new work unit for slot 01: READY gpu:0:Vega 10 XL/XT [Radeon RX Vega 56/64] from 140.163.4.241
20:33:03:WU03:FS01:Connecting to 140.163.4.241:8080
Can someone help?

Re: Error with Radeon Vega on Arch Linux

Posted: Sun Mar 15, 2020 9:02 pm
by JimboPalmer
If you posted the first 200 lines of the log, it would tell us more about your PC. I can't see what AMD driver you are using. (No generic driver will work, you need the AMD Pro driver)

I am not a linux user but often you need to install the OpenCL support separate from the driver.

https://www.amd.com/en/support/kb/faq/gpu-56

Re: Error with Radeon Vega on Arch Linux

Posted: Sun Mar 15, 2020 9:09 pm
by an00bis
amdgpu driver and rocm-opencl-runtime

Code: Select all

*********************** Log Started 2020-03-15T21:06:56Z ***********************
21:06:56:************************* Folding@home Client *************************
21:06:56:        Website: https://foldingathome.org/
21:06:56:      Copyright: (c) 2009-2018 foldingathome.org
21:06:56:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:06:56:           Args: --config /opt/fah/config.xml --exec-directory=/opt/fah
21:06:56:                 --data-directory=/opt/fah
21:06:56:         Config: /opt/fah/config.xml
21:06:56:******************************** Build ********************************
21:06:56:        Version: 7.5.1
21:06:56:           Date: May 11 2018
21:06:56:           Time: 19:59:04
21:06:56:     Repository: Git
21:06:56:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
21:06:56:         Branch: master
21:06:56:       Compiler: GNU 6.3.0 20170516
21:06:56:        Options: -std=gnu++98 -O3 -funroll-loops
21:06:56:       Platform: linux2 4.14.0-3-amd64
21:06:56:           Bits: 64
21:06:56:           Mode: Release
21:06:56:******************************* System ********************************
21:06:56:            CPU: AMD Ryzen 5 3600 6-Core Processor
21:06:56:         CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
21:06:56:           CPUs: 12
21:06:56:         Memory: 15.58GiB
21:06:56:    Free Memory: 10.79GiB
21:06:56:        Threads: POSIX_THREADS
21:06:56:     OS Version: 5.4
21:06:56:    Has Battery: false
21:06:56:     On Battery: false
21:06:56:     UTC Offset: 1
21:06:56:            PID: 21930
21:06:56:            CWD: /opt/fah
21:06:56:             OS: Linux 5.4.24-1-MANJARO x86_64
21:06:56:        OS Arch: AMD64
21:06:56:           GPUs: 1
21:06:56:          GPU 0: Bus:48 Slot:0 Func:0 AMD:5 Vega 10 XL/XT [Radeon RX Vega 56/64]
21:06:56:           CUDA: Not detected: Failed to open dynamic library 'libcuda.so':
21:06:56:                 libcuda.so: cannot open shared object file: No such file or
21:06:56:                 directory
21:06:56:OpenCL Device 0: Platform:0 Device:0 Bus:48 Slot:0 Compute:2.0 Driver:3084.0
21:06:56:OpenCL Device 1: Platform:1 Device:0 Bus:NA Slot:NA Compute:1.1 Driver:19.3
21:06:56:***********************************************************************
21:06:56:<config>
21:06:56:  <!-- Network -->
21:06:56:  <proxy v=':8080'/>
21:06:56:
21:06:56:  <!-- User Information -->
21:06:56:  <team v='45032'/>
21:06:56:  <user v='an00bis'/>
21:06:56:
21:06:56:  <!-- Folding Slots -->
21:06:56:  <slot id='0' type='CPU'/>
21:06:56:  <slot id='1' type='GPU'/>
21:06:56:</config>
21:06:56:Trying to access database...
21:06:56:Successfully acquired database lock
21:06:56:Enabled folding slot 00: READY cpu:10
21:06:56:Enabled folding slot 01: READY gpu:0:Vega 10 XL/XT [Radeon RX Vega 56/64]
21:06:56:WU00:FS00:Connecting to 65.254.110.245:8080
21:06:56:WU01:FS01:Connecting to 65.254.110.245:8080
21:06:57:WU00:FS00:Assigned to work server 128.252.203.9
21:06:57:WU00:FS00:Requesting new work unit for slot 00: READY cpu:10 from 128.252.203.9
21:06:57:WU01:FS01:Assigned to work server 140.163.4.231
21:06:57:WU00:FS00:Connecting to 128.252.203.9:8080
21:06:57:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:Vega 10 XL/XT [Radeon RX Vega 56/64] from 140.163.4.231
21:06:57:WU01:FS01:Connecting to 140.163.4.231:8080
21:08:14:ERROR:WU01:FS01:Exception: 10002: Received short response, expected 512 bytes, got 0
21:08:14:WU01:FS01:Connecting to 65.254.110.245:8080
21:08:15:WU01:FS01:Assigned to work server 140.163.4.241
21:08:15:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:Vega 10 XL/XT [Radeon RX Vega 56/64] from 140.163.4.241
21:08:15:WU01:FS01:Connecting to 140.163.4.241:8080

Re: Error with Radeon Vega on Arch Linux

Posted: Sun Mar 15, 2020 9:15 pm
by foldy
Maybe that guide can help you
viewtopic.php?f=89&t=31205#p304145

Re: Error with Radeon Vega on Arch Linux

Posted: Sun Mar 15, 2020 9:16 pm
by JimboPalmer
21:08:14:ERROR:WU01:FS01:Exception: 10002: Received short response, expected 512 bytes, got 0

This error (in Windows) often means some firewall is not passing F@H data, I have no reason to suspect Linux is different in this regard. Can you disable the firewalls for 10 minutes to test?

Re: Error with Radeon Vega on Arch Linux

Posted: Sun Mar 15, 2020 9:17 pm
by lordasshat
rocm-opencl-rutime did not work for me Arch/Manjaro user with a vega 64.
I was however able to fold with opencl-amd installed out of the aur.

i have the folowing installed and am folding
opencl-mesa

##aur
aur/opencl-amd
aur/foldingathome
aur/fahcontrol
aur/fahviewer

Hope this helps

Re: Error with Radeon Vega on Arch Linux

Posted: Sun Mar 15, 2020 9:39 pm
by an00bis
lordasshat wrote:rocm-opencl-rutime did not work for me Arch/Manjaro user with a vega 64.
I was however able to fold with opencl-amd installed out of the aur.

i have the folowing installed and am folding
opencl-mesa

##aur
aur/opencl-amd
aur/foldingathome
aur/fahcontrol
aur/fahviewer

Hope this helps
It sure did help, I am now folding ;) Thank you!