AMD is a hippy here
Which device are the tasks running on?
Moderators: Site Moderators, FAHC Science Team
-
Peter_Hucker
- Posts: 370
- Joined: Wed Feb 16, 2022 1:18 am
- Hardware configuration: Ryzen 9 3900XT: 24 cores, 128GB RAM, 1TB NVME, 4TB HDD, R9 Nano (Fiji) GPU.
Ryzen 9 3900X: 24 cores, 64GB RAM, 250GB NVME.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME, R9 290(Hawaii) GPU.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME.
I3-6100: 4 cores, 32GB RAM, 250GB NVME, 2 of R9 2980X (Tahiti) GPUs.
5 other smaller computers. - Location: Scotland
Re: Which device are the tasks running on?
It is a distinct runtime API from CUDA, but hipify can do a very efficient one-to-one conversion from CUDA to HIP because it supports all the same features. Unlike OpenCL where there are automatic transpilers but they are not smart enough to rewrite CUDA-isms in OpenCL efficiently.
It works fine on Nvidia but FAH does not use it.Peter_Hucker wrote: ↑Sun Apr 06, 2025 2:36 pm From the AMD website:
"The Heterogeneous-computing Interface for Portability (HIP) is a C++ runtime API and kernel language that lets you create portable applications for AMD and NVIDIA GPUs from a single source code."
https://rocm.docs.amd.com/projects/HIP/ ... index.html
-
Peter_Hucker
- Posts: 370
- Joined: Wed Feb 16, 2022 1:18 am
- Hardware configuration: Ryzen 9 3900XT: 24 cores, 128GB RAM, 1TB NVME, 4TB HDD, R9 Nano (Fiji) GPU.
Ryzen 9 3900X: 24 cores, 64GB RAM, 250GB NVME.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME, R9 290(Hawaii) GPU.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME.
I3-6100: 4 cores, 32GB RAM, 250GB NVME, 2 of R9 2980X (Tahiti) GPUs.
5 other smaller computers. - Location: Scotland
Re: Which device are the tasks running on?
"transpiler" and "CUDA-isms" - I must remember those terms
I like.
-
muziqaz
- Posts: 2130
- Joined: Sun Dec 16, 2007 6:22 pm
- Hardware configuration: 9950x, 9950x3D, 5950x, 5800x3D
7900xtx, RX9070, Radeon 7, 5700xt, 6900xt, Intel B580 - Location: London
- Contact:
Re: Which device are the tasks running on?
I don't see the point of anyone using HIP over CUDA.
If you already have Nvidia GPU, your code is CUDA, unless you are some sort of masochistic enthusiast, or someone planning to ditch Nvidia, which never happens these days
If you already have Nvidia GPU, your code is CUDA, unless you are some sort of masochistic enthusiast, or someone planning to ditch Nvidia, which never happens these days
-
Peter_Hucker
- Posts: 370
- Joined: Wed Feb 16, 2022 1:18 am
- Hardware configuration: Ryzen 9 3900XT: 24 cores, 128GB RAM, 1TB NVME, 4TB HDD, R9 Nano (Fiji) GPU.
Ryzen 9 3900X: 24 cores, 64GB RAM, 250GB NVME.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME, R9 290(Hawaii) GPU.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME.
I3-6100: 4 cores, 32GB RAM, 250GB NVME, 2 of R9 2980X (Tahiti) GPUs.
5 other smaller computers. - Location: Scotland
Re: Which device are the tasks running on?
Because someone like Folding could write in HIP only and not have to bother with AMD/Nvidia considerations.
-
muziqaz
- Posts: 2130
- Joined: Sun Dec 16, 2007 6:22 pm
- Hardware configuration: 9950x, 9950x3D, 5950x, 5800x3D
7900xtx, RX9070, Radeon 7, 5700xt, 6900xt, Intel B580 - Location: London
- Contact:
Re: Which device are the tasks running on?
Yeah, no.Peter_Hucker wrote: ↑Mon Apr 07, 2025 7:29 am Because someone like Folding could write in HIP only and not have to bother with AMD/Nvidia considerations.
CUDA has been in fah waaay before HIP. Hell, HIP is still not out for AMD. So, no one will ditch perfectly working CUDA, which is constantly improved and looked after by well funded Nvidia software team, and go with HIP, which I had to beg OpenMM Devs to support, and which is developed and looked after hopes and dreams of upper management of AMD.
Let's be real here, please.
-
Peter_Hucker
- Posts: 370
- Joined: Wed Feb 16, 2022 1:18 am
- Hardware configuration: Ryzen 9 3900XT: 24 cores, 128GB RAM, 1TB NVME, 4TB HDD, R9 Nano (Fiji) GPU.
Ryzen 9 3900X: 24 cores, 64GB RAM, 250GB NVME.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME, R9 290(Hawaii) GPU.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME.
I3-6100: 4 cores, 32GB RAM, 250GB NVME, 2 of R9 2980X (Tahiti) GPUs.
5 other smaller computers. - Location: Scotland
Re: Which device are the tasks running on?
When someone comes to write a new program, they will obviously want to write it only once. So they will obviously choose something which works on as many platforms as possible to get the most work units done. This has to be HIP. Sure, Nvidia will run OpenCL, but apparently not as efficiently.
-
muziqaz
- Posts: 2130
- Joined: Sun Dec 16, 2007 6:22 pm
- Hardware configuration: 9950x, 9950x3D, 5950x, 5800x3D
7900xtx, RX9070, Radeon 7, 5700xt, 6900xt, Intel B580 - Location: London
- Contact:
Re: Which device are the tasks running on?
Sigh, no one in their right mind in current market will choose HIP over CUDA for Nvidia.Peter_Hucker wrote: ↑Mon Apr 07, 2025 8:23 am When someone comes to write a new program, they will obviously want to write it only once. So they will obviously choose something which works on as many platforms as possible to get the most work units done. This has to be HIP. Sure, Nvidia will run OpenCL, but apparently not as efficiently.
They'd rather implement CUDA first, ignoring AMD completely or they gonna implement CUDA ignoring AMD and leaving it for later when they have some spare time.
CUDA has been in development for over a decade, it is feature rich and well supported by Nvidia who seem to have a plan in place for it. It is being developed by software guys who have been with CUDA for very very long time.
HIP has been developed for 3 years, it is still in development and is trying to catch up to CUDA and is still quite some distance away from it. It is being developed a small group of developers who have just been hired for it. AMD upper management only very recently realised that HIP needs extra attention.
Now looking at these two scenarios, do you still think anyone (besides diehard fanboys or mental patients) will pick HIP over CUDA for nVidia GPUs?
-
Peter_Hucker
- Posts: 370
- Joined: Wed Feb 16, 2022 1:18 am
- Hardware configuration: Ryzen 9 3900XT: 24 cores, 128GB RAM, 1TB NVME, 4TB HDD, R9 Nano (Fiji) GPU.
Ryzen 9 3900X: 24 cores, 64GB RAM, 250GB NVME.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME, R9 290(Hawaii) GPU.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME.
I3-6100: 4 cores, 32GB RAM, 250GB NVME, 2 of R9 2980X (Tahiti) GPUs.
5 other smaller computers. - Location: Scotland
Re: Which device are the tasks running on?
If I was working for a project and I was writing a new program for crowd computing, I'd want to write it only once to save time and money, and get as many volunteers crunching as possible. There are lots of AMDs out there, why would I want to ignore them and get only half the work achieved? I want to use a program which will run on everything. So my choices are simply OpenCL or HIP. From what you've said, I gather HIP is more efficient than OpenCL.
Re: Which device are the tasks running on?
Writing it in CUDA once and then using hipify to convert it into HIP would be better. Then if HIP goes tits up the only thing of value you lose is AMD devices. But if you write in HIP and HIP goes tits up, you lose everything.Peter_Hucker wrote: ↑Mon Apr 07, 2025 9:19 am If I was working for a project and I was writing a new program for crowd computing, I'd want to write it only once to save time and money, and get as many volunteers crunching as possible. There are lots of AMDs out there, why would I want to ignore them and get only half the work achieved? I want to use a program which will run on everything. So my choices are simply OpenCL or HIP. From what you've said, I gather HIP is more efficient than OpenCL.
-
Peter_Hucker
- Posts: 370
- Joined: Wed Feb 16, 2022 1:18 am
- Hardware configuration: Ryzen 9 3900XT: 24 cores, 128GB RAM, 1TB NVME, 4TB HDD, R9 Nano (Fiji) GPU.
Ryzen 9 3900X: 24 cores, 64GB RAM, 250GB NVME.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME, R9 290(Hawaii) GPU.
Xeon X5650 dual CPU server: 24 cores, 64GB RAM, 250GB NVME.
I3-6100: 4 cores, 32GB RAM, 250GB NVME, 2 of R9 2980X (Tahiti) GPUs.
5 other smaller computers. - Location: Scotland
Re: Which device are the tasks running on?
If the conversion is still just as efficient as writing it natively, then yes, good idea.
Re: Which device are the tasks running on?
Yes, the hipify conversion is just as efficient as using the native CUDA. Unlike a direct conversion to OpenCL, where lots of performance is lost.
-
bobdarnall
- Posts: 3
- Joined: Sat Dec 21, 2024 6:08 am
Re: Which device are the tasks running on?
I know this thread went a few directions, this is all I was trying to say, which sounded like the original ask:

