Tesla V100 classification

Gnomuz · Post by **Gnomuz** » Sun Jan 03, 2021 10:40 am

I'm currently folding, thanks to Knish's great guide, on a temporary (free credit) Azure VM instance with a Tesla V100 PCIe 16GB GPU. This GPU works great and has gone as high as 6.5 million PPD on projects like 13434, 13437, 13438 and 17316, with an average PPD above 5 million.
However, it has just folded a few WUs from project 16928 where the PPD was circa 1.6 million. I checked, and the GPU utilisation was around 80%, with a power draw of 100W out of 250W (vs 200W on other projects). To sum up, I have the impression this GPU is largely underused by such WUs.
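As a rough sketch, the figures quoted above can be put side by side. The variable names are only illustrative, and the "typical" PPD uses the 5 million average as a lower bound rather than a measured value.

```python
# Figures quoted in the post (the typical PPD is "above 5 million", so treat it as a floor)
ppd_typical = 5.0e6      # average PPD on projects such as 13434, 13437, 13438, 17316
ppd_16928 = 1.6e6        # PPD observed on project 16928
power_typical_w = 200    # power draw on the other projects, in watts
power_16928_w = 100      # power draw on project 16928, in watts
board_power_w = 250      # board power limit, in watts

# Share of the usual output obtained on project 16928
ppd_ratio = ppd_16928 / ppd_typical

# Fraction of the power budget actually used in each case
power_fraction_typical = power_typical_w / board_power_w
power_fraction_16928 = power_16928_w / board_power_w

# Points per day per watt, as a crude efficiency figure
ppd_per_watt_typical = ppd_typical / power_typical_w
ppd_per_watt_16928 = ppd_16928 / power_16928_w

print(f"PPD on 16928 vs typical: {ppd_ratio:.0%}")
print(f"Power budget used: {power_fraction_16928:.0%} vs {power_fraction_typical:.0%}")
print(f"PPD per watt: {ppd_per_watt_16928:.0f} vs {ppd_per_watt_typical:.0f}")
```

So the card delivers roughly a third of its usual PPD on project 16928 while drawing half its usual power, which is what suggests it is underused there.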

I had a look at GPUs.txt, and this GPU appears as follows:
"0x10de:0x1db4:2:7:GV100GL [Tesla V100 PCIe 16GB] M 14028"
I understand the "7" is the "category" of the GPU and has an impact on the assignment process. I also noticed my modest personal RTX 3060 Ti is classified as "8", although it averages 3 million PPD on equivalent WUs.
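For reference, here is a minimal sketch of how such a line could be split into its fields. The field names (`vendor_id`, `device_id`, `gpu_type`, `category`, `description`) are my own assumptions for illustration; only the "7" as the category comes from the entry discussed above, and the function `parse_gpus_line` is hypothetical, not part of any Folding@home tool.

```python
from typing import NamedTuple


class GpuEntry(NamedTuple):
    # Field names are assumptions made for this example
    vendor_id: int     # PCI vendor ID (0x10de in the quoted line)
    device_id: int     # PCI device ID (0x1db4 in the quoted line)
    gpu_type: int      # third field; its exact meaning is assumed here
    category: int      # the "7" discussed above
    description: str   # free-text remainder of the line


def parse_gpus_line(line: str) -> GpuEntry:
    """Split one colon-separated entry into its five fields."""
    # Split on the first four colons only, so the description stays in one piece
    vendor, device, gpu_type, category, description = (
        line.strip().strip('"').split(":", 4)
    )
    return GpuEntry(
        int(vendor, 16), int(device, 16), int(gpu_type), int(category), description
    )


entry = parse_gpus_line('"0x10de:0x1db4:2:7:GV100GL [Tesla V100 PCIe 16GB] M 14028"')
print(entry.category)     # 7
print(entry.description)  # GV100GL [Tesla V100 PCIe 16GB] M 14028
```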

So I just wonder whether this GPU is properly identified, or whether it should be "upgraded" to "8" so that, provided there is no shortage of complex WUs, it is not assigned WUs that don't make full use of its computing power.
I hope it makes sense.

Happy folding, all!

JimboPalmer · Post by **JimboPalmer** » Sun Jan 03, 2021 12:12 pm

7 is Volta, like your card; Ampere is 8.
They are numbered by features, not performance.

https://en.wikipedia.org/wiki/Volta_(microarchitecture)
https://en.wikipedia.org/wiki/Ampere_(m ... hitecture)
The wiki on Ampere lists the differences.

I have an older Pascal card that outperforms my newer Turing cards, so newer is not always better.

Gnomuz · Post by **Gnomuz** » Sun Jan 03, 2021 12:38 pm

Thanks for the reply. I hadn't found any recent explanation of this classification, so once again I have learnt something.

bruce · Post by **bruce** » Sun Jan 03, 2021 2:31 pm

Two more facts:

* Productivity is NOT linear. Take two projects and two GPUs, benchmark the 4 combinations, and the productivity figures don't quite make sense. One GPU may be much more productive than the other on one project, while on another project the differences may be small. There are many factors, but the most obvious is that a protein with a small number of atoms running on a GPU with a large number of shaders performs poorly in comparison. Nothing can really be done about that except to assign big proteins to GPUs with larger numbers of shaders ... but with the variations in the projects which happen to be active, that's not always possible.

* There's an ongoing project to revamp the GPU Species structure. GPUs are being benchmarked and the plan is to restructure everything based on performance instead of GPU generation. It's a complex project and there's no predicting when it will alter production assignments. Given the facts in the previous paragraph, it's a nearly impossible task.
