CMP 90HX for non-FP32 workloads
Posted: Mon Dec 02, 2024 2:59 pm
Hello,
I recently came across some NVIDIA CMP 90HX mining cards, which are allegedly just 3080 dies with many functions disabled by fuse, firmware, or driver. I thought they would be good for folding, but they only get 200-400K PPD, while my daily-driver 3080 gets ~7M PPD. Looking into these cards further, I found that their performance is massively reduced in non-integer workloads, although the hit seems smaller when there are few or no FP32 FMA or FP64 operations. There is of course almost no hit for integer workloads, since those are needed for mining.
From what I've seen, disabling FMA (fused multiply-add) somehow restores most of the performance, but you then run into precision errors, so that looks like it would be an issue. I would also think the core would need to be recompiled without FMA for this to work, which seems unlikely.
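To illustrate the kind of precision difference I mean, here is a minimal sketch in Python with NumPy. It is not the folding core's code and does not run on the GPU. It only emulates an FP32 fused multiply-add on the CPU by doing the multiply and add in float64 and rounding once at the end, which matches a true FMA in most cases but not all. The function names and input values are my own, picked so the difference is visible.

```python
import numpy as np

def mul_add_unfused(a, b, c):
    """a*b rounded to float32, then +c rounded to float32 (two roundings)."""
    a, b, c = np.float32(a), np.float32(b), np.float32(c)
    product = np.float32(a * b)      # first rounding happens here
    return np.float32(product + c)   # second rounding happens here

def mul_add_fused_emulated(a, b, c):
    """Emulated FP32 FMA (assumption: float64 stands in for the exact product).

    The product of two float32 values fits exactly in float64, so only the
    final conversion back to float32 rounds in the common case.
    """
    a = np.float64(np.float32(a))
    b = np.float64(np.float32(b))
    c = np.float64(np.float32(c))
    return np.float32(a * b + c)

# Hypothetical inputs: the exact product is 1 + 2**-11 + 2**-24, and the
# 2**-24 term is lost when the product is rounded to float32 on its own.
a = 1.0 + 2.0**-12
b = 1.0 + 2.0**-12
c = -(1.0 + 2.0**-11)

print("unfused:", mul_add_unfused(a, b, c))
print("fused  :", mul_add_fused_emulated(a, b, c))
```

With these inputs the unfused version cancels to exactly zero, while the emulated fused version keeps the small 2**-24 remainder. For CUDA code you compile yourself, nvcc's `--fmad=false` option turns off contraction of multiplies and adds into FMA; I don't know whether the folding cores can be built that way.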
Are there any causes I could go for, or other things I could change, to work around these cards' limitations?