FP64 trial run, anyone?
Moderators: Site Moderators, FAHC Science Team
-
- Posts: 887
- Joined: Wed May 26, 2010 2:31 pm
- Hardware configuration: Atom330 (overclocked):
Windows 7 Ultimate 64bit
Intel Atom330 dualcore (4 HyperThreads)
NVidia GT430, core_15 work
2x2GB Kingston KVR1333D3N9K2/4G 1333MHz memory kit
Asus AT3IONT-I Deluxe motherboard - Location: Finland
FP64 trial run, anyone?
I'm wondering if it might be worthwhile to start two trial projects side-by-side, FP32 vs FP64 once again?
-
- Site Admin
- Posts: 7927
- Joined: Tue Apr 21, 2009 4:41 pm
- Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2 - Location: W. MA
Re: FP64 trial run, anyone?
What exactly are you proposing? The tests of extended precision calculations done years ago showed that the extra precision was not needed, and caused much longer run times. This might be better asked of the PG on the reddit.
iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
-
- Site Moderator
- Posts: 6349
- Joined: Sun Dec 02, 2007 10:38 am
- Location: Bordeaux, France
- Contact:
Re: FP64 trial run, anyone?
And with the current orientation of GPU manufacturers, DP performance is being left behind to save power ... so it wouldn't be wise move to start using DP again.
-
- Posts: 887
- Joined: Wed May 26, 2010 2:31 pm
- Hardware configuration: Atom330 (overclocked):
Windows 7 Ultimate 64bit
Intel Atom330 dualcore (4 HyperThreads)
NVidia GT430, core_15 work
2x2GB Kingston KVR1333D3N9K2/4G 1333MHz memory kit
Asus AT3IONT-I Deluxe motherboard - Location: Finland
Re: FP64 trial run, anyone?
Definitely not for GPUs. However, previous FP64 core was 0x79 - that was a long time ago. I was actually wondering if AVX could boost FP64 performance so much that it would be about as fast as current SSE2 implementations with FP32? PG is cooking AVX versions of cores, thus increasing the number of core versions they need to support and test anyway. Maybe AVX with FP64 wouldn't be an entirely unreasonable addition.
Anyway, Joe_H reminded me that this isn't the best place for such discussion... presuming there's any need to continue the discussion.
Anyway, Joe_H reminded me that this isn't the best place for such discussion... presuming there's any need to continue the discussion.
Re: FP64 trial run, anyone?
FP64 always involves moving more data and using other resources than FP32. The key, here is that additional resolution that FP64 might provide does not add useful scientific information, so why consider using it? If I want to predict the tides, knowing the distance to the moon within 1 km is good enough and knowing it within a fraction of a mm adds no useful information ... especially if it takes the computer longer to determine that number but even if it just generates more heat.
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.
-
- Posts: 50
- Joined: Mon Jan 16, 2017 11:40 am
- Hardware configuration: 4x1080Ti + 2x1050Ti
- Location: Russia, Moscow
Re: FP64 trial run, anyone?
Hi everyone!
two months ago i own couple of FirePro W8100 (Tonga-chip). Tested one with f@h and get ~150k points at 800 GHz chip and 1250 Ghz memory. Are that point numbers ok? Are there any fp64 projects in f@h? couse w8100 is ~1,9 GFlops in fp64 and should be good for that kind of projects
two months ago i own couple of FirePro W8100 (Tonga-chip). Tested one with f@h and get ~150k points at 800 GHz chip and 1250 Ghz memory. Are that point numbers ok? Are there any fp64 projects in f@h? couse w8100 is ~1,9 GFlops in fp64 and should be good for that kind of projects
Re: FP64 trial run, anyone?
At the time that Core 0x79 was cloned from 0x78 and recompiled using FP64, it was an either/or proposition. By setting a simple compiler option, FP32 could be compiled using FP64 instructions. The conclusion were as stated above: Costly to (heat/speed) / performance with no improvement in science. That same choice can easily be made available today and I have little doubt the conclusion would be the same, whether we're talking x86 hardware, AVX hardware, or GPU hardware -- though that has not been tested publicly. The only real hardware requirement is that the FP64 operations be supported, whether by semi-dedicated hardware or by some kind of emulation. (And it should be obvious that emulation will be more costly that discrete circuitry.)
What's different, though, is that there have been improvements to the GROMACS/OpenMM code since the proposition was tested. FAHCores now use what's called MIXED precision. The bulk of operations are carried out using FP32 operations while designated portions of the code use FH64 instructions. This allows the code to provide FP64 precision when it's needed while avoiding wasting resources where the added precision is unnecessary. (This does limit project assignments for some of the oldest of GPUs.)
The basic assumptions underlying the original question that FAH has to make a choice between FP32 and FP64 make the question no longer applicable.
What's different, though, is that there have been improvements to the GROMACS/OpenMM code since the proposition was tested. FAHCores now use what's called MIXED precision. The bulk of operations are carried out using FP32 operations while designated portions of the code use FH64 instructions. This allows the code to provide FP64 precision when it's needed while avoiding wasting resources where the added precision is unnecessary. (This does limit project assignments for some of the oldest of GPUs.)
The basic assumptions underlying the original question that FAH has to make a choice between FP32 and FP64 make the question no longer applicable.
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.