CPU Architecture and FAH

A forum for discussing FAH-related hardware choices and info on actual products (not speculation).

Moderator: Site Moderators

Forum rules
Please read the forum rules before posting.
7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: CPU Architecture and FAH

Post by 7im »

Buy the fastest processor with the more cores that you can afford. Yes, quad cores will increase your PPD quite a bit, while not increasing your electricity cost that much.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
shatteredsilicon
Posts: 87
Joined: Tue Jul 08, 2008 2:27 pm
Hardware configuration: 1x Q6600 @ 3.2GHz, 4GB DDR3-1333
1x Phenom X4 9950 @ 2.6GHz, 4GB DDR2-1066
3x GeForce 9800GX2
1x GeForce 8800GT
CentOS 5 x86-64, WINE 1.x with CUDA wrappers

Re: CPU Architecture and FAH

Post by shatteredsilicon »

So, if I'm following this correctly, you are asking what CPU will be sufficient to saturate whatever number of GPUs you are planning to have. If that's the case, then I can confirm that my own experience indicates that it takes approximately 15-20% (depending on the WU) of a 2.6GHz+ Core2/Phenom class CPU to keep a G92 GPU saturated. G200s may take a bit more. So, if you are planning to run <= 5 GPUs (probably 4 if you are running G200s) in the system, a Core2 Solo would be sufficient to keep the GPUs fed (you'd have no CPU time left for an SMP client, though). This is under Linux with the wine cuda wrappers, and a kernel with high-res timer support.
Image
1x Q6600 @ 3.2GHz, 4GB DDR3-1333
1x Phenom X4 9950 @ 2.6GHz, 4GB DDR2-1066
3x GeForce 9800GX2
1x GeForce 8800GT
CentOS 5 x86-64, WINE 1.x with CUDA wrappers
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: CPU Architecture and FAH

Post by bruce »

Welcome to the foldingforum, Archangelboy.

The FAH software does it's best to take advantage of whatever hardware you have. Clearly a Quad can process twice as many instructions in a given time compared to a Duo so (assuming other factors don't change) you'd expect nearly twice the PPD. The CPU will cost more and will draw more power, but the RAM, Video, HD, etc. will not change so you'll get a more efficient system.

Comparing the Core2 architecture to older intel architecture, they did improve the efficiency of Floating Point (and SSE) math, so yes, it's more efficient than older machines. Actual FAH speed improvements comparing i7 with Core2 are small, although HyperThreading can help some non-FAH code quite a bit.

GPUs are able to process a lot more instructions in parallel than today's CPUs, so they get higher throughput.

As far as which architecture the Pande Group would prefer, they are thankful for anything you're willing to contribute to the project (Duh). If you're willing to invest more money or more electricity in newer/faster technology, that's even better, but there's no simple answer to your question. They'd also be happier if you fold on a dozen computers instead of just one or two.

Let me turn the question around another way. If you have a computer that could be folding but isn't, why isn't it?
Archangelboy
Posts: 9
Joined: Thu Jul 09, 2009 5:52 pm
Hardware configuration: i7 920 on EVGA x58 SLI Vanilla @ 3.66 GHz Watercooled
6GB DDR3
3x GTX 280 GPU
4 notfreds smp clients, 3 GPU clients
Location: Bozeman, MT

Re: CPU Architecture and FAH

Post by Archangelboy »

Thanks, Bruce, and well put!

Just picked up a(nother) used i7, plan on putting it and a 295 online as soon as I can afford RAM and a PSU in addition to what I've got running now.

Any word on how i9 Intels will perform? similarly to i7 (with scale-up for additional cores) or will there be an improvement/core (or per client) with that architecture?
toTOW
Site Moderator
Posts: 6349
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: CPU Architecture and FAH

Post by toTOW »

See this news on FAH-Addict : first review of Core i9 ;) (yes I know, it's in French ... or Chinise for the review :roll: )
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Archangelboy
Posts: 9
Joined: Thu Jul 09, 2009 5:52 pm
Hardware configuration: i7 920 on EVGA x58 SLI Vanilla @ 3.66 GHz Watercooled
6GB DDR3
3x GTX 280 GPU
4 notfreds smp clients, 3 GPU clients
Location: Bozeman, MT

Re: CPU Architecture and FAH

Post by Archangelboy »

C'est d'accord, toTow, j'en parle un peu. Merci :)
divery4eyes
Posts: 41
Joined: Tue Jun 03, 2008 10:10 pm
Hardware configuration: E2180 @1.08, 2Gb ddr2, 500Gb hd, Last XP, 8800 GT 1835 shader
AMD Athlon x2 2200, 2GB DDR2, ?? GB HD, Last XP, GT 240 1600 shader
Location: off of I40 in Arkansas

Re: CPU Architecture and FAH

Post by divery4eyes »

sad to say, but I do not foresee AMD being with us much longer :-(
over 30,000 werk units
"Everyone is in the same boat. Grab an oar, and paddle some WU's." -RAH

Still GPU folding with 8.2 teraflops
Archangelboy
Posts: 9
Joined: Thu Jul 09, 2009 5:52 pm
Hardware configuration: i7 920 on EVGA x58 SLI Vanilla @ 3.66 GHz Watercooled
6GB DDR3
3x GTX 280 GPU
4 notfreds smp clients, 3 GPU clients
Location: Bozeman, MT

Re: CPU Architecture and FAH

Post by Archangelboy »

Lol nah, if AMD can't compete, they'll just whine until somebody makes Intel pay to keep their operation afloat. Thank you, EU >.<
shatteredsilicon
Posts: 87
Joined: Tue Jul 08, 2008 2:27 pm
Hardware configuration: 1x Q6600 @ 3.2GHz, 4GB DDR3-1333
1x Phenom X4 9950 @ 2.6GHz, 4GB DDR2-1066
3x GeForce 9800GX2
1x GeForce 8800GT
CentOS 5 x86-64, WINE 1.x with CUDA wrappers

Re: CPU Architecture and FAH

Post by shatteredsilicon »

divery4eyes wrote:sad to say, but I do not foresee AMD being with us much longer :-(
We can but hope. Then maybe we can finally knock x86 on the head once and for all.
Image
1x Q6600 @ 3.2GHz, 4GB DDR3-1333
1x Phenom X4 9950 @ 2.6GHz, 4GB DDR2-1066
3x GeForce 9800GX2
1x GeForce 8800GT
CentOS 5 x86-64, WINE 1.x with CUDA wrappers
YashBudini
Posts: 3
Joined: Wed Dec 12, 2007 1:44 am

Re: CPU Architecture and FAH

Post by YashBudini »

My question would be in regards to quad cores and SMP under Windows. We've seen that the 4th core is less busy, and some people have dealt with that in various ways, like running 2 iterations of SMP on a quad core (I'm not advocating that nor do I want to start a discussion about it.) My question? Given Windows behavior and disregarding the L3 cache differences for the moment, would an AMD X3 710 (3 cores @ 2.6GHZ) fold significantly slower (25% or more) than an X4 810 (4 cores @ same speed)? The idea being to keep all cores as busy as possible, presumably when the threads outnumber the cores?

I'd expect the 710 to be slower than the 810, but it would be interesting to run a test and see how much.

Last question - does this Windows inefficiency still exist in Win 7?
Where ever you are, you're there.
John Naylor
Posts: 357
Joined: Mon Dec 03, 2007 4:36 pm
Hardware configuration: Q9450 OC @ 3.2GHz (Win7 Home Premium) - SMP2
E7500 OC @ 3.66GHz (Windows Home Server) - SMP2
i5-3750k @ 3.8GHz (Win7 Pro) - SMP2
Location: University of Birmingham, UK

Re: CPU Architecture and FAH

Post by John Naylor »

The windows inefficiency is related to the design of the core rather than the OS. However if the OS could affect it, it is unlikely it would be fixed by Win7 as Win7's kernel is a development of Vista's kernel.
Folding whatever I'm sent since March 2006 :) Beta testing since October 2006. www.FAH-Addict.net Administrator since August 2009.
MtM
Posts: 1579
Joined: Fri Jun 27, 2008 2:20 pm
Hardware configuration: Q6600 - 8gb - p5q deluxe - gtx275 - hd4350 ( not folding ) win7 x64 - smp:4 - gpu slot
E6600 - 4gb - p5wdh deluxe - 9600gt - 9600gso - win7 x64 - smp:2 - 2 gpu slots
E2160 - 2gb - ?? - onboard gpu - win7 x32 - 2 uniprocessor slots
T5450 - 4gb - ?? - 8600M GT 512 ( DDR2 ) - win7 x64 - smp:2 - gpu slot
Location: The Netherlands
Contact:

Re: CPU Architecture and FAH

Post by MtM »

John if the os was not a problem a port from linux a2 to windows a2 wouldn't take this long. There are inherant issues with the scheduler which prevented a quick port, but I'm not aware of the specific details. Maybe 7im or Bruce can elaborate ( or correct me ) on the issue?

@yashbudini it's not only relevant how high the utilisation of each core is, it's also very relevant to how balanced the execution of each thread is in regards to the others. This is also the main argument toTow focuses on with his view on the current linux 2.10 core issues. If one thread has to wait for the others it slows down the computing process, so while in theory a tri core could offer the same performance for each thread as a quad core ( assuming about 75% usage on the quad which isn't exactly true but let's use it for arguments sake ) the synchronisation between the threads would negatively impact the final performance.

That's in theory, in practise I'm fairly certain people have tested tri vs quad core but I'm not one of them and can't recall from memory what the outcome has been ( or for that matter at what forum the thread was which I read about it, I am almost certain it was not here but probably on one of the russian/ukraine forums. I think Anglik666 was the one who lead me to that forum so it might be the polish team forum as well ).

I'll send him a pm with link to this thread, I think he can say more about the testing and results :)
John Naylor
Posts: 357
Joined: Mon Dec 03, 2007 4:36 pm
Hardware configuration: Q9450 OC @ 3.2GHz (Win7 Home Premium) - SMP2
E7500 OC @ 3.66GHz (Windows Home Server) - SMP2
i5-3750k @ 3.8GHz (Win7 Pro) - SMP2
Location: University of Birmingham, UK

Re: CPU Architecture and FAH

Post by John Naylor »

I meant that a1 had/has efficiency problems on all 3 platforms, so win7 is unlikely to fix these problems :P not referring to a2 lol

EDIT: Also afaik the issues with the a2 core were down to Gromacs 4.0 compiling on Windows but not producing accurate results... since we know SMP2 development is now underway I doubt they would bother to try to port a2 now, even though the existence of the released core a4 shows that gromacs 4 now produces correct results on windows.
Last edited by John Naylor on Mon Sep 07, 2009 11:32 pm, edited 1 time in total.
Folding whatever I'm sent since March 2006 :) Beta testing since October 2006. www.FAH-Addict.net Administrator since August 2009.
Anglik666
Posts: 33
Joined: Tue Dec 16, 2008 9:35 pm

Re: CPU Architecture and FAH

Post by Anglik666 »

Well, this days I have had Radeon HD and Phenom CPU, GPU's taken 100% of one core so 3 left for SMP.
Using Mpich (for win with core a1) I've got almost twice worst PPD for 3 cores then for 4, so I used this one: viewtopic.php?f=14&t=5564.
I was sorting processes by mem, cpu usage, rotating them, made the magic 12 pairs of cpus for 4 processes etc.
The best effect I've got sorting them by mem (randomize like) and for 12 pairs. It still was more then 30%-40% worst then 4 cores but not 2 times.
After that I found the way to run Deino in 64bits M$ systems and it gave me results only 30% worst ish without any 3rd party program at all.
So in my opinion Deino much better scales itself.

But, remember, I've never checked that with stopper, always more or less.
PC1: PII 940, 790GX-P, 4x2GB, HD 6870
PC2: P 9750, 790GX-P, 4x1GB, GTS 250
PC3: 2x O 2427, SM H8DII+-F, 4x2GB ECC
PC4: 2x X e5420, SM X8DLT-I, 6x1GB ECC
Image
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: CPU Architecture and FAH

Post by bruce »

MtM wrote:John if the os was not a problem a port from linux a2 to windows a2 wouldn't take this long. There are inherant issues with the scheduler which prevented a quick port, but I'm not aware of the specific details. Maybe 7im or Bruce can elaborate ( or correct me ) on the issue?
The only think I've heard is that A2 does not work on Windows. I don't know the details of why, but since it's apparently "impossible" to port it, you're wrong about how long it "should take."

I agree with John.
John Naylor wrote:Also afaik the issues with the a2 core were down to Gromacs 4.0 compiling on Windows but not producing accurate results... since we know SMP2 development is now underway I doubt they would bother to try to port a2 now, even though the existence of the released core a4 shows that gromacs 4 now produces correct results on windows.
I expect that A1 (and maybe A2, too) will be replaced with whatever code they're talking about when they mention "SMP2" but there's no way to predict when that might be.
Locked