Statistics (not the usual...)

Moderators: Site Moderators, FAHC Science Team

Post Reply
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Statistics (not the usual...)

Post by ajm »

I would be nice to have some current stats expressed in more universal values than the "points", at least for FAH's overall performance. I don't know, peta/teraFLOPS might be appropriate?
It was published recently that FAH is now faster than the world's top 7 supercomputers combined: https://www.extremetech.com/extreme/308 ... s-combined
This kind of stats (real-time possible?) would help finding support for FAH at a more institutional level. And for users, it also would be more down-to-earth than the points, somehow.
anandhanju
Posts: 522
Joined: Mon Dec 03, 2007 4:33 am
Location: Australia

Re: Statistics (not the usual...)

Post by anandhanju »

Artemios
Posts: 39
Joined: Tue Dec 10, 2019 3:39 pm
Location: Athens, Greece

Re: Statistics (not the usual...)

Post by Artemios »

There following stats are posted on the F@H site:

https://stats.foldingathome.org/os
(CPU=AMD 2700x) (GPU=2X RX 5700XT) (OS= win10)(client type=beta)
Image
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: Statistics (not the usual...)

Post by ajm »

Thanks!
VAcharonD1
Posts: 14
Joined: Tue Mar 24, 2020 2:46 am

Re: Statistics (not the usual...)

Post by VAcharonD1 »

If I understand correctly, those stats are based on the capacity of all clients that submitted work in the last 50 days ("active CPUs and GPUs"). I don't think it's the kind of real-time data the OP is looking for because not all of those machines will be folding 24/7, some have dropped out for various reasons, some could be double-counted, and there were some very large resources added in the last month but quickly removed (e.g. the points peak on 3/20). It measures what FAH is capable of, but I don't think we have a measure of how much we are actually doing, other than the points.

Of course if I am grossly wrong I welcome correction.
anandhanju
Posts: 522
Joined: Mon Dec 03, 2007 4:33 am
Location: Australia

Re: Statistics (not the usual...)

Post by anandhanju »

https://api.foldingathome.org/os?days=1 gives you data for clients that have returned results in the past day. For the most part, this should eliminate duplicates (e.g., an uninstall and a reinstall). There isn't a way to separate out those that do not fold the entire day but this gives a good idea.
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: Statistics (not the usual...)

Post by ajm »

Hm, I suspect that VAcharonD1 is right. There's that sentence: "FLOPS per core is estimated." It looks more like a broad estimate than like a statistics of the effective performance.
Maybe it should be done the other way around: not from the number and hardware of donors, but from the results that scientists are effectively getting?
Joe_H
Site Admin
Posts: 7937
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Statistics (not the usual...)

Post by Joe_H »

That is the what they are estimating from. They have data estimates of how many FLOPS completing WU's takes depending on the size and how many time steps are involved, and can back out an estimate of the total being done in a second over all the returns.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Nert
Posts: 154
Joined: Wed Mar 26, 2014 7:46 pm

Re: Statistics (not the usual...)

Post by Nert »

I'm with OP, but recognize that this sort of thing is in the lowest priority "nice to have" category. I could imagine a speedometer visualization updated hourly showing # of Tera/Exa flops currently being donated. That would be really cool and generate continuing buzz around the project. In the meantime, let's hope that FAH is successful in fully simulating all 20 proteins in SARS-CoV-2. When that happens, I'm hoping that the recognition resulting from that will make additional resources available that could make something "cool" like real time stats available.
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: Statistics (not the usual...)

Post by ajm »

Joe_H wrote:That is the what they are estimating from. They have data estimates of how many FLOPS completing WU's takes depending on the size and how many time steps are involved, and can back out an estimate of the total being done in a second over all the returns.
OK, thank you to take the time!
Can we then say that this stat: https://stats.foldingathome.org/os shows the theoretical capacity of FAH, as that of Summit, for example, is of 200 petaFLOPS, whereas FAH hasn't been (can't?) be clocked, as Summit has been at 148.6 petaFLOPS?
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: Statistics (not the usual...)

Post by ajm »

Nert wrote:I'm with OP, but recognize that this sort of thing is in the lowest priority "nice to have" category. I could imagine a speedometer visualization updated hourly showing # of Tera/Exa flops currently being donated. That would be really cool and generate continuing buzz around the project. In the meantime, let's hope that FAH is successful in fully simulating all 20 proteins in SARS-CoV-2. When that happens, I'm hoping that the recognition resulting from that will make additional resources available that could make something "cool" like real time stats available.
Yes, agreed 100%. It's just that I'm trying to put together a case for gathering those ressources. I'm looking for solid arguments in favor of distributed supercomputing in general and FAH in particular. There are now more processors at large than anybody can gather in a supercomputer. I think it's the future of the sector, at least for the next 2-3 decades. But it needs convincing.
Neil-B
Posts: 1996
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: Statistics (not the usual...)

Post by Neil-B »

From my relatively non technical perspective it might be reasonable to suggest that the one day stats linked above is a near correlation to a "clocked" figure, whereas the longer time period estimates err towards a correlation with theoretical potential … The one day are estimates of the actual FLOPS needed to complete the work actually done during that period (if I understand it correctly it averages "highs and lows" over the 24hr period so at times it may peak higher or lower so FAH has effectively been "clocked"?

I have watched HPC vendors argue over benchmarking - Each seems to prefer their own subtle definitions and distinctions of how benchmarks are defined and measured - and observed that system performance against real tasks can have significant variance against such benchmarked tests … FAH is real performance against the task that is required - the figures might be inferred estimates of FLOPs but to me that is still a healthy benchmark.

… but irrespective of how things are actually measured I would be love to see (at some point when everything is less loaded) some real time visualisations of various performance metrics with periodic min/max as that would be quite cool (and no doubt non-trivial to actually make happen) … whether it will ever be "worth" doing this is thankfully not my call :)
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)
Nert
Posts: 154
Joined: Wed Mar 26, 2014 7:46 pm

Re: Statistics (not the usual...)

Post by Nert »

ajm wrote:It's just that I'm trying to put together a case for gathering those ressources. I'm looking for solid arguments in favor of distributed supercomputing in general and FAH in particular.
I think the use case will come after FAH delivers a big win for COVID-19. If that happens, I think that resources available for the project will expand a lot. One of the risks is that the success won't be credited to FAH. My concern is that some pharmaceutical company uses the results to develop a treatment for infected patients, but doesn't properly credit the project with providing the research that led to the treatment.
Neil-B wrote: … whether it will ever be "worth" doing this is thankfully not my call
Yup. :D ... appreciate your other comments as well.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Statistics (not the usual...)

Post by PantherX »

I came across this link which has historic information regarding F@H OS Stats from 2013-12-15 to 2017-07-09 and thought it might be insightful: https://app.johanssonrobotics.com/folding/
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Post Reply