core22 0.0.10 released to full FAH!

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

Crawdaddy79
Posts: 73
Joined: Sat Mar 21, 2020 3:56 pm

Re: core22 0.0.10 released to full FAH!

Post by Crawdaddy79 »

There is no real difference in performance between this project (13415) and 13409, which I and other high-end AMD GPU users have complained about in this thread. I have gotten 13415 WUs exclusively since Friday night. It's killed my PPD, but it's also helped me cope with not being so obsessed with my points. My WU count has jumped, so there's that, I guess.
HaloJones
Posts: 906
Joined: Thu Jul 24, 2008 10:16 am

Re: core22 0.0.10 released to full FAH!

Post by HaloJones »

FAH has been working with "standard users" for over a decade. It relies on simple installation (which it is for most people), and no monitoring, fine tuning or controlling the donors' systems. I don't understand why you think that has to change?
single 1070

ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: core22 0.0.10 released to full FAH!

Post by ajm »

It doesn't have to change, but it does have a real chance to get to a new level. Hardware and software evolve constantly, and so should FAH.
HaloJones
Posts: 906
Joined: Thu Jul 24, 2008 10:16 am

Re: core22 0.0.10 released to full FAH!

Post by HaloJones »

ajm wrote: It doesn't have to change, but it does have a real chance to get to a new level. Hardware and software evolve constantly, and so should FAH.
Change has to be for a beneficial reason that can justify the opportunity cost. I would love to see a solution that aligns work perfectly with the capacity of each donor, but FAH cannot know for sure that a particular piece of hardware will be used 24/7, dedicated exclusively to FAH. This has a huge impact on the value of the donor's hardware.
single 1070

ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: core22 0.0.10 released to full FAH!

Post by ajm »

Indeed. But this aspect can be handled with a couple of check boxes somewhere and then maybe an algorithm based on the observed behavior of the slot. There are countless such problems to solve, of course, and, yes, it would be a very ambitious endeavor. But then FAH is the most powerful supercomputer on Earth, after all. Huge resources have been mobilized in recent months, by individuals and enterprises. This shows that such a project can gather energy and goodwill.
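
To make the "check box plus observed behavior" idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the class name, the starting values, and the smoothing factor are hypothetical and do not describe anything in the actual FAH client or servers. The check box only sets the starting guess; an exponential moving average of observed uptime then corrects it.

```python
from dataclasses import dataclass


@dataclass
class SlotAvailability:
    """Hypothetical per-slot tracker; not part of any FAH software."""
    estimate: float = 0.5  # current guess at the fraction of time the slot folds
    alpha: float = 0.2     # smoothing factor (assumed value)

    @classmethod
    def from_checkbox(cls, dedicated: bool) -> "SlotAvailability":
        # The donor's check box only sets the prior (assumed values).
        return cls(estimate=0.9 if dedicated else 0.5)

    def observe(self, hours_active: float, hours_in_window: float = 24.0) -> float:
        # Clamp the observed fraction to [0, 1], then blend it into the estimate.
        fraction = max(0.0, min(1.0, hours_active / hours_in_window))
        self.estimate = (1 - self.alpha) * self.estimate + self.alpha * fraction
        return self.estimate


slot = SlotAvailability.from_checkbox(dedicated=False)
slot.observe(24.0)  # a full day of folding pulls the estimate up
slot.observe(0.0)   # a day switched off pulls it back down
```

A scheme like this would still only be a guess about future behavior, which is the uncertainty raised above.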

As for the deployment solution, many people came to this forum to ask for it, because they had the hardware but not enough time on their hands for configuring hundreds of kits one after the other, and then monitoring them efficiently.
TPL
Posts: 103
Joined: Sun Apr 19, 2020 11:37 am

Re: core22 0.0.10 released to full FAH!

Post by TPL »

One important question to ask is how long those big companies will stay interested in folding. Will it last once SARS-CoV-2 is under control some day?

How much money and effort is reasonable to spend hoping they will stay?
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: core22 0.0.10 released to full FAH!

Post by ajm »

The truth is that we don't know. What we do know is that they did come. FAH asked for donors and they came. Then FAH asked for servers and they gave them.
Now, if FAH asks for programming manpower, with a couple of interesting forward-looking projects, the chances are good that they will come.
HugoNotte
Posts: 66
Joined: Tue Apr 07, 2020 7:09 pm

Re: core22 0.0.10 released to full FAH!

Post by HugoNotte »

ajm wrote: The truth is that we don't know. What we do know is that they did come. FAH asked for donors and they came. Then FAH asked for servers and they gave them.
Now, if FAH asks for programming manpower, with a couple interesting forward-looking projects, the chances are good that they come.
A lot of server and computing capacity has been made available by companies because shutdowns left their hardware idle. Others have donated resources in the hope that it will shorten and lessen the economic impact. I expect that as soon as things return to normal, a lot of resources will be withdrawn from FAH again because they will be needed to generate profit.
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: core22 0.0.10 released to full FAH!

Post by ajm »

Yes, that is highly probable. But we can argue that the virus will come back, this one or another one, and that we'd better be really prepared then. That is another good reason to shift the focus to programming solutions that will make FAH more efficient and reactive.
TPL
Posts: 103
Joined: Sun Apr 19, 2020 11:37 am

Re: core22 0.0.10 released to full FAH!

Post by TPL »

I don't believe in Santa Claus anymore either.
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: core22 0.0.10 released to full FAH!

Post by ajm »

Well, I'd say that there are quite a few Santas here, on this forum... :D

Besides, large companies have budgets for this kind of thing. Call it marketing. They may want to be associated with FAH and the general idea of fighting diseases and protecting the population, be it by lending hardware, bandwidth or manpower.
TPL
Posts: 103
Joined: Sun Apr 19, 2020 11:37 am

Re: core22 0.0.10 released to full FAH!

Post by TPL »

Marketing? That's pretty much where I see a problem here. Once people lose interest, there's no marketing value any more...
Joe_H
Site Admin
Posts: 7937
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: core22 0.0.10 released to full FAH!

Post by Joe_H »

Yes, there has been a variety of resources provided to F@h, over and above prior support given. But that support has to be coordinated by a relatively small group, and any changes, enhancements, or other modifications still need testing before being put out to an active client/server system. From posts here it should be fairly obvious that small changes can have a major impact on how the system works. Much of what is being requested is in the area of large changes; those are going to take longer, and many will be held back from deployment so as not to cause major disruptions during this period.

Depending on how much of that outside support stays around after the COVID-19 push, F@h may also have to pick which changes to put out based on what they can support.

Finally, in one way COVID-19 came along at a poor time for F@h. It is in the middle of a transition from being mostly based out of Dr Pande's lab at Stanford to a more distributed model. Work was in progress to make the software more open source so that others could contribute easily; that is mostly on hold while working to meet the current needs for servers and getting projects ready. There is more to come out in the near future, but probably not on the server side of things. These things will be announced as they are ready to be tested and then put into production once validated. Some may not make it out at all.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Nuitari
Posts: 78
Joined: Sun Jun 09, 2019 4:03 am
Hardware configuration: 1x Nvidia 1050ti
1x Nvidia 1660Super
1x Nvidia GTX 660
1x Nvidia 1060 3gb
1x AMD rx570
2x AMD rx560
1x AMD Ryzen 7 PRO 1700
1x AMD Ryzen 7 3700X
1x AMD Phenom II
1x AMD A8-9600
1x Intel i5-4590S

Re: core22 0.0.10 released to full FAH!

Post by Nuitari »

I had a power failure and when the power came back all 6 units that were interrupted went faulty with "Forces are blowing up!" when processing resumed.
Now I have 7 slots with the 13415 projects. Going to be like those horse races to see which one finishes first :)
foldy
Posts: 2040
Joined: Sat Dec 01, 2012 3:43 pm
Hardware configuration: Folding@Home Client 7.6.13 (1 GPU slots)
Windows 7 64bit
Intel Core i5 2500k@4Ghz
Nvidia gtx 1080ti driver 441

Re: core22 0.0.10 released to full FAH!

Post by foldy »

After Standby/Hibernate and resume on Windows, the GPU FahCore_22 0.0.10 shows an error and resumes from the last checkpoint. But after several Standby/Hibernate cycles the work unit gets dumped as BAD because the maximum number of error retries is reached. Standby/Hibernate should not count as an error.
https://github.com/FoldingAtHome/fah-issues/issues/1529
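
The requested behavior could look roughly like the sketch below. This is only an illustration in Python under stated assumptions: the class, the retry limit of 3, and the `after_suspend` flag are all hypothetical, and nothing here reflects how FahCore_22 or the client actually counts errors or detects a resume from sleep.

```python
from dataclasses import dataclass


@dataclass
class RetryPolicy:
    """Hypothetical retry counter; not taken from FahCore_22 or the client."""
    max_retries: int = 3  # assumed limit; the real value is not given in this thread
    errors: int = 0

    def record_failure(self, after_suspend: bool) -> str:
        # A failure right after Standby/Hibernate is not charged against the WU.
        if not after_suspend:
            self.errors += 1
        if self.errors >= self.max_retries:
            return "dump"
        return "resume_from_checkpoint"


policy = RetryPolicy()
for _ in range(10):
    # Ten sleep/resume cycles in a row never dump the work unit.
    assert policy.record_failure(after_suspend=True) == "resume_from_checkpoint"
```

Genuine computation errors would still count toward the limit, so a truly bad work unit would be dumped as before.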