I'm not a new user but am curious to know how F@H sieves out completed WUs that are bad.
There are computer users out there who push their CPU, RAM or GPU overclock a little too far, which can result in a WU being completed incorrectly.
(Need not be overclockers but defective CPU, cache and RAM can result in errornous completed WUs)
I've read of WUs ending in EUE (early unit end) etc. in unstable PCs, but do improperly completed WUs always end in EUE?
How does F@H protect against errornous completed WUs?
Moderators: Site Moderators, FAHC Science Team
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: How does F@H protect against errornous completed WUs?
Usually, yes. However there are additional data integrity checks after the WU is uploaded. See the FAH WIKI entry on Common Errors, for the message Server reports problem with this work unit.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Site Moderator
- Posts: 2850
- Joined: Mon Jul 18, 2011 4:44 am
- Hardware configuration: OS: Windows 10, Kubuntu 19.04
CPU: i7-6700k
GPU: GTX 970, GTX 1080 TI
RAM: 24 GB DDR4 - Location: Western Washington
Re: How does F@H protect against errornous completed WUs?
The core does more than just simply run calculations. It also checks to see if the values it's working with makes sense. Are these numbers legal and in their proper range? Are the atoms and forces reasonably configured? Do the integrity checks pass? Are the checkpoint hashsums correct? etc etc. As 7im said, some of this is also done server-side.
There's also a function in place to send out WUs again when someone encounters a problem with it. If others successfully complete it, then it's likely a hardware problem for that first user. If not, then it's a bad WU and a researcher needs to debug the problem.
There's also a function in place to send out WUs again when someone encounters a problem with it. If others successfully complete it, then it's likely a hardware problem for that first user. If not, then it's a bad WU and a researcher needs to debug the problem.
F@h is now the top computing platform on the planet and nothing unites people like a dedicated fight against a common enemy. This virus affects all of us. Lets end it together.
Re: How does F@H protect against errornous completed WUs?
When a DC project is originally designed, they can take one of two opposing approaches: They can spend a lot of time and effort on error checking or they can waste an appreciable fraction of the donor's efforts by reprocessing everything and confirming the data is repeatable. FAH put their money into error checking as a way to minimize (as much as possible) the number of WUs that need to be reprocessed. To make that work, they have to have validation tests that can be trusted to detect errors earlier rather than later (i.e.- a bad WU should NOT be completed) so that good results only need to be processed once. This will also minimize the number of errors that make it all the way to the server-based checking stage.
Unfortunately, in addition to hardware errors and a small percentage of WUs which are inherently unstable, there is also the possibility of hacking. FAH points have an intrinsic value even less than bitcoin, but that doesn't guarantee that all donors are ethical. Extra levels of error checking are also a lot like locking your house when you're away. If everybody is ethical, that wouldn't be necessary. If a occasional thief happens to visit, the person in the neighborhood with the best security system will have the lowest chance of being broken into.
Unfortunately, in addition to hardware errors and a small percentage of WUs which are inherently unstable, there is also the possibility of hacking. FAH points have an intrinsic value even less than bitcoin, but that doesn't guarantee that all donors are ethical. Extra levels of error checking are also a lot like locking your house when you're away. If everybody is ethical, that wouldn't be necessary. If a occasional thief happens to visit, the person in the neighborhood with the best security system will have the lowest chance of being broken into.
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.
Re: How does F@H protect against errornous completed WUs?
Thanks for the informative replies.