Automatically expire WU
Moderators: Site Moderators, FAHC Science Team
Automatically expire WU
I have a WU on my laptop that has an ETA of 1 day but the expiration time is in 1 hour. Why does the software not calculate that there isn't enough time to complete the unit and expire it earlier?
Re: Automatically expire WU
It's maybe because that WU has been in your system for several days, idle/paused? What is its Assigned date?
Re: Automatically expire WU
Expiration times are set long before they're assigned to the user.
I have 2 Atom based (and I should clarify, like 8 years old) systems that, despite having "4 CPUs" (dual core hyperthreaded) and not really doing anything else, they can't complete work assigned to them in time and I had to just shut them down.
I have 2 Atom based (and I should clarify, like 8 years old) systems that, despite having "4 CPUs" (dual core hyperthreaded) and not really doing anything else, they can't complete work assigned to them in time and I had to just shut them down.
Re: Automatically expire WU
The "software" doesn't necessarily know, what caused the time to complete to extend beyond the expiration time. E.g. when a new WU starts, during the first 5% or so, there can be fluctuations of quite a few hours to the ETA.KimboJ wrote:I have a WU on my laptop that has an ETA of 1 day but the expiration time is in 1 hour. Why does the software not calculate that there isn't enough time to complete the unit and expire it earlier?
The software might also not know whether you are running other programs on that computer which take priority and temporarily slow down the progress of that WU. Therefore it won't dispose of the WU before the expiration time, I guess.
Re: Automatically expire WU
There should maybe then be a warning that states 'based on the average, this work unit will not complete in time - abort?'
I had a WU that was only 50% complete and was on full power, and only had 1 hour left before the expiration time, and had an ETA of 1 day. Fluctuations aside, this work would never be able to complete earlier and should have aborted earlier. The ETA is far longer than the real time to the expiration time. It's a waste of resources and makes me less likely to run the software constantly if it has too many flaws like this.
My new unit is already 23% complete and has an ETA of 8 hours, after only a few hours of running.
I had a WU that was only 50% complete and was on full power, and only had 1 hour left before the expiration time, and had an ETA of 1 day. Fluctuations aside, this work would never be able to complete earlier and should have aborted earlier. The ETA is far longer than the real time to the expiration time. It's a waste of resources and makes me less likely to run the software constantly if it has too many flaws like this.
My new unit is already 23% complete and has an ETA of 8 hours, after only a few hours of running.
-
- Posts: 1996
- Joined: Sun Mar 22, 2020 5:52 pm
- Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21 - Location: UK
Re: Automatically expire WU
I agree with you - it would be nice to have some form of "heads up" - it could even go further - "are you sure you want to pause/change power setting/reduce CPU cores … If you do you may not complete WU before Timeout/Deadline" - but not sure realistically how much chance there is of getting this enhancement through or quite how some of the logic might be coded … Would be worth raising an enhancement on GitHub - https://github.com/FoldingAtHome/fah-issues/issues
Could you post your log with both the top 100 lines or so which should have your configuration in it and the part of the log at the start and the end of the WU that took so long - there might have been an issue with the WU if you are completing some WUs in 12 hours - or it may just have been a much "larger" Wu - but could be worth a check? … For guidance on posting logs (if needed) - viewtopic.php?f=61&t=26036
Could you post your log with both the top 100 lines or so which should have your configuration in it and the part of the log at the start and the end of the WU that took so long - there might have been an issue with the WU if you are completing some WUs in 12 hours - or it may just have been a much "larger" Wu - but could be worth a check? … For guidance on posting logs (if needed) - viewtopic.php?f=61&t=26036
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070
(Green/Bold = Active)
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070
(Green/Bold = Active)
-
- Site Moderator
- Posts: 6986
- Joined: Wed Dec 23, 2009 9:33 am
- Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB
Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400 - Location: Land Of The Long White Cloud
- Contact:
Re: Automatically expire WU
Recently, due to the COVID-19 research being time critical, the Timeout was reduced to 1 day to ensure the fastest turnaround for the scientists to work on. Given that part of the COVID-19 research ties in with the Moonshot project (https://covid.postera.ai/covid) where they need data ASAP which has caused a reduction of Timeout only for those specific projects. Other projects for Cancer, and rest do have a reasonable timeout period. Generally speaking, the older the hardware, the more hours it needs to fold. I have read that Core i5/i7 first generation CPUs running 24/7 are able to meet the timeout deadline. Those CPUs are about 10 years old.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time
Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time
Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
-
- Posts: 1996
- Joined: Sun Mar 22, 2020 5:52 pm
- Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21 - Location: UK
Re: Automatically expire WU
Slight digression but relevant in a way ... there is a difference between timeout which is the point at which WU is reissued but still valuable for you to return it and the expiration which is when the client will dump the WU as it no longer has value.
Some people may treat the timeout as if it is the expiration which it isnt ... obviously good to get WU back asap and preferably by timeout but the WU may well still add value right up go expiration.
Some people may treat the timeout as if it is the expiration which it isnt ... obviously good to get WU back asap and preferably by timeout but the WU may well still add value right up go expiration.
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070
(Green/Bold = Active)
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070
(Green/Bold = Active)