Potential issues with work units for project 124xx
Posted: Fri Aug 11, 2023 9:53 am
I have 10 Pi 4Bs doing F@H work.
All errors are "The total potential energy is nan", resulting in WU_STALLED (127 = 0x7f).
Pi 1:
12409 (Run 83, Clone 3, Gen 22) failed with too many errors
Pi 2:
12417 (Run 113, Clone 7, Gen 26) completed with 2 errors
12411 (Run 120, Clone 5, Gen 24) completed without errors
Pi 3:
completed multiple 124xx projects without any errors
Pi 4:
12403 (Run 29, Clone 0, Gen 28) completed with 1 error
12400 (Run 41, Clone 6, Gen 30) completed with 2 errors
Pi 5:
completed multiple 124xx projects without errors before encountering:
12410 (Run 5, Clone 9, Gen 9) completed with 1 error that appears to have stalled the Pi, requiring a reboot.
Pi 7:
completed multiple 124xx projects without any errors
Pi 8:
12419 (Run 121, Clone 8, Gen 27) failed with too many errors
12416 (Run 154, Clone 3, Gen 20) failed with too many errors
12401 (Run 25, Clone 7, Gen 21) completed despite experiencing 9 errors! A reboot mid-project might have helped.
12419 (Run 15, Clone 7, Gen 22) completed with 1 error
12419 (Run 18, Clone 1, Gen 25) failed with too many errors
12400 (Run 80, Clone 1, Gen 23) failed after 10 errors! Reboots mid-project might have delayed the WU failing.
12400 (Run 25, Clone 5, Gen 20) started with 2 errors before completing 1%
While Pi 8's problem might be due to the machine, I don't understand why a machine that keeps failing 124xx projects is assigned only more of them!