12474 crashes repeatedly

Moderators: Site Moderators, FAHC Science Team

Post Reply
bikeaddict
Posts: 214
Joined: Sun May 03, 2020 1:20 am

12474 crashes repeatedly

Post by bikeaddict »

On two machines, project 12474 crashes repeatedly with FahCore returned: INTERRUPTED (102 = 0x66). Short excerpts of logs below.

Code: Select all

03:42:42:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:12474 run:24 clone:5 gen:104 core:0xa8 unit:0x680000000500000018000000ba300000
03:42:42:WU01:FS00:Starting
03:42:42:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 01 -suffix 01 -version 706 -lifeline 7133 -checkpoint 15 -np 31
03:42:42:WU01:FS00:Started FahCore on PID 997539
03:42:42:WU01:FS00:Core PID:997543
03:42:42:WU01:FS00:FahCore 0xa8 started
03:42:43:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
03:42:43:WU01:FS00:Starting
03:42:43:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 01 -suffix 01 -version 706 -lifeline 7133 -checkpoint 15 -np 31
03:42:43:WU01:FS00:Started FahCore on PID 997561
03:42:43:WU01:FS00:Core PID:997565
03:42:43:WU01:FS00:FahCore 0xa8 started
03:42:44:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
03:42:52:WU00:FS02:0x24:Completed 750000 out of 5000000 steps (15%)
03:42:52:WU00:FS02:0x24:Checkpoint completed at step 750000
03:43:43:WU01:FS00:Starting
03:43:43:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 01 -suffix 01 -version 706 -lifeline 7133 -checkpoint 15 -np 31
03:43:43:WU01:FS00:Started FahCore on PID 997582
03:43:43:WU01:FS00:Core PID:997586
03:43:43:WU01:FS00:FahCore 0xa8 started
03:43:44:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
03:44:43:WU01:FS00:Starting
03:44:43:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 01 -suffix 01 -version 706 -lifeline 7133 -checkpoint 15 -np 31
03:44:43:WU01:FS00:Started FahCore on PID 997604
03:44:43:WU01:FS00:Core PID:997608
03:44:43:WU01:FS00:FahCore 0xa8 started
03:44:44:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
03:44:59:WU00:FS02:0x24:Completed 800000 out of 5000000 steps (16%)

Code: Select all

07:06:26:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:12474 run:12 clone:6 gen:80 core:0xa8 unit:0x50000000060000000c000000ba300000
07:06:26:WU02:FS01:Starting
07:06:26:WU02:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 02 -suffix 01 -version 706 -lifeline 1811 -checkpoint 15 -np 31
07:06:26:WU02:FS01:Started FahCore on PID 1366389
07:06:26:WU02:FS01:Core PID:1366393
07:06:26:WU02:FS01:FahCore 0xa8 started
07:06:26:WU02:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
07:06:27:WU02:FS01:Starting
07:06:27:WU02:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 02 -suffix 01 -version 706 -lifeline 1811 -checkpoint 15 -np 31
07:06:27:WU02:FS01:Started FahCore on PID 1366410
07:06:27:WU02:FS01:Core PID:1366414
07:06:27:WU02:FS01:FahCore 0xa8 started
07:06:27:WU02:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
07:06:52:WU01:FS00:0x23:Completed 87500 out of 1250000 steps (7%)
07:07:27:WU02:FS01:Starting
07:07:27:WU02:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 02 -suffix 01 -version 706 -lifeline 1811 -checkpoint 15 -np 31
07:07:27:WU02:FS01:Started FahCore on PID 1366432
07:07:27:WU02:FS01:Core PID:1366436
07:07:27:WU02:FS01:FahCore 0xa8 started
07:07:27:WU02:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
07:07:34:WU01:FS00:0x23:Completed 100000 out of 1250000 steps (8%)
07:07:35:WU01:FS00:0x23:Checkpoint completed at step 100000
07:08:17:WU01:FS00:0x23:Completed 112500 out of 1250000 steps (9%)
07:08:27:WU02:FS01:Starting
07:08:27:WU02:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 02 -suffix 01 -version 706 -lifeline 1811 -checkpoint 15 -np 31
07:08:27:WU02:FS01:Started FahCore on PID 1366454
07:08:27:WU02:FS01:Core PID:1366458
07:08:27:WU02:FS01:FahCore 0xa8 started
07:08:27:WU02:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
07:08:59:WU01:FS00:0x23:Completed 125000 out of 1250000 steps (10%)
07:08:59:WU01:FS00:0x23:Checkpoint completed at step 125000
muziqaz
Posts: 1722
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 9950x, 7950x3D, 5950x, 5800x3D
7900xtx, RX9070, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: 12474 crashes repeatedly

Post by muziqaz »

Permission issue?
Unstable machine?
Try deleting /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8
directory and restart fah-client, and let it download the core again
FAH Omega tester
Image
bikeaddict
Posts: 214
Joined: Sun May 03, 2020 1:20 am

Re: 12474 crashes repeatedly

Post by bikeaddict »

Every other project on these machines has been fine for over a year.

Also curious that one WU was assigned to another user at almost the same time who completed it in 6.5 minutes for 1.89M points.

https://apps.foldingathome.org/wu#proje ... =5&gen=104
arisu
Posts: 466
Joined: Mon Feb 24, 2025 11:11 pm

Re: 12474 crashes repeatedly

Post by arisu »

bikeaddict wrote: Tue May 20, 2025 1:32 pm Every other project on these machines has been fine for over a year.

Also curious that one WU was assigned to another user at almost the same time who completed it in 6.5 minutes for 1.89M points.

https://apps.foldingathome.org/wu#proje ... =5&gen=104
That has to be a server bug. That equates to over 421M PPD on a CPU, which is plainly impossible. When that user returned their WU, it was re-sent to you, so that means they must have failed it and the server improperly credited their failed/dumped WU as a success.
Post Reply