Page 1 of 1

WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)

Posted: Mon Dec 07, 2020 8:46 am
by superpan
Hi, a newbie here !
I have had several errors like this.
My question is:
  • Is this a problem with my system or the work unit ?
    What, if any, action should I take ?

Code: Select all

07:10:27:WU00:FS00:FahCore 0xa7 started
07:10:28:WU00:FS00:0xa7:*********************** Log Started 2020-12-07T07:10:27Z ***********************
07:10:28:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
07:10:28:WU00:FS00:0xa7:       Type: 0xa7
07:10:28:WU00:FS00:0xa7:       Core: Gromacs
07:10:28:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 706 -lifeline 223503 -checkpoint 15 -np
07:10:28:WU00:FS00:0xa7:             16
07:10:28:WU00:FS00:0xa7:************************************ CBang *************************************
07:10:28:WU00:FS00:0xa7:       Date: Nov 27 2019
07:10:28:WU00:FS00:0xa7:       Time: 11:26:54
07:10:28:WU00:FS00:0xa7:   Revision: d25803215b59272441049dfa05a0a9bf7a6e3c48
07:10:28:WU00:FS00:0xa7:     Branch: master
07:10:28:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
07:10:28:WU00:FS00:0xa7:    Options: -std=c++11 -ffunction-sections -fdata-sections -O3 -funroll-loops
07:10:28:WU00:FS00:0xa7:             -fno-pie -fPIC
07:10:28:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
07:10:28:WU00:FS00:0xa7:       Bits: 64
07:10:28:WU00:FS00:0xa7:       Mode: Release
07:10:28:WU00:FS00:0xa7:************************************ System ************************************
07:10:28:WU00:FS00:0xa7:        CPU: AMD Ryzen 7 3700X 8-Core Processor
07:10:28:WU00:FS00:0xa7:     CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
07:10:28:WU00:FS00:0xa7:       CPUs: 16
07:10:28:WU00:FS00:0xa7:     Memory: 31.37GiB
07:10:28:WU00:FS00:0xa7:Free Memory: 591.95MiB
07:10:28:WU00:FS00:0xa7:    Threads: POSIX_THREADS
07:10:28:WU00:FS00:0xa7: OS Version: 5.8
07:10:28:WU00:FS00:0xa7:Has Battery: false
07:10:28:WU00:FS00:0xa7: On Battery: false
07:10:28:WU00:FS00:0xa7: UTC Offset: 0
07:10:28:WU00:FS00:0xa7:        PID: 223507
07:10:28:WU00:FS00:0xa7:        CWD: /var/lib/fahclient/work
07:10:28:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
07:10:28:WU00:FS00:0xa7:    Version: 0.0.19
07:10:28:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
07:10:28:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
07:10:28:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
07:10:28:WU00:FS00:0xa7:       Date: Nov 26 2019
07:10:28:WU00:FS00:0xa7:       Time: 00:41:42
07:10:28:WU00:FS00:0xa7:   Revision: d5b5c747532224f986b7cd02c968ed9a20c16d6e
07:10:28:WU00:FS00:0xa7:     Branch: master
07:10:28:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
07:10:28:WU00:FS00:0xa7:    Options: -std=c++11 -ffunction-sections -fdata-sections -O3 -funroll-loops
07:10:28:WU00:FS00:0xa7:             -fno-pie
07:10:28:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
07:10:28:WU00:FS00:0xa7:       Bits: 64
07:10:28:WU00:FS00:0xa7:       Mode: Release
07:10:28:WU00:FS00:0xa7:************************************ Build *************************************
07:10:28:WU00:FS00:0xa7:       SIMD: avx_256
07:10:28:WU00:FS00:0xa7:********************************************************************************
07:10:28:WU00:FS00:0xa7:Project: 16927 (Run 3, Clone 102, Gen 28)
07:10:28:WU00:FS00:0xa7:Unit: 0x0000001d8120d1c90000000000030066
07:10:28:WU00:FS00:0xa7:Digital signatures verified
07:10:28:WU00:FS00:0xa7:Calling: mdrun -s frame28.tpr -o frame28.trr -cpi state.cpt -cpt 15 -nt 16
07:10:28:WU00:FS00:0xa7:Steps: first=14000000 total=500000
07:10:29:WU00:FS00:0xa7:Completed 150061 out of 500000 steps (30%)
....
07:53:08:WU00:FS00:0xa7:Completed 370000 out of 500000 steps (74%)
07:54:06:WU00:FS00:0xa7:Completed 375000 out of 500000 steps (75%)
07:54:22:WU00:FS00:0xa7:ERROR:
07:54:22:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
07:54:22:WU00:FS00:0xa7:ERROR:Program GROMACS, VERSION 5.0.4-20191026-456f0d636-unknown
07:54:22:WU00:FS00:0xa7:ERROR:Source code file: /host/debian-stable-64bit-core-a7-avx-release/gromacs-core/build/gromacs/src/gromacs/mdlib/pme.c, line: 754
07:54:22:WU00:FS00:0xa7:ERROR:
07:54:22:WU00:FS00:0xa7:ERROR:Fatal error:
07:54:22:WU00:FS00:0xa7:ERROR:1 particles communicated to PME rank 10 are more than 2/3 times the cut-off out of the domain decomposition cell of their charge group in dimension x.
07:54:22:WU00:FS00:0xa7:ERROR:This usually means that your system is not well equilibrated.
07:54:22:WU00:FS00:0xa7:ERROR:For more information and tips for troubleshooting, please check the GROMACS
07:54:22:WU00:FS00:0xa7:ERROR:website at http://www.gromacs.org/Documentation/Errors
07:54:22:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
07:54:28:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)


Re: WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)

Posted: Mon Dec 07, 2020 9:31 pm
by Joe_H
This appears to be a problem with the WU, possibly connected with the number of CPU threads being used in this case - 16. Usually you would not need to take any action, after a few retries the client should return the WU including the errors logged for the researcher to check on. On Linux systems there is a bug where some errors are treated as minor, and the client may loop instead of moving on to another WU. If that happens we can give directions for how to clear that up.

In the case it appears the WU was returned by you, it can be looked up in the stats database here - https://apps.foldingathome.org/wu#proje ... 102&gen=28.

Re: WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)

Posted: Tue Dec 08, 2020 12:17 am
by superpan
Thank-you for your answer. Appreciated.