Page 1 of 1

FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 1:38 pm
by zookeeny
Hi - I've been running FaH for a couple of weeks now with no problem, but now my machine is running a task where it's at 0% CPU for about a minute, then 100% for 7-8 seconds, repeating over and over again. This has been going on for about 12 hours now. The progress bar is progressing very slowly during the 100% CPU times (up to 4% now), but drops back to 0% when the CPU usage goes back to 0%. Is this normal behavior, or do I need to reinstall my client? Thanks!

Re: FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 1:56 pm
by Neil-B
Welcome to the Forums:

Could you please post a log - viewtopic.php?f=24&t=26036 - with that someone may be able to see what is going on and help you.

Re: FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 2:05 pm
by zookeeny
Sure thing - here are the last 150 lines from log.txt.

Code: Select all

13:56:49:WU00:FS00:0xa7:       Core: Gromacs
13:56:49:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 705 -lifeline 3437 -checkpoint 15 -np
13:56:49:WU00:FS00:0xa7:             24
13:56:49:WU00:FS00:0xa7:************************************ CBang *************************************
13:56:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:56:49:WU00:FS00:0xa7:       Time: 06:06:57
13:56:49:WU00:FS00:0xa7:   Revision: 46c96f1aa8419571d83f3e63f9c99a0d602f6da9
13:56:49:WU00:FS00:0xa7:     Branch: master
13:56:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:56:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie -fPIC
13:56:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:56:49:WU00:FS00:0xa7:       Bits: 64
13:56:49:WU00:FS00:0xa7:       Mode: Release
13:56:49:WU00:FS00:0xa7:************************************ System ************************************
13:56:49:WU00:FS00:0xa7:        CPU: AMD Ryzen 9 3900X 12-Core Processor
13:56:49:WU00:FS00:0xa7:     CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
13:56:49:WU00:FS00:0xa7:       CPUs: 24
13:56:49:WU00:FS00:0xa7:     Memory: 31.37GiB
13:56:49:WU00:FS00:0xa7:Free Memory: 26.43GiB
13:56:49:WU00:FS00:0xa7:    Threads: POSIX_THREADS
13:56:49:WU00:FS00:0xa7: OS Version: 5.3
13:56:49:WU00:FS00:0xa7:Has Battery: false
13:56:49:WU00:FS00:0xa7: On Battery: false
13:56:49:WU00:FS00:0xa7: UTC Offset: -4
13:56:49:WU00:FS00:0xa7:        PID: 3441
13:56:49:WU00:FS00:0xa7:        CWD: /var/lib/fahclient/work
13:56:49:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
13:56:49:WU00:FS00:0xa7:    Version: 0.0.18
13:56:49:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
13:56:49:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
13:56:49:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
13:56:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:56:49:WU00:FS00:0xa7:       Time: 06:13:26
13:56:49:WU00:FS00:0xa7:   Revision: 490c9aa2957b725af319379424d5c5cb36efb656
13:56:49:WU00:FS00:0xa7:     Branch: master
13:56:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:56:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie
13:56:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:56:49:WU00:FS00:0xa7:       Bits: 64
13:56:49:WU00:FS00:0xa7:       Mode: Release
13:56:49:WU00:FS00:0xa7:************************************ Build *************************************
13:56:49:WU00:FS00:0xa7:       SIMD: avx_256
13:56:49:WU00:FS00:0xa7:********************************************************************************
13:56:49:WU00:FS00:0xa7:Project: 14576 (Run 0, Clone 3083, Gen 10)
13:56:49:WU00:FS00:0xa7:Unit: 0x00000011287234c95e7b86ce0c65d526
13:56:49:WU00:FS00:0xa7:Reading tar file core.xml
13:56:49:WU00:FS00:0xa7:Reading tar file frame10.tpr
13:56:49:WU00:FS00:0xa7:Digital signatures verified
13:56:49:WU00:FS00:0xa7:Calling: mdrun -s frame10.tpr -o frame10.trr -x frame10.xtc -cpt 15 -nt 24
13:56:49:WU00:FS00:0xa7:Steps: first=5000000 total=500000
13:56:49:WU00:FS00:0xa7:ERROR:
13:56:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:56:49:WU00:FS00:0xa7:ERROR:Program GROMACS, VERSION 5.0.4-20191026-456f0d636-unknown
13:56:49:WU00:FS00:0xa7:ERROR:Source code file: /host/debian-stable-64bit-core-a7-avx-release/gromacs-core/build/gromacs/src/gromacs/mdlib/domdec.c, line: 6902
13:56:49:WU00:FS00:0xa7:ERROR:
13:56:49:WU00:FS00:0xa7:ERROR:Fatal error:
13:56:49:WU00:FS00:0xa7:ERROR:There is no domain decomposition for 20 ranks that is compatible with the given box and a minimum cell size of 1.37225 nm
13:56:49:WU00:FS00:0xa7:ERROR:Change the number of ranks or mdrun option -rcon or -dds or your LINCS settings
13:56:49:WU00:FS00:0xa7:ERROR:Look in the log file for details on the domain decomposition
13:56:49:WU00:FS00:0xa7:ERROR:For more information and tips for troubleshooting, please check the GROMACS
13:56:49:WU00:FS00:0xa7:ERROR:website at http://www.gromacs.org/Documentation/Errors
13:56:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:56:54:WU00:FS00:0xa7:WARNING:Unexpected exit() call
13:56:54:WU00:FS00:0xa7:WARNING:Unexpected exit from science code
13:56:54:WU00:FS00:0xa7:Saving result file ../logfile_01.txt
13:56:54:WU00:FS00:0xa7:Saving result file md.log
13:56:54:WU00:FS00:0xa7:Saving result file science.log
13:56:54:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
13:57:49:WU00:FS00:Starting
13:57:49:WU00:FS00:Removing old file './work/00/logfile_01-20200403-132547.txt'
13:57:49:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/avx/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 705 -lifeline 2123 -checkpoint 15 -np 24
13:57:49:WU00:FS00:Started FahCore on PID 3503
13:57:49:WU00:FS00:Core PID:3507
13:57:49:WU00:FS00:FahCore 0xa7 started
13:57:49:WU00:FS00:0xa7:*********************** Log Started 2020-04-03T13:57:49Z ***********************
13:57:49:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
13:57:49:WU00:FS00:0xa7:       Type: 0xa7
13:57:49:WU00:FS00:0xa7:       Core: Gromacs
13:57:49:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 705 -lifeline 3503 -checkpoint 15 -np
13:57:49:WU00:FS00:0xa7:             24
13:57:49:WU00:FS00:0xa7:************************************ CBang *************************************
13:57:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:57:49:WU00:FS00:0xa7:       Time: 06:06:57
13:57:49:WU00:FS00:0xa7:   Revision: 46c96f1aa8419571d83f3e63f9c99a0d602f6da9
13:57:49:WU00:FS00:0xa7:     Branch: master
13:57:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:57:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie -fPIC
13:57:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:57:49:WU00:FS00:0xa7:       Bits: 64
13:57:49:WU00:FS00:0xa7:       Mode: Release
13:57:49:WU00:FS00:0xa7:************************************ System ************************************
13:57:49:WU00:FS00:0xa7:        CPU: AMD Ryzen 9 3900X 12-Core Processor
13:57:49:WU00:FS00:0xa7:     CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
13:57:49:WU00:FS00:0xa7:       CPUs: 24
13:57:49:WU00:FS00:0xa7:     Memory: 31.37GiB
13:57:49:WU00:FS00:0xa7:Free Memory: 26.41GiB
13:57:49:WU00:FS00:0xa7:    Threads: POSIX_THREADS
13:57:49:WU00:FS00:0xa7: OS Version: 5.3
13:57:49:WU00:FS00:0xa7:Has Battery: false
13:57:49:WU00:FS00:0xa7: On Battery: false
13:57:49:WU00:FS00:0xa7: UTC Offset: -4
13:57:49:WU00:FS00:0xa7:        PID: 3507
13:57:49:WU00:FS00:0xa7:        CWD: /var/lib/fahclient/work
13:57:49:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
13:57:49:WU00:FS00:0xa7:    Version: 0.0.18
13:57:49:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
13:57:49:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
13:57:49:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
13:57:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:57:49:WU00:FS00:0xa7:       Time: 06:13:26
13:57:49:WU00:FS00:0xa7:   Revision: 490c9aa2957b725af319379424d5c5cb36efb656
13:57:49:WU00:FS00:0xa7:     Branch: master
13:57:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:57:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie
13:57:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:57:49:WU00:FS00:0xa7:       Bits: 64
13:57:49:WU00:FS00:0xa7:       Mode: Release
13:57:49:WU00:FS00:0xa7:************************************ Build *************************************
13:57:49:WU00:FS00:0xa7:       SIMD: avx_256
13:57:49:WU00:FS00:0xa7:********************************************************************************
13:57:49:WU00:FS00:0xa7:Project: 14576 (Run 0, Clone 3083, Gen 10)
13:57:49:WU00:FS00:0xa7:Unit: 0x00000011287234c95e7b86ce0c65d526
13:57:49:WU00:FS00:0xa7:Reading tar file core.xml
13:57:49:WU00:FS00:0xa7:Reading tar file frame10.tpr
13:57:49:WU00:FS00:0xa7:Digital signatures verified
13:57:49:WU00:FS00:0xa7:Calling: mdrun -s frame10.tpr -o frame10.trr -x frame10.xtc -cpt 15 -nt 24
13:57:49:WU00:FS00:0xa7:Steps: first=5000000 total=500000
13:57:49:WU00:FS00:0xa7:ERROR:
13:57:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:57:49:WU00:FS00:0xa7:ERROR:Program GROMACS, VERSION 5.0.4-20191026-456f0d636-unknown
13:57:49:WU00:FS00:0xa7:ERROR:Source code file: /host/debian-stable-64bit-core-a7-avx-release/gromacs-core/build/gromacs/src/gromacs/mdlib/domdec.c, line: 6902
13:57:49:WU00:FS00:0xa7:ERROR:
13:57:49:WU00:FS00:0xa7:ERROR:Fatal error:
13:57:49:WU00:FS00:0xa7:ERROR:There is no domain decomposition for 20 ranks that is compatible with the given box and a minimum cell size of 1.37225 nm
13:57:49:WU00:FS00:0xa7:ERROR:Change the number of ranks or mdrun option -rcon or -dds or your LINCS settings
13:57:49:WU00:FS00:0xa7:ERROR:Look in the log file for details on the domain decomposition
13:57:49:WU00:FS00:0xa7:ERROR:For more information and tips for troubleshooting, please check the GROMACS
13:57:49:WU00:FS00:0xa7:ERROR:website at http://www.gromacs.org/Documentation/Errors
13:57:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:57:54:WU00:FS00:0xa7:WARNING:Unexpected exit() call
13:57:54:WU00:FS00:0xa7:WARNING:Unexpected exit from science code
13:57:54:WU00:FS00:0xa7:Saving result file ../logfile_01.txt
13:57:54:WU00:FS00:0xa7:Saving result file md.log
13:57:54:WU00:FS00:0xa7:Saving result file science.log
13:57:54:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
13:58:49:WU00:FS00:Starting
13:58:49:WU00:FS00:Removing old file './work/00/logfile_01-20200403-132647.txt'
13:58:49:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/avx/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 705 -lifeline 2123 -checkpoint 15 -np 24
13:58:49:WU00:FS00:Started FahCore on PID 3618
13:58:49:WU00:FS00:Core PID:3622
13:58:49:WU00:FS00:FahCore 0xa7 started
13:58:49:WU00:FS00:0xa7:*********************** Log Started 2020-04-03T13:58:49Z ***********************
13:58:49:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
13:58:49:WU00:FS00:0xa7:       Type: 0xa7
13:58:49:WU00:FS00:0xa7:       Core: Gromacs
13:58:49:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 705 -lifeline 3618 -checkpoint 15 -np
13:58:49:WU00:FS00:0xa7:             24
13:58:49:WU00:FS00:0xa7:************************************ CBang *************************************
13:58:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:58:49:WU00:FS00:0xa7:       Time: 06:06:57
13:58:49:WU00:FS00:0xa7:   Revision: 46c96f1aa8419571d83f3e63f9c99a0d602f6da9
13:58:49:WU00:FS00:0xa7:     Branch: master
13:58:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:58:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie -fPIC
13:58:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:58:49:WU00:FS00:0xa7:       Bits: 64
13:58:49:WU00:FS00:0xa7:       Mode: Release
13:58:49:WU00:FS00:0xa7:************************************ System ************************************
13:58:49:WU00:FS00:0xa7:        CPU: AMD Ryzen 9 3900X 12-Core Processor
13:58:49:WU00:FS00:0xa7:     CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
13:58:49:WU00:FS00:0xa7:       CPUs: 24
13:58:49:WU00:FS00:0xa7:     Memory: 31.37GiB
13:58:49:WU00:FS00:0xa7:Free Memory: 26.38GiB
13:58:49:WU00:FS00:0xa7:    Threads: POSIX_THREADS
13:58:49:WU00:FS00:0xa7: OS Version: 5.3
13:58:49:WU00:FS00:0xa7:Has Battery: false
13:58:49:WU00:FS00:0xa7: On Battery: false
13:58:49:WU00:FS00:0xa7: UTC Offset: -4
13:58:49:WU00:FS00:0xa7:        PID: 3622
13:58:49:WU00:FS00:0xa7:        CWD: /var/lib/fahclient/work
13:58:49:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
13:58:49:WU00:FS00:0xa7:    Version: 0.0.18
13:58:49:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
13:58:49:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
13:58:49:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
13:58:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:58:49:WU00:FS00:0xa7:       Time: 06:13:26
13:58:49:WU00:FS00:0xa7:   Revision: 490c9aa2957b725af319379424d5c5cb36efb656
13:58:49:WU00:FS00:0xa7:     Branch: master
13:58:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:58:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie
13:58:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:58:49:WU00:FS00:0xa7:       Bits: 64
13:58:49:WU00:FS00:0xa7:       Mode: Release
13:58:49:WU00:FS00:0xa7:************************************ Build *************************************
13:58:49:WU00:FS00:0xa7:       SIMD: avx_256
13:58:49:WU00:FS00:0xa7:********************************************************************************
13:58:49:WU00:FS00:0xa7:Project: 14576 (Run 0, Clone 3083, Gen 10)
13:58:49:WU00:FS00:0xa7:Unit: 0x00000011287234c95e7b86ce0c65d526
13:58:49:WU00:FS00:0xa7:Reading tar file core.xml
13:58:49:WU00:FS00:0xa7:Reading tar file frame10.tpr
13:58:49:WU00:FS00:0xa7:Digital signatures verified
13:58:49:WU00:FS00:0xa7:Calling: mdrun -s frame10.tpr -o frame10.trr -x frame10.xtc -cpt 15 -nt 24
13:58:49:WU00:FS00:0xa7:Steps: first=5000000 total=500000
13:58:49:WU00:FS00:0xa7:ERROR:
13:58:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:58:49:WU00:FS00:0xa7:ERROR:Program GROMACS, VERSION 5.0.4-20191026-456f0d636-unknown
13:58:49:WU00:FS00:0xa7:ERROR:Source code file: /host/debian-stable-64bit-core-a7-avx-release/gromacs-core/build/gromacs/src/gromacs/mdlib/domdec.c, line: 6902
13:58:49:WU00:FS00:0xa7:ERROR:
13:58:49:WU00:FS00:0xa7:ERROR:Fatal error:
13:58:49:WU00:FS00:0xa7:ERROR:There is no domain decomposition for 20 ranks that is compatible with the given box and a minimum cell size of 1.37225 nm
13:58:49:WU00:FS00:0xa7:ERROR:Change the number of ranks or mdrun option -rcon or -dds or your LINCS settings
13:58:49:WU00:FS00:0xa7:ERROR:Look in the log file for details on the domain decomposition
13:58:49:WU00:FS00:0xa7:ERROR:For more information and tips for troubleshooting, please check the GROMACS
13:58:49:WU00:FS00:0xa7:ERROR:website at http://www.gromacs.org/Documentation/Errors
13:58:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:58:54:WU00:FS00:0xa7:WARNING:Unexpected exit() call
13:58:54:WU00:FS00:0xa7:WARNING:Unexpected exit from science code
13:58:54:WU00:FS00:0xa7:Saving result file ../logfile_01.txt
13:58:54:WU00:FS00:0xa7:Saving result file md.log
13:58:54:WU00:FS00:0xa7:Saving result file science.log
13:58:54:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
******************************* Date: 2020-04-03 *******************************
13:59:49:WU00:FS00:Starting
13:59:49:WU00:FS00:Removing old file './work/00/logfile_01-20200403-132747.txt'
13:59:49:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/avx/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 705 -lifeline 2123 -checkpoint 15 -np 24
13:59:49:WU00:FS00:Started FahCore on PID 3669
13:59:49:WU00:FS00:Core PID:3673
13:59:49:WU00:FS00:FahCore 0xa7 started
13:59:49:WU00:FS00:0xa7:*********************** Log Started 2020-04-03T13:59:49Z ***********************
13:59:49:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
13:59:49:WU00:FS00:0xa7:       Type: 0xa7
13:59:49:WU00:FS00:0xa7:       Core: Gromacs
13:59:49:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 705 -lifeline 3669 -checkpoint 15 -np
13:59:49:WU00:FS00:0xa7:             24
13:59:49:WU00:FS00:0xa7:************************************ CBang *************************************
13:59:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:59:49:WU00:FS00:0xa7:       Time: 06:06:57
13:59:49:WU00:FS00:0xa7:   Revision: 46c96f1aa8419571d83f3e63f9c99a0d602f6da9
13:59:49:WU00:FS00:0xa7:     Branch: master
13:59:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:59:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie -fPIC
13:59:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:59:49:WU00:FS00:0xa7:       Bits: 64
13:59:49:WU00:FS00:0xa7:       Mode: Release
13:59:49:WU00:FS00:0xa7:************************************ System ************************************
13:59:49:WU00:FS00:0xa7:        CPU: AMD Ryzen 9 3900X 12-Core Processor
13:59:49:WU00:FS00:0xa7:     CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
13:59:49:WU00:FS00:0xa7:       CPUs: 24
13:59:49:WU00:FS00:0xa7:     Memory: 31.37GiB
13:59:49:WU00:FS00:0xa7:Free Memory: 26.37GiB
13:59:49:WU00:FS00:0xa7:    Threads: POSIX_THREADS
13:59:49:WU00:FS00:0xa7: OS Version: 5.3
13:59:49:WU00:FS00:0xa7:Has Battery: false
13:59:49:WU00:FS00:0xa7: On Battery: false
13:59:49:WU00:FS00:0xa7: UTC Offset: -4
13:59:49:WU00:FS00:0xa7:        PID: 3673
13:59:49:WU00:FS00:0xa7:        CWD: /var/lib/fahclient/work
13:59:49:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
13:59:49:WU00:FS00:0xa7:    Version: 0.0.18
13:59:49:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
13:59:49:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
13:59:49:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
13:59:49:WU00:FS00:0xa7:       Date: Nov 5 2019
13:59:49:WU00:FS00:0xa7:       Time: 06:13:26
13:59:49:WU00:FS00:0xa7:   Revision: 490c9aa2957b725af319379424d5c5cb36efb656
13:59:49:WU00:FS00:0xa7:     Branch: master
13:59:49:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
13:59:49:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie
13:59:49:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
13:59:49:WU00:FS00:0xa7:       Bits: 64
13:59:49:WU00:FS00:0xa7:       Mode: Release
13:59:49:WU00:FS00:0xa7:************************************ Build *************************************
13:59:49:WU00:FS00:0xa7:       SIMD: avx_256
13:59:49:WU00:FS00:0xa7:********************************************************************************
13:59:49:WU00:FS00:0xa7:Project: 14576 (Run 0, Clone 3083, Gen 10)
13:59:49:WU00:FS00:0xa7:Unit: 0x00000011287234c95e7b86ce0c65d526
13:59:49:WU00:FS00:0xa7:Reading tar file core.xml
13:59:49:WU00:FS00:0xa7:Reading tar file frame10.tpr
13:59:49:WU00:FS00:0xa7:Digital signatures verified
13:59:49:WU00:FS00:0xa7:Calling: mdrun -s frame10.tpr -o frame10.trr -x frame10.xtc -cpt 15 -nt 24
13:59:49:WU00:FS00:0xa7:Steps: first=5000000 total=500000
13:59:49:WU00:FS00:0xa7:ERROR:
13:59:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:59:49:WU00:FS00:0xa7:ERROR:Program GROMACS, VERSION 5.0.4-20191026-456f0d636-unknown
13:59:49:WU00:FS00:0xa7:ERROR:Source code file: /host/debian-stable-64bit-core-a7-avx-release/gromacs-core/build/gromacs/src/gromacs/mdlib/domdec.c, line: 6902
13:59:49:WU00:FS00:0xa7:ERROR:
13:59:49:WU00:FS00:0xa7:ERROR:Fatal error:
13:59:49:WU00:FS00:0xa7:ERROR:There is no domain decomposition for 20 ranks that is compatible with the given box and a minimum cell size of 1.37225 nm
13:59:49:WU00:FS00:0xa7:ERROR:Change the number of ranks or mdrun option -rcon or -dds or your LINCS settings
13:59:49:WU00:FS00:0xa7:ERROR:Look in the log file for details on the domain decomposition
13:59:49:WU00:FS00:0xa7:ERROR:For more information and tips for troubleshooting, please check the GROMACS
13:59:49:WU00:FS00:0xa7:ERROR:website at http://www.gromacs.org/Documentation/Errors
13:59:49:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
13:59:54:WU00:FS00:0xa7:WARNING:Unexpected exit() call
13:59:54:WU00:FS00:0xa7:WARNING:Unexpected exit from science code
13:59:54:WU00:FS00:0xa7:Saving result file ../logfile_01.txt
13:59:54:WU00:FS00:0xa7:Saving result file md.log
13:59:54:WU00:FS00:0xa7:Saving result file science.log
13:59:54:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)

Re: FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 2:29 pm
by Neil-B
Joe_H posted something in viewtopic.php?f=19&t=33871 (2nd response) that I think will link to your issue … When he sees this he will be able to walk you though what to do.

Re: FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 2:36 pm
by zookeeny
Thanks for the quick response Neil-B! After looking at the thread you mentioned, I was able to resolve my problem by changing my FaH client from Full Power (24 CPUs) to Medium Power (23 CPUs). ETA is now 20 minutes. Thanks again for the help!

Re: FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 2:42 pm
by Joe_H
On Medium how many CPU threads is it able to run on? That could be useful in setting upper bounds on the CPU threads it will be assigned to.

Re: FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 2:51 pm
by zookeeny
Here are a few lines from the log after it began successfully folding again... does this help?

Code: Select all

14:32:51:WU00:FS00:0xa7:Project: 14576 (Run 0, Clone 3083, Gen 10)
14:32:51:WU00:FS00:0xa7:Unit: 0x00000011287234c95e7b86ce0c65d526
14:32:51:WU00:FS00:0xa7:Reading tar file core.xml
14:32:51:WU00:FS00:0xa7:Reading tar file frame10.tpr
14:32:51:WU00:FS00:0xa7:Digital signatures verified
14:32:51:WU00:FS00:0xa7:Reducing thread count from 23 to 22 to avoid domain decomposition by a prime number > 3
14:32:51:WU00:FS00:0xa7:Reducing thread count from 22 to 21 to avoid domain decomposition with large prime factor 11
14:32:51:WU00:FS00:0xa7:Calling: mdrun -s frame10.tpr -o frame10.trr -x frame10.xtc -cpt 15 -nt 21
14:32:51:WU00:FS00:0xa7:Steps: first=5000000 total=500000
14:32:51:WU00:FS00:0xa7:Completed 1 out of 500000 steps (0%)
14:33:05:WU00:FS00:0xa7:Completed 5000 out of 500000 steps (1%)
14:33:19:WU00:FS00:0xa7:Completed 10000 out of 500000 steps (2%)
14:33:33:WU00:FS00:0xa7:Completed 15000 out of 500000 steps (3%)
14:33:47:WU00:FS00:0xa7:Completed 20000 out of 500000 steps (4%)
14:34:02:WU00:FS00:0xa7:Completed 25000 out of 500000 steps (5%)
14:34:16:WU00:FS00:0xa7:Completed 30000 out of 500000 steps (6%)
14:34:30:WU00:FS00:0xa7:Completed 35000 out of 500000 steps (7%)

Re: FaH client is cycling on and off repeatedly

Posted: Fri Apr 03, 2020 3:51 pm
by Joe_H
Yes, thank you.