This error is easily reproducible by running a plain a8 core from the command line and then killing it, or by backing up the work directory while the core is running and then restoring it. Some portion of the time, the dhdl.xvg file gets corrupted (it's always the xvg file). I'm not sure why, because the modifications that FAH has made to the GROMACS checkpoint code are intended to prevent exactly that, by truncating files that are appended to (like xvg files) back to their state at the last checkpoint.
Code:
06:44:50:I1:WU269:*********************** Log Started 2025-04-04T06:44:49Z ***********************
06:44:50:I1:WU269:************************** Gromacs Folding@home Core ***************************
06:44:50:I1:WU269: Core: Gromacs
06:44:50:I1:WU269: Type: 0xa8
06:44:50:I1:WU269: Version: 0.0.12
06:44:50:I1:WU269: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
06:44:50:I1:WU269: Copyright: 2020 foldingathome.org
06:44:50:I1:WU269: Homepage: https://foldingathome.org/
06:44:50:I1:WU269: Date: Jan 16 2021
06:44:50:I1:WU269: Time: 19:24:44
06:44:50:I1:WU269: Compiler: GNU 8.3.0
06:44:50:I1:WU269: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
06:44:50:I1:WU269: -fdata-sections -O3 -funroll-loops -fno-pie
06:44:50:I1:WU269: Platform: linux2 4.15.0-128-generic
06:44:50:I1:WU269: Bits: 64
06:44:50:I1:WU269: Mode: Release
06:44:50:I1:WU269: SIMD: avx2_256
06:44:50:I1:WU269: OpenMP: ON
06:44:50:I1:WU269: CUDA: OFF
06:44:50:I1:WU269: Args: -dir aqglezsapduPAa44rC_rXPzXVe-6QMe9WP0YFIHrzrQ -suffix 01
06:44:50:I1:WU269: -version 8.4.10 -lifeline 18990 -np 7
06:44:50:I1:WU269:************************************ libFAH ************************************
06:44:50:I1:WU269: Date: Jan 16 2021
06:44:50:I1:WU269: Time: 19:21:38
06:44:50:I1:WU269: Compiler: GNU 8.3.0
06:44:50:I1:WU269: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
06:44:50:I1:WU269: -fdata-sections -O3 -funroll-loops -fno-pie
06:44:50:I1:WU269: Platform: linux2 4.15.0-128-generic
06:44:50:I1:WU269: Bits: 64
06:44:50:I1:WU269: Mode: Release
06:44:50:I1:WU269:************************************ CBang *************************************
06:44:50:I1:WU269: Date: Jan 16 2021
06:44:50:I1:WU269: Time: 19:21:24
06:44:50:I1:WU269: Compiler: GNU 8.3.0
06:44:50:I1:WU269: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
06:44:50:I1:WU269: -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
06:44:50:I1:WU269: Platform: linux2 4.15.0-128-generic
06:44:50:I1:WU269: Bits: 64
06:44:50:I1:WU269: Mode: Release
06:44:50:I1:WU269:************************************ System ************************************
06:44:50:I1:WU269: CPU: AMD Ryzen 7 7840U w/ Radeon 780M Graphics
06:44:50:I1:WU269: CPU ID: AuthenticAMD Family 25 Model 116 Stepping 1
06:44:50:I1:WU269: CPUs: 8
06:44:50:I1:WU269: Memory: 30.58GiB
06:44:50:I1:WU269:Free Memory: 23.49GiB
06:44:50:I1:WU269: Threads: POSIX_THREADS
06:44:50:I1:WU269: OS Version: 6.1
06:44:50:I1:WU269:Has Battery: true
06:44:50:I1:WU269: On Battery: false
06:44:50:I1:WU269: UTC Offset: 5
06:44:50:I1:WU269: PID: 18995
06:44:50:I1:WU269: CWD: /var/lib/fah-client/work
06:44:50:I1:WU269:********************************************************************************
06:44:50:I1:WU269:Project: 19228 (Run 6060, Clone 7, Gen 3)
06:44:50:I1:WU269:Unit: 0x00000000000000000000000000000000
06:44:50:I1:WU269:Digital signatures verified
06:44:50:I1:WU269:Calling: mdrun -c md3.gro -s md3.tpr -x md3.xtc -cpi state.cpt -cpt 5 -nt 7 -ntmpi 1
06:44:50:I1:WU269:ERROR:Guru Meditation #6a6fa21db879dcae.47018a7a424ec6e2 (14621.17230) 'aqglezsapduPAa44rC_rXPzXVe-6QMe9WP0YFIHrzrQ/01/dhdl.xvg'
06:44:50:I4:REQ2:> HTTP/1.1 101 HTTP_SWITCHING_PROTOCOLS
06:44:50:E :WU269:Core returned BAD_FRAME_CHECKSUM (112)
06:44:50:E :WU269:Run did not produce any results. Dumping WU
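
For anyone who wants to picture the truncate-on-restart idea mentioned above, here is a rough sketch in Python. To be clear, this is not the FAH or GROMACS code. The function names (record_checkpoint, restore_to_checkpoint) are made up for illustration, and storing a SHA-256 digest next to the file size is my own assumption about how such a check could work. It just records how long each appended-to file was at checkpoint time, cuts the file back to that length on restart, and reports any file that is shorter than expected or whose contents no longer match.

```python
import hashlib
import os


def record_checkpoint(paths):
    """Hypothetical helper: remember size and digest of each appended-to file."""
    state = {}
    for path in paths:
        with open(path, "rb") as f:
            data = f.read()
        state[path] = (len(data), hashlib.sha256(data).hexdigest())
    return state


def restore_to_checkpoint(state):
    """Hypothetical helper: truncate files back to their checkpointed length.

    Returns the list of paths that could not be restored, either because the
    file is missing or shorter than it was at the checkpoint, or because the
    truncated contents do not match the recorded digest.
    """
    bad = []
    for path, (size, digest) in state.items():
        if not os.path.exists(path) or os.path.getsize(path) < size:
            bad.append(path)
            continue
        # Drop anything that was appended after the checkpoint was written.
        with open(path, "r+b") as f:
            f.truncate(size)
        with open(path, "rb") as f:
            if hashlib.sha256(f.read()).hexdigest() != digest:
                bad.append(path)
    return bad
```

In this sketch, a file that was only appended to after the checkpoint restores cleanly. A file that ends up shorter than the recorded length, or with different bytes in the checkpointed region, cannot be repaired by truncation and gets reported instead. Whether that is what actually happens to dhdl.xvg here is something I can't confirm from the log.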