Code: Select all
Compiled SIMD instructions: AVX_256 (Gromacs could use AVX2_256 on this machine, which is better)
The current CPU can measure timings more accurately than the code in
GROMACS was configured to use. This might affect your simulation
speed as accurate timings are needed for load-balancing.
Please consider rebuilding GROMACS with the GMX_USE_RDTSCP=OFF CMake option.
NOTE: The number of threads is not equal to the number of (logical) cores
and the -pin option is set to auto: will not pin thread to cores.
This can lead to significant performance degradation.
Consider using -pin on (and -pinoffset in case you run multiple jobs).
2. On the timing measurements, I guess it's compiled with GMX_USE_RDTSCP=OFF to cater for CPUs which don't support the RDTSCP instruction, correct? Pity that it doesn't support runtime auto-detection.
But at least according according to this: https://github.com/gromacs/gromacs/blob ... eLists.txt there is a correlation between AVX and RDTSCP support, so for AVX compiles, RDTSCP support could probably be on...?
3. For the third one, I'm running the core on 6 out of 8 threads (as the other two are running my GPU slot). That's already passed to mdrun as -nt 6. But GROMACS complains that "This can lead to significant performance degradation.", apparently due to "OS switching threads across physical cores, which may result in performance loss.". So, I guess my question is why the client is not using -pin on and -pinoffset?
I know almost nothing about GROMACS, so please excuse my ignorance in advance Just curious. Thanks.