Code: Select all
OS: Kubuntu 18.04 LTS
FAHClient: 7.6.21
Nvidia Driver: 460.39
And GPU utilisation
Each dip corresponds to the checkpoint session. Even though I have set the interval to 30 minutes, it seems that my client is checkpointing every 5-6 minutes. At first I thought this might be due to some latency from the write time of my HDD (USB 3.1) but I have seen other work units on the card perform with much more stability, eg the work unit on the far left of this graph from a separate GPU (another 3070)
No error logs in the FAHClient console. Possibly still working through my first 10 WUs. Can provide nvidia log dump if needed or can provide a snippet if directed.
Thoughts?