I have a rig with three RTX cards: 1x 4070, 2x5070.
I'm running the nvidia-driver-580-open packages.
One of the 5070s is new, replacing a 3080.
When I was running with the 4070, 3080 and 5070, creating a CUDA context on any card was pretty quick. Sub 1-minute. After replacing the 3080 with another 5070 it can take up to 7 minutes or so, but only on the 5070s.
Anyone else seeing this? Any ideas what's going on and how to "fix" it?
thanks!
P.S.: I didn't notice any startup lag on any card before I replaced the 3080, but now I'm paranoid I just didn't notice that the 5070 was being slow, so apologies if this is WAI.
4070 startup
Code: Select all
22:28:01:I1:WU5025: Branch: HEAD
22:28:01:I1:WU5025: Compiler: GNU 9.4.0
22:28:01:I1:WU5025: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:28:01:I1:WU5025: -fdata-sections -O3 -funroll-loops -fno-pie
22:28:01:I1:WU5025: -DOPENMM_VERSION="\"7.7.0\""
22:28:01:I1:WU5025: Platform: linux 5.11.0-1025-azure
22:28:01:I1:WU5025: Bits: 64
22:28:01:I1:WU5025: Mode: Release
22:28:01:I1:WU5025:Maintainers: John Chodera <john.chodera@choderalab.org> and Peter Eastman
22:28:01:I1:WU5025: <peastman@stanford.edu>
22:28:01:I1:WU5025: Args: -dir 9zV1kMPvvp7MdF2Qj58BDcf5qaVLceQaPefRtDA-5s4 -suffix 01
22:28:01:I1:WU5025: -version 8.4.9 -lifeline 169 -gpu-uuid
22:28:01:I1:WU5025: d65f3411-caf4-41a8-0248-0dcf200dee35 -gpu-platform cuda -gpu-vendor
22:28:01:I1:WU5025: nvidia -opencl-platform 0 -opencl-device 0 -cuda-platform 0
22:28:01:I1:WU5025: -cuda-device 0 -gpu 0
22:28:01:I1:WU5025:************************************ libFAH ************************************
22:28:01:I1:WU5025: Date: Jan 20 2022
22:28:01:I1:WU5025: Time: 00:57:22
22:28:01:I1:WU5025: Revision: 9f4ad694e75c2350d4bb6b8b5b769ba27e483a2f
22:28:01:I1:WU5025: Branch: HEAD
22:28:01:I1:WU5025: Compiler: GNU 9.4.0
22:28:01:I1:WU5025: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:28:01:I1:WU5025: -fdata-sections -O3 -funroll-loops -fno-pie
22:28:01:I1:WU5025: Platform: linux 5.11.0-1025-azure
22:28:01:I1:WU5025: Bits: 64
22:28:01:I1:WU5025: Mode: Release
22:28:01:I1:WU5025:************************************ CBang *************************************
22:28:01:I1:WU5025: Date: Jan 20 2022
22:28:01:I1:WU5025: Time: 00:57:00
22:28:01:I1:WU5025: Revision: ab023d155b446906d55b0f6c9a1eedeea04f7a1a
22:28:01:I1:WU5025: Branch: HEAD
22:28:01:I1:WU5025: Compiler: GNU 9.4.0
22:28:01:I1:WU5025: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:28:01:I1:WU5025: -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
22:28:01:I1:WU5025: Platform: linux 5.11.0-1025-azure
22:28:01:I1:WU5025: Bits: 64
22:28:01:I1:WU5025: Mode: Release
22:28:01:I1:WU5025:************************************ System ************************************
22:28:01:I1:WU5025: CPU: Intel(R) Core(TM) i9-7940X CPU @ 3.10GHz
22:28:01:I1:WU5025: CPU ID: GenuineIntel Family 6 Model 85 Stepping 4
22:28:01:I1:WU5025: CPUs: 28
22:28:01:I1:WU5025: Memory: 31.04GiB
22:28:01:I1:WU5025:Free Memory: 11.31GiB
22:28:01:I1:WU5025: Threads: POSIX_THREADS
22:28:01:I1:WU5025: OS Version: 6.8
22:28:01:I1:WU5025:Has Battery: false
22:28:01:I1:WU5025: On Battery: false
22:28:01:I1:WU5025: UTC Offset: -7
22:28:01:I1:WU5025: PID: 6837
22:28:01:I1:WU5025: CWD: /config/work
22:28:01:I1:WU5025:************************************ OpenMM ************************************
22:28:01:I1:WU5025: Version: 7.7.0
22:28:01:I1:WU5025:********************************************************************************
22:28:01:I1:WU5025:Project: 14973 (Run 1, Clone 59, Gen 12)
22:28:01:I1:WU5025:Reading tar file core.xml
22:28:01:I1:WU5025:Reading tar file integrator.xml
22:28:01:I1:WU5025:Reading tar file state.xml
22:28:02:I1:WU5025:Reading tar file system.xml
22:28:04:I1:WU5025:Digital signatures verified
22:28:04:I1:WU5025:Folding@home GPU Core22 Folding@home Core
22:28:04:I1:WU5025:Version 0.0.20
22:28:04:I1:WU5025: Checkpoint write interval: 50000 steps (5%) [20 total]
22:28:04:I1:WU5025: JSON viewer frame write interval: 10000 steps (1%) [100 total]
22:28:04:I1:WU5025: XTC frame write interval: 25000 steps (2.5%) [40 total]
22:28:04:I1:WU5025: Global context and integrator variables write interval: disabled
22:28:04:I1:WU5025:There are 4 platforms available.
22:28:04:I1:WU5025:Platform 0: Reference
22:28:04:I1:WU5025:Platform 1: CPU
22:28:04:I1:WU5025:Platform 2: OpenCL
22:28:04:I1:WU5025: opencl-device 0 specified
22:28:04:I1:WU5025:Platform 3: CUDA
22:28:04:I1:WU5025: cuda-device 0 specified
22:28:33:I1:WU5025:Attempting to create CUDA context: <
22:28:33:I1:WU5025: Configuring platform CUDA < 22s startup
22:28:55:I1:WU5025: Using CUDA and gpu 0. <
22:28:55:I1:WU5025:Completed 0 out of 1000000 steps (0%)
22:28:57:I1:WU5025:Checkpoint completed at step 0
22:29:52:I1:WU5025:Completed 10000 out of 1000000 steps (1%)
Code: Select all
22:14:33:I1:WU5023: dd40d876-3e63-1967-9ae5-8227782a482e -gpu-platform cuda -gpu-vendor
22:14:33:I1:WU5023: nvidia -opencl-platform 0 -opencl-device 1 -cuda-platform 0
22:14:33:I1:WU5023: -cuda-device 1 -gpu 1
22:14:33:I1:WU5023:************************************ libFAH ************************************
22:14:33:I1:WU5023: Date: Jan 20 2022
22:14:33:I1:WU5023: Time: 00:57:22
22:14:33:I1:WU5023: Revision: 9f4ad694e75c2350d4bb6b8b5b769ba27e483a2f
22:14:33:I1:WU5023: Branch: HEAD
22:14:33:I1:WU5023: Compiler: GNU 9.4.0
22:14:33:I1:WU5023: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:14:33:I1:WU5023: -fdata-sections -O3 -funroll-loops -fno-pie
22:14:33:I1:WU5023: Platform: linux 5.11.0-1025-azure
22:14:33:I1:WU5023: Bits: 64
22:14:33:I1:WU5023: Mode: Release
22:14:33:I1:WU5023:************************************ CBang *************************************
22:14:33:I1:WU5023: Date: Jan 20 2022
22:14:33:I1:WU5023: Time: 00:57:00
22:14:33:I1:WU5023: Revision: ab023d155b446906d55b0f6c9a1eedeea04f7a1a
22:14:33:I1:WU5023: Branch: HEAD
22:14:33:I1:WU5023: Compiler: GNU 9.4.0
22:14:33:I1:WU5023: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:14:33:I1:WU5023: -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
22:14:33:I1:WU5023: Platform: linux 5.11.0-1025-azure
22:14:33:I1:WU5023: Bits: 64
22:14:33:I1:WU5023: Mode: Release
22:14:33:I1:WU5023:************************************ System ************************************
22:14:33:I1:WU5023: CPU: Intel(R) Core(TM) i9-7940X CPU @ 3.10GHz
22:14:33:I1:WU5023: CPU ID: GenuineIntel Family 6 Model 85 Stepping 4
22:14:33:I1:WU5023: CPUs: 28
22:14:33:I1:WU5023: Memory: 31.04GiB
22:14:33:I1:WU5023:Free Memory: 14.61GiB
22:14:33:I1:WU5023: Threads: POSIX_THREADS
22:14:33:I1:WU5023: OS Version: 6.8
22:14:33:I1:WU5023:Has Battery: false
22:14:33:I1:WU5023: On Battery: false
22:14:33:I1:WU5023: UTC Offset: -7
22:14:33:I1:WU5023: PID: 6443
22:14:33:I1:WU5023: CWD: /config/work
22:14:33:I1:WU5023:************************************ OpenMM ************************************
22:14:33:I1:WU5023: Version: 7.7.0
22:14:33:I1:WU5023:********************************************************************************
22:14:33:I1:WU5023:Project: 14973 (Run 2, Clone 67, Gen 2)
22:14:33:I1:WU5023:Reading tar file core.xml
22:14:33:I1:WU5023:Reading tar file integrator.xml
22:14:33:I1:WU5023:Reading tar file state.xml
22:14:34:I1:WU5023:Reading tar file system.xml
22:14:35:I1:WU5023:Digital signatures verified
22:14:35:I1:WU5023:Folding@home GPU Core22 Folding@home Core
22:14:35:I1:WU5023:Version 0.0.20
22:14:35:I1:WU5023: Checkpoint write interval: 50000 steps (5%) [20 total]
22:14:35:I1:WU5023: JSON viewer frame write interval: 10000 steps (1%) [100 total]
22:14:35:I1:WU5023: XTC frame write interval: 25000 steps (2.5%) [40 total]
22:14:35:I1:WU5023: Global context and integrator variables write interval: disabled
22:14:35:I1:WU5023:There are 4 platforms available.
22:14:35:I1:WU5023:Platform 0: Reference
22:14:35:I1:WU5023:Platform 1: CPU
22:14:35:I1:WU5023:Platform 2: OpenCL
22:14:35:I1:WU5023: opencl-device 1 specified
22:14:35:I1:WU5023:Platform 3: CUDA
22:14:35:I1:WU5023: cuda-device 1 specified
22:15:01:I1:WU5023:Attempting to create CUDA context: <
22:15:01:I1:WU5023: Configuring platform CUDA < 6m38s startup
22:21:39:I1:WU5023: Using CUDA and gpu <
22:21:39:I1:WU5023:Completed 0 out of 1000000 steps (0%)
Code: Select all
22:15:01:I1:WU5024: -cuda-device 2 -gpu 2
22:15:01:I1:WU5024:************************************ libFAH ************************************
22:15:01:I1:WU5024: Date: Jan 20 2022
22:15:01:I1:WU5024: Time: 00:57:22
22:15:01:I1:WU5024: Revision: 9f4ad694e75c2350d4bb6b8b5b769ba27e483a2f
22:15:01:I1:WU5024: Branch: HEAD
22:15:01:I1:WU5024: Compiler: GNU 9.4.0
22:15:01:I1:WU5024: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:15:01:I1:WU5024: -fdata-sections -O3 -funroll-loops -fno-pie
22:15:01:I1:WU5024: Platform: linux 5.11.0-1025-azure
22:15:01:I1:WU5024: Bits: 64
22:15:01:I1:WU5024: Mode: Release
22:15:01:I1:WU5024:************************************ CBang *************************************
22:15:01:I1:WU5024: Date: Jan 20 2022
22:15:01:I1:WU5024: Time: 00:57:00
22:15:01:I1:WU5024: Revision: ab023d155b446906d55b0f6c9a1eedeea04f7a1a
22:15:01:I1:WU5024: Branch: HEAD
22:15:01:I1:WU5024: Compiler: GNU 9.4.0
22:15:01:I1:WU5024: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:15:01:I1:WU5024: -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
22:15:01:I1:WU5024: Platform: linux 5.11.0-1025-azure
22:15:01:I1:WU5024: Bits: 64
22:15:01:I1:WU5024: Mode: Release
22:15:01:I1:WU5024:************************************ System ************************************
22:15:01:I1:WU5024: CPU: Intel(R) Core(TM) i9-7940X CPU @ 3.10GHz
22:15:01:I1:WU5024: CPU ID: GenuineIntel Family 6 Model 85 Stepping 4
22:15:01:I1:WU5024: CPUs: 28
22:15:01:I1:WU5024: Memory: 31.04GiB
22:15:01:I1:WU5024:Free Memory: 12.71GiB
22:15:01:I1:WU5024: Threads: POSIX_THREADS
22:15:01:I1:WU5024: OS Version: 6.8
22:15:01:I1:WU5024:Has Battery: false
22:15:01:I1:WU5024: On Battery: false
22:15:01:I1:WU5024: UTC Offset: -7
22:15:01:I1:WU5024: PID: 6504
22:15:01:I1:WU5024: CWD: /config/work
22:15:01:I1:WU5024:************************************ OpenMM ************************************
22:15:01:I1:WU5024: Version: 7.7.0
22:15:01:I1:WU5024:********************************************************************************
22:15:01:I1:WU5024:Project: 14955 (Run 6, Clone 23, Gen 22)
22:15:01:I1:WU5024:Reading tar file core.xml
22:15:01:I1:WU5024:Reading tar file integrator.xml
22:15:01:I1:WU5024:Reading tar file state.xml
22:15:02:I1:WU5024:Reading tar file system.xml
22:15:04:I1:WU5024:Digital signatures verified
22:15:04:I1:WU5024:Folding@home GPU Core22 Folding@home Core
22:15:04:I1:WU5024:Version 0.0.20
22:15:04:I1:WU5024: Checkpoint write interval: 50000 steps (5%) [20 total]
22:15:04:I1:WU5024: JSON viewer frame write interval: 10000 steps (1%) [100 total]
22:15:04:I1:WU5024: XTC frame write interval: 25000 steps (2.5%) [40 total]
22:15:04:I1:WU5024: Global context and integrator variables write interval: disabled
22:15:04:I1:WU5024:There are 4 platforms available.
22:15:04:I1:WU5024:Platform 0: Reference
22:15:04:I1:WU5024:Platform 1: CPU
22:15:04:I1:WU5024:Platform 2: OpenCL
22:15:04:I1:WU5024: opencl-device 2 specified
22:15:04:I1:WU5024:Platform 3: CUDA
22:15:04:I1:WU5024: cuda-device 2 specified
22:15:33:I1:WU5024:Attempting to create CUDA context: <
22:15:33:I1:WU5024: Configuring platform CUDA < 4m12s startup
22:19:45:I1:WU5024: Using CUDA and gpu 2 <
22:19:45:I1:WU5024:Completed 0 out of 1000000 steps (0%)
22:19:48:I1:WU5024:Checkpoint completed at step 0
22:20:34:I1:WU5024:Completed 10000 out of 1000000 steps (1%)