Just some napkin math: each lane of PCIe 3.0 is a touch shy of a gigabyte per second of bidirectional throughput (eg it can be transmit, or receive, or both at the same time at the full ~985MB/s.) Meaning, at least in theory, a PCIe 3.0 x4 slot would be able to manage, with some to spare, the throughput needs of @arisu's GTX970M.
Part of me wonders if that particular project was perhaps VRAM heavy? Most of the 970M's were shipped with 3GB of VRAM, so maybe combined with the compute kernel and dataset, it needed to continually page in the data? That seems unlikely to be honest, but I can't imagine why else there would be a continuous stream of 3GB/sec to a 3GB VRAM card. Would be interesting to see that same WU on a card with larger VRAM, just for A/B comparison sake on the PCIe throughput.
PCIe bandwidth requirements (RTX 50xx edition)
Moderator: Site Moderators
Forum rules
Please read the forum rules before posting.
Please read the forum rules before posting.
-
- Posts: 9
- Joined: Wed Oct 01, 2025 3:05 am
- Hardware configuration: AMD 9800X3D + 5090, Windows 11
AMD 5950X + 4070 Super, Fedora 42 VM on Proxmox 8.3
Intel i7-3930k + 4070 Super + 4090, Fedora 42