ID :0x1b38
Thanks,
Code: Select all
[root@103-124 Pascal_PS]# FAHClient --lspci | grep -i nvidia
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
0x10de:0x1b38:NVIDIA Corporation:
Code: Select all
[root@103-124 Pascal_PS]# nvidia-smi
Mon Nov 7 18:14:42 2016
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 367.55 Driver Version: 367.55 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Tesla P40 Off | 0000:04:00.0 Off | 0 |
| N/A 30C P0 50W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 1 Tesla P40 Off | 0000:05:00.0 Off | 0 |
| N/A 32C P0 53W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 2 Tesla P40 Off | 0000:06:00.0 Off | 0 |
| N/A 35C P0 52W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 3 Tesla P40 Off | 0000:07:00.0 Off | 0 |
| N/A 31C P0 52W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 4 Tesla P40 Off | 0000:08:00.0 Off | 0 |
| N/A 31C P0 51W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 5 Tesla P40 Off | 0000:0B:00.0 Off | 0 |
| N/A 31C P0 51W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 6 Tesla P40 Off | 0000:0C:00.0 Off | 0 |
| N/A 32C P0 52W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 7 Tesla P40 Off | 0000:0D:00.0 Off | 0 |
| N/A 33C P0 53W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 8 Tesla P40 Off | 0000:0E:00.0 Off | 0 |
| N/A 29C P0 54W / 250W | 0MiB / 22912MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 9 Tesla P40 Off | 0000:0F:00.0 Off | 0 |
| N/A 32C P0 53W / 250W | 0MiB / 22912MiB | 2% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+
Code: Select all
[root@103-124 Pascal_PS]# nvidia-smi -i 0 -q
==============NVSMI LOG==============
Timestamp : Mon Nov 7 18:15:20 2016
Driver Version : 367.55
Attached GPUs : 10
GPU 0000:04:00.0
Product Name : Tesla P40
Product Brand : Tesla
Display Mode : Disabled
Display Active : Disabled
Persistence Mode : Disabled
Accounting Mode : Disabled
Accounting Mode Buffer Size : 1920
Driver Model
Current : N/A
Pending : N/A
Serial Number : 0333916020535
GPU UUID : GPU-cc788d9e-bc0c-73a0-73f0-b8c7f232bb2d
Minor Number : 0
VBIOS Version : 86.02.22.00.01
MultiGPU Board : No
Board ID : 0x400
GPU Part Number : 699-2G610-0200-100
Inforom Version
Image Version : G610.0200.00.03
OEM Object : 1.1
ECC Object : 4.1
Power Management Object : N/A
GPU Operation Mode
Current : N/A
Pending : N/A
GPU Virtualization Mode
Virtualization mode : None
PCI
Bus : 0x04
Device : 0x00
Domain : 0x0000
Device Id : 0x1B3810DE
Bus Id : 0000:04:00.0
Sub System Id : 0x11D910DE
GPU Link Info
PCIe Generation
Max : 3
Current : 3
Link Width
Max : 16x
Current : 16x
Bridge Chip
Type : N/A
Firmware : N/A
Replays since reset : 0
Tx Throughput : 0 KB/s
Rx Throughput : 0 KB/s
Fan Speed : N/A
Performance State : P0
Clocks Throttle Reasons
Idle : Not Active
Applications Clocks Setting : Active
SW Power Cap : Not Active
HW Slowdown : Not Active
Sync Boost : Not Active
Unknown : Not Active
FB Memory Usage
Total : 22912 MiB
Used : 0 MiB
Free : 22912 MiB
BAR1 Memory Usage
Total : 32768 MiB
Used : 2 MiB
Free : 32766 MiB
Compute Mode : Default
Utilization
Gpu : 0 %
Memory : 0 %
Encoder : 0 %
Decoder : 0 %
Ecc Mode
Current : Enabled
Pending : Enabled
ECC Errors
Volatile
Single Bit
Device Memory : 0
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : 0
Double Bit
Device Memory : 0
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : 0
Aggregate
Single Bit
Device Memory : 0
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : 0
Double Bit
Device Memory : 0
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : 0
Retired Pages
Single Bit ECC : 0
Double Bit ECC : 0
Pending : No
Temperature
GPU Current Temp : 30 C
GPU Shutdown Temp : 95 C
GPU Slowdown Temp : 92 C
Power Readings
Power Management : Supported
Power Draw : 50.71 W
Power Limit : 250.00 W
Default Power Limit : 250.00 W
Enforced Power Limit : 250.00 W
Min Power Limit : 125.00 W
Max Power Limit : 250.00 W
Clocks
Graphics : 1303 MHz
SM : 1303 MHz
Memory : 3615 MHz
Video : 1164 MHz
Applications Clocks
Graphics : 1303 MHz
Memory : 3615 MHz
Default Applications Clocks
Graphics : 1303 MHz
Memory : 3615 MHz
Max Clocks
Graphics : 1531 MHz
SM : 1531 MHz
Memory : 3615 MHz
Video : 1379 MHz
Clock Policy
Auto Boost : N/A
Auto Boost Default : N/A
Processes : None
[root@103-124 Pascal_PS]#
Code: Select all
[root@103-124 gpu_burn]# ./deviceQuery
./deviceQuery Starting...
CUDA Device Query (Runtime API) version (CUDART static linking)
Detected 10 CUDA Capable device(s)
Device 0: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 4 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 1: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 5 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 2: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 6 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 3: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 7 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 4: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 8 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 5: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 11 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 6: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 12 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 7: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 13 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 8: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 14 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
Device 9: "Tesla P40"
CUDA Driver Version / Runtime Version 8.0 / 7.5
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 22913 MBytes (24025956352 bytes)
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
MapSMtoCores for SM 6.1 is undefined. Default to use 128 Cores/SM
(30) Multiprocessors, (128) CUDA Cores/MP: 3840 CUDA Cores
GPU Max Clock rate: 1531 MHz (1.53 GHz)
Memory Clock rate: 3615 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 3145728 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 15 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU0) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU1) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU2) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU3) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU4) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU5) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU6) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU8) : Yes
> Peer access from Tesla P40 (GPU7) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU8) -> Tesla P40 (GPU9) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU0) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU1) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU2) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU3) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU4) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU5) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU6) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU7) : Yes
> Peer access from Tesla P40 (GPU9) -> Tesla P40 (GPU8) : Yes
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 8.0, CUDA Runtime Version = 7.5, NumDevs = 10, Device0 = Tesla P40, Device1 = Tesla P40, Device2 = Tesla P40, Device3 = Tesla P40, Device4 = Tesla P40, Device5 = Tesla P40, Device6 = Tesla P40, Device7 = Tesla P40, Device8 = Tesla P40, Device9 = Tesla P40
Result = PASS