Powered by the NVIDIA Ampere Architecture Explore the heart of the world’s highest-performing, elastic data centers.
FP32
31.2 teraFLOPS
TF32 Tensor Core
62.5 teraFLOPS | 125 teraFLOPS*
BFLOAT16 Tensor Core
125 teraFLOPS | 250 teraFLOPS*
FP16 Tensor Core
125 teraFLOPS | 250 teraFLOPS*
INT8 Tensor Core
250 TOPS | 500 TOPS*
INT4 Tensor Core
500 TOPS | 1,000 TOPS*
RT Core
72 RT Cores
Encode/decode
1 encoder
2 decoder (+AV1 decode)
GPU memory
24GB GDDR6
GPU memory bandwidth
600GB/s
Interconnect
PCIe Gen4 64GB/s
Form factors
Single-slot, full-height, full-length (FHFL)
Max thermal design power (TDP)
150W