Versatile Entry-Level Inference

The NVIDIA A2 Tensor Core GPU provides entry-level inference with low power, a small footprint, and high performance for NVIDIA AI at the edge.
With a low-profile PCIe Gen4 form factor and a configurable thermal design power (TDP) of 40-60W, the A2 brings versatile inference acceleration to any server for deployment at scale.
Peak FP32: 4.5 TF
TF32 Tensor Core: 9 TF | 18 TF¹
BFLOAT16 Tensor Core: 18 TF | 36 TF¹
Peak FP16 Tensor Core: 18 TF | 36 TF¹
Peak INT8 Tensor Core: 36 TOPS | 72 TOPS¹
Peak INT4 Tensor Core: 72 TOPS | 144 TOPS¹
RT Cores: 10
GPU memory: 16GB GDDR6
GPU memory bandwidth: 200GB/s
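
One way to read the peak figures against the memory bandwidth is to divide one by the other. The result is the number of operations a workload must perform per byte of memory traffic before peak compute, rather than bandwidth, becomes the limit. The Python sketch below does that arithmetic, using the first value of each pair in the list above. It is an illustrative back-of-the-envelope calculation, not an NVIDIA-published metric. The names `PEAK_TERA_OPS`, `ridge_point`, and `ridge_points` are hypothetical, and the sketch assumes decimal units (1 TF = 10^12 operations per second, 1 GB = 10^9 bytes).

```python
# Peak throughput in tera-operations per second, copied from the spec list
# above (first value of each pair). Dictionary and function names are
# illustrative only.
PEAK_TERA_OPS = {
    "FP32": 4.5,
    "TF32 Tensor Core": 9.0,
    "BFLOAT16 Tensor Core": 18.0,
    "FP16 Tensor Core": 18.0,
    "INT8 Tensor Core": 36.0,
    "INT4 Tensor Core": 72.0,
}

# GPU memory bandwidth from the spec list, in GB/s (assumed decimal gigabytes).
MEMORY_BANDWIDTH_GB_S = 200.0


def ridge_point(peak_tera_ops, bandwidth_gb_s):
    """Return operations per byte of memory traffic needed to reach peak."""
    ops_per_second = peak_tera_ops * 1e12
    bytes_per_second = bandwidth_gb_s * 1e9
    return ops_per_second / bytes_per_second


def ridge_points():
    """Compute the ops-per-byte ratio for every precision in the table."""
    return {
        name: ridge_point(peak, MEMORY_BANDWIDTH_GB_S)
        for name, peak in PEAK_TERA_OPS.items()
    }


if __name__ == "__main__":
    for name, value in ridge_points().items():
        print(f"{name}: {value:.1f} ops/byte")
```

Lower-precision modes have higher peaks but share the same 200GB/s of bandwidth, so under these assumptions they need proportionally more data reuse per byte to approach their listed peak.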