NVIDIA Announces New RTX A5000 and RTX A4000 GPUs for Desktop and A30 and A10 GPUs for Data Center

April 13, 2021

Earlier this week at GTC 2021, amid an action-packed keynote, NVIDIA announced several new GPUs for its product line. This post focuses on four of them: the NVIDIA® RTX™ A5000, NVIDIA® RTX™ A4000, NVIDIA A30, and NVIDIA A10.

On the desktop side, NVIDIA released the RTX A5000 and RTX A4000 to further bolster its professional graphics line.

Next-Generation RTX Technology
All the NVIDIA RTX GPUs feature the latest technologies in the NVIDIA Ampere architecture:

  • Second-generation RT Cores: Up to 2x the throughput of the previous generation, with the ability to run ray tracing, shading, and denoising tasks concurrently.
  • Third-generation Tensor Cores: Up to 2x the throughput of the previous generation, up to 10x with sparsity, with support for new TF32 and BFloat16 data formats.
  • CUDA Cores: Up to 2.5x the FP32 throughput of the previous generation for significant increases in graphics and compute workloads.
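
To make the TF32 and BFloat16 formats mentioned above more concrete, here is a minimal Python sketch. Both formats keep the 8-bit exponent of FP32 but store fewer mantissa bits (10 for TF32, 7 for BFloat16, versus 23 for FP32). The helper names are hypothetical, and the sketch simply truncates the extra bits, which is an assumption made for clarity; real hardware may round instead.

```python
import struct

FP32_MANTISSA_BITS = 23

def float_to_bits(x: float) -> int:
    """Return the IEEE 754 binary32 bit pattern of x as an unsigned integer."""
    return struct.unpack("<I", struct.pack("<f", x))[0]

def bits_to_float(bits: int) -> float:
    """Interpret an unsigned 32-bit integer as an IEEE 754 binary32 value."""
    return struct.unpack("<f", struct.pack("<I", bits))[0]

def truncate_mantissa(x: float, kept_bits: int) -> float:
    """Keep only the top `kept_bits` mantissa bits of x (illustrative truncation)."""
    dropped = FP32_MANTISSA_BITS - kept_bits
    mask = 0xFFFFFFFF & ~((1 << dropped) - 1)
    return bits_to_float(float_to_bits(x) & mask)

def to_tf32(x: float) -> float:
    """Approximate TF32 precision: 8 exponent bits, 10 mantissa bits."""
    return truncate_mantissa(x, 10)

def to_bf16(x: float) -> float:
    """Approximate BFloat16 precision: 8 exponent bits, 7 mantissa bits."""
    return truncate_mantissa(x, 7)

if __name__ == "__main__":
    value = 1 + 2**-9  # needs 9 mantissa bits to represent exactly
    print("FP32:", value)
    print("TF32:", to_tf32(value))  # survives: TF32 keeps 10 mantissa bits
    print("BF16:", to_bf16(value))  # collapses to 1.0: BF16 keeps only 7
```

The example value needs nine mantissa bits, so it passes through TF32 unchanged but loses its fractional part in BFloat16. That is the trade-off in brief: both formats preserve FP32's dynamic range, while TF32 retains somewhat more precision.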

The new NVIDIA RTX A5000

The NVIDIA® RTX™ A5000 combines the latest-generation RT Cores, Tensor Cores, and CUDA Cores with 24 GB of graphics memory, giving designers, engineers, and artists a capable desktop GPU for demanding professional work.


The new NVIDIA RTX A4000

NVIDIA positions the RTX A4000 as its most powerful single-slot GPU for professionals. It combines the latest-generation RT Cores, Tensor Cores, and CUDA Cores with 16 GB of graphics memory in a compact form factor that fits a wide range of systems.





NVIDIA RTX A5000 & NVIDIA RTX A4000 Specs Compared

| Specification | NVIDIA RTX A5000 | NVIDIA RTX A4000 |
| --- | --- | --- |
| Architecture | NVIDIA Ampere Architecture | NVIDIA Ampere Architecture |
| Foundry | Samsung | Samsung |
| Process Size | 8 nm | 8 nm |
| Transistors | 28.3 billion | 17.4 billion |
| Die Size | 628.4 mm² | 392.5 mm² |
| CUDA Parallel Processing Cores | 8,192 | 6,144 |
| NVIDIA Tensor Cores | 256 | 192 |
| NVIDIA RT Cores | 64 | 48 |
| Single-Precision Performance | 27.8 TFLOPS | 19.2 TFLOPS |
| RT Core Performance | 54.2 TFLOPS | 37.4 TFLOPS |
| Tensor Performance | 222.2 TFLOPS | 153.4 TFLOPS |
| GPU Memory | 24 GB GDDR6 with ECC | 16 GB GDDR6 with ECC |
| Memory Interface | 384-bit | 256-bit |
| Memory Bandwidth | 768 GB/s | 448 GB/s |
| Max Power Consumption | 230W | 140W |
| Graphics Bus | PCI Express 4.0 x16 | PCI Express 4.0 x16 |
| Display Connectors | DP 1.4 (4) | DP 1.4 (4) |
| Form Factor | 4.4" H x 10.5" L, dual slot | 4.4" H x 9.5" L, single slot |
| Product Weight | 1.025 kg | 500 g |
| Thermal Solution | Active | Active |
| vGPU Software Support | NVIDIA® Virtual PC/Virtual Applications (vPC/vApps), NVIDIA RTX® Virtual Workstation (vWS), NVIDIA Virtual Compute Server (vCS) | n/a |
| vGPU Profiles Supported | See vGPU Pricing & Licensing Guide | n/a |
| NVIDIA® 3D Vision® and 3D Vision Pro | Support via 3-pin mini DIN | Support via 3-pin mini DIN |
| Frame Lock | Compatible (with Quadro Sync II) | Compatible (with Quadro Sync II) |
| NVLink | 2-way low profile (2-slot and 3-slot bridges) | n/a |
| NVLink Interconnect | 112.5 GB/s (bidirectional) | n/a |
| Power Connector | 1x 8-pin PCIe | 1x 6-pin PCIe |
| NVENC / NVDEC | 1x / 2x (+AV1 decode) | 1x / 1x (+AV1 decode) |
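
A few derived figures make the comparison easier to read. The following sketch uses only numbers from the spec comparison above (27.8 and 19.2 TFLOPS single precision, 384-bit and 256-bit memory interfaces, 768 and 448 GB/s, 230W and 140W) to estimate the effective per-pin memory data rate and FP32 throughput per watt. The `GpuSpec` class and its method names are hypothetical, and the results are back-of-the-envelope arithmetic on peak figures rather than measured results.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class GpuSpec:
    """Peak figures copied from the spec comparison (illustrative container)."""
    name: str
    fp32_tflops: float
    bus_width_bits: int
    bandwidth_gb_s: float
    max_power_w: float

    def per_pin_gbps(self) -> float:
        """Effective data rate per memory pin: bytes/s -> bits/s, divided by bus width."""
        return self.bandwidth_gb_s * 8 / self.bus_width_bits

    def fp32_gflops_per_watt(self) -> float:
        """Peak single-precision throughput per watt of max board power."""
        return self.fp32_tflops * 1000 / self.max_power_w

A5000 = GpuSpec("RTX A5000", 27.8, 384, 768, 230)
A4000 = GpuSpec("RTX A4000", 19.2, 256, 448, 140)

if __name__ == "__main__":
    for gpu in (A5000, A4000):
        print(
            f"{gpu.name}: {gpu.per_pin_gbps():.1f} Gbps per pin, "
            f"{gpu.fp32_gflops_per_watt():.1f} GFLOPS/W (peak FP32)"
        )
```

On these peak numbers, the A5000's bandwidth advantage comes from both a wider bus and a faster per-pin rate, while the A4000 comes out slightly ahead on peak FP32 per watt.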


NVIDIA Adds New Data Center GPUs

For data center and server workloads, we’ll examine the new NVIDIA A30 and NVIDIA A10 GPUs. NVIDIA bills the A30 Tensor Core GPU as its most versatile mainstream compute GPU for AI inference and mainstream enterprise workloads.

NVIDIA A30 Tensor Core GPU

The A30 features NVIDIA Ampere architecture Tensor Core technology, and supports a broad range of math precisions, providing a single accelerator to speed up every workload.



NVIDIA A10 Tensor Core GPU

The A10 combines second-generation RT Cores, third-generation Tensor Cores, and new streaming multiprocessors with 24 GB of GDDR6 memory, all in a 150W power envelope, for versatile graphics, rendering, AI, and compute performance. From virtual workstations accessible anywhere in the world to render nodes and data centers running a variety of workloads, the A10 is built to deliver optimal performance in a single-wide, full-height, full-length PCIe form factor.


| Specification | NVIDIA A10 | NVIDIA A30 |
| --- | --- | --- |
| FP64 | n/a | 5.2 TF |
| FP64 Tensor Core | n/a | 10.3 TF |
| FP32 | 31.2 TF | 10.3 TF |
| TF32 Tensor Core | 62.5 TF / 125 TF* | 82 TF / 165 TF* |
| BFLOAT16 Tensor Core | 125 TF / 250 TF* | 165 TF / 330 TF* |
| FP16 Tensor Core | 125 TF / 250 TF* | 165 TF / 330 TF* |
| INT8 Tensor Core | 250 TOPS / 500 TOPS* | 330 TOPS / 661 TOPS* |
| INT4 Tensor Core | 500 TOPS / 1000 TOPS* | 661 TOPS / 1321 TOPS* |
| RT Cores | 72 | n/a |
| Encode / Decode | 1 encoder, 1 decoder (+AV1 decode) | n/a |
| Media Engines | n/a | 1 optical flow accelerator (OFA), 1 JPEG decoder (NVJPEG), 4 video decoders (NVDEC) |
| GPU Memory | 24 GB GDDR6 | 24 GB HBM2 |
| GPU Memory Bandwidth | 600 GB/s | 933 GB/s |
| Interconnect | PCIe Gen4: 64 GB/s | PCIe Gen4: 64 GB/s; third-gen NVIDIA® NVLINK®: 200 GB/s** |
| Form Factor | 1-slot FHFL | 2-slot FHFL |
| Max TDP Power | 150W | 165W |
| vGPU Software Support | NVIDIA vPC/vApps, NVIDIA RTX™ vWS, NVIDIA Virtual Compute Server (vCS) | NVIDIA AI Enterprise for VMware, NVIDIA Virtual Compute Server |
| Multi-Instance GPU (MIG) | n/a | 4 MIGs @ 6 GB each, 2 MIGs @ 12 GB each, 1 MIG @ 24 GB |
| Secure and Measured Boot with Hardware Root of Trust | Yes | Yes |
| NEBS Ready | Level 3 | Level 3 |
| Power Connector | PEX 8-pin | PEX 8-pin |

* With sparsity
** NVLink bridge for up to two GPUs
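
Two patterns in the data center specs can be checked with quick arithmetic. The sketch below assumes that the starred Tensor Core figures denote throughput with sparsity and that the unstarred figures are dense throughput. It computes the implied speedup for the A10 and confirms that each A30 MIG layout listed above accounts for the full 24 GB of memory. The dictionary and function names are hypothetical.

```python
# A10 Tensor Core throughput from the spec table: precision -> (dense, starred).
# Assumption: the starred figure is throughput with sparsity enabled.
A10_TENSOR_THROUGHPUT = {
    "TF32": (62.5, 125.0),
    "BFLOAT16": (125.0, 250.0),
    "FP16": (125.0, 250.0),
    "INT8": (250.0, 500.0),
    "INT4": (500.0, 1000.0),
}

# A30 MIG layouts from the spec table: (instance count, GB per instance).
A30_MIG_LAYOUTS = [(4, 6), (2, 12), (1, 24)]
A30_MEMORY_GB = 24

def sparsity_speedups(table: dict) -> dict:
    """Ratio of starred to dense throughput for each precision."""
    return {precision: sparse / dense for precision, (dense, sparse) in table.items()}

def mig_layout_total_gb(count: int, gb_per_instance: int) -> int:
    """Total memory claimed by a MIG layout."""
    return count * gb_per_instance

if __name__ == "__main__":
    for precision, ratio in sparsity_speedups(A10_TENSOR_THROUGHPUT).items():
        print(f"A10 {precision}: {ratio:.1f}x with sparsity")
    for count, size in A30_MIG_LAYOUTS:
        total = mig_layout_total_gb(count, size)
        print(f"A30 MIG {count} x {size} GB = {total} GB of {A30_MEMORY_GB} GB")
```

Under that assumption, every A10 Tensor Core precision shows the same 2x ratio between the two figures, and each A30 MIG layout divides the card's 24 GB evenly with nothing left over.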

Have any questions?
Contact Exxact Today

