
NVIDIA® GPUs
The world's most electrifying accelerators

Visualization GPUs
NVIDIA RTX A5000
NVIDIA Ampere architecture brings the power of real-time ray tracing, AI, and advanced graphics to millions of designers, artists, scientists, and researchers. The NVIDIA RTX™ A5000 based on Ampere architecture perfectly balances power, performance, and memory to spearhead the future of innovation from your desktop.
Connect two RTX A5000s for 48 GB of combined GPU memory with NVIDIA NVLink, unlocking the ability to work with larger models, renders and scenes, tackle memory-intensive tasks like natural language processing, and run higher-fidelity simulations to enhance your product development process.
NVIDIA RTX A4000
The NVIDIA RTX™ A4000 built on the NVIDIA Ampere architecture is the most powerful single-slot GPU for professionals, delivering real-time ray tracing, AI-accelerated compute, and high-performance graphics performance to your desktop. You can engineer next-generation products, design cityscapes of the future, and create immersive entertainment experiences of tomorrow, today, from your desktop workstation.
NVIDIA RTX A6000
Built on the NVIDIA Ampere architecture, the RTX™ A6000 combines 84 second-generation RT Cores, 336 third-generation Tensor Cores, and 10,752 CUDA cores with 48 GB of graphics memory. Connect two RTX A6000s with NVIDIA NVLink for 96 GB of combined GPU memory, engineer amazing products, design state-of-the art buildings, drive scientific breakthroughs, and create immersive entertainment.
Accelerator GPUs
NVIDIA A40
Built on the NVIDIA Ampere architecture, the A40 combines the latest generation RT Cores, Tensor Cores, and CUDA Cores with 48GB of graphics memory for unprecedented graphics in advanced technologies such as real-time ray tracing, AI, compute, simulation, and VR are common across industries.
NVIDIA A16
Take remote work to the next level with NVIDIA A16, the ideal GPU for high-density, graphics rich VDI. NVIDIA A16 has 4x the encoder throughput versus NVIDIA T4 to provide the best user experience on a single board. Based on the latest NVIDIA Ampere architecture, A16 is purpose-built to achieve the highest user density, with up to 64 concurrent users per board in a dual slot form factor.
NVIDIA H100
NVIDIA H100 Tensor Core GPU accelerates exascale scale workloads with a dedicated Transformer Engine for trillion parameter language models. For small jobs, H100 can be partitioned down to right-sized Multi-Instance GPU (MIG) partitions. The NVIDIA™ Hopper architecture with the Transformer Engine, accelerates in an order of magnitude speedup over the prior generation.
NVIDIA A100 LIQUID COOLED PCIe
NVIDIA A100 Liquid Cooled GPU uses one PCIe slot. By recycle small amounts of fluids in closed systems focused on key hot spots, while using far less power—at every scale—to propel the world's highest performing elastic data centers for AI, data analytics, and high performance computing (HPC) applications.
NVIDIA A2
The NVIDIA A2 Tensor Core GPU provides entry-level inference with low power (40-60 watt), a small footprint (low profile PCIe Gen4), and high performance for intelligent video analytics (IVA) or NVIDIA® AI at the edge. Servers accelerated with A2 GPUs deliver up to 20X higher inference performance versus CPUs and 1.3x more efficient IVA deployments — all at an entry-level price point.
NVIDIA L4
L4 is optimized for video and inference at scale for a broad range of AI applications, including recommendations, voice-based AI avatar assistants, generative AI, visual search, and contact center automation. Servers equipped with L4 enable up to 120X higher AI Video performance over CPU solutions, while providing 2.7X more generative AI performance, and over 4X more graphics performance versus the previous generation.
NVIDIA A10
Built on the latest NVIDIA Ampere architecture, the NVIDIA A10 Tensor Core GPU combines second-generation RT Cores, third-generation Tensor Cores, and new streaming microprocessors with 24 gigabytes (GB) of GDDR6 memory – all in a 150W power envelope – for versatile graphics, rendering, AI, and compute performance.