top of page

Schedule Contract # 47QTCA24D008H

NVIDIA® GPUs

The world's most electrifying accelerators

FP64 Workstation GPUs

NVIDIA A800 40GB Active

NVIDIA L4.png

The NVIDIA A800 40GB Active GPU accelerates data science, AI, and HPC workflows with 432 third-generation Tensor Cores to maximize AI performance and ultra-fast and efficient inference capabilities. With third-generation NVIDIA NVLink technology, A800 40GB Active offers scalable performance for heavy AI workloads, doubling the effective memory footprint and enabling GPU-to-GPU data transfers up to 400 GB/s of bidirectional bandwidth.

Visualization GPUs

NVIDIA RTX 2000 Ada Generation

NVIDIA RTX A6000.png

The NVIDIA RTX™ 2000 Ada Generation brings the cutting-edge Ada Lovelace architecture to more professionals, whether they use compact workstations or expansive full-sized towers, offering faster performance, advanced features, and up to 16GB of GPU memory. With its compact, power-efficient form factor, you can do your life’s work on a wide range of systems and free from limitations.

NVIDIA RTX 4500 Ada Generation

NVIDIA RTX A6000.png

The NVIDIA RTX™ 4500 Ada Generation is designed for professionals to tackle demanding creative, design, engineering, and scientific work from the desktop. Combining the latest generation of RT Cores, Tensor Cores, and CUDA® cores, alongside a generous 24GB of graphics memory, RTX 4500 unleashes powerful performance and efficiency for seamless productivity.

NVIDIA RTX 6000 Ada Generation

NVIDIA RTX A6000.png

The NVIDIA RTX™ 6000 Ada Generation is designed to deliver the next generation of AI graphics and petaflop inferencing performance for unprecedented speed-up of rendering, AI, graphics, and compute workloads.

NVIDIA RTX A4000

NVIDIA RTX A6000.png

The NVIDIA RTX™ A4000 built on the NVIDIA Ampere architecture is the most powerful single-slot GPU for professionals, delivering real-time ray tracing, AI-accelerated compute, and high-performance graphics performance to your desktop. You can engineer next-generation products, design cityscapes of the future, and create immersive entertainment experiences of tomorrow, today, from your desktop workstation.

NVIDIA RTX A6000

NVIDIA RTX A6000.png

Built on the NVIDIA Ampere architecture, the RTX™ A6000 combines 84 second-generation RT Cores, 336 third-generation Tensor Cores, and 10,752 CUDA cores with 48 GB of graphics memory. Connect two RTX A6000s with NVIDIA NVLink for 96 GB of combined GPU memory, engineer amazing products, design state-of-the art buildings, drive scientific breakthroughs, and create immersive entertainment.

NVIDIA RTX 4000 Ada Generation

NVIDIA RTX A6000.png

The NVIDIA RTX™ 4000 Ada Generation is the most powerful single-slot GPU for professionals, providing massive breakthroughs in speed and power efficiency to tackle demanding creative, design, and engineering workflows from the desktop. Harnessing the latest-generation RT Cores, Tensor Cores, and CUDA® cores alongside 20GB of graphics memory, RTX 4000 empowers professionals to create intricate product engineering, visionary cityscapes, and immersive entertainment.

NVIDIA RTX 5000 Ada Generation

NVIDIA RTX A6000.png

The NVIDIA RTX™ 5000 Ada Generation GPU, powered by the NVIDIA Ada Lovelace architecture, unlocks breakthroughs in generative AI and delivers the performance required to meet the challenges of today’s professional workflows. With 100 third-generation RT Cores, 400 fourth-generation Tensor Cores, 12,800 CUDA® cores, and 32GB of graphics memory, the RTX 5000 excels in rendering, AI, graphics, and compute workload performance.

NVIDIA RTX A2000 12GB

NVIDIA RTX A6000.png

NVIDIA RTX™ A2000 12GB based on Ampere architecture brings the power of RTX to more professionals with a powerful low-profile, dual-slot GPU design, delivering real-time ray tracing, AI-accelerated compute, and high-performance graphics to your desktop.

NVIDIA RTX A5000

NVIDIA RTX A6000.png

NVIDIA Ampere architecture brings the power of real-time ray tracing, AI, and advanced graphics to millions of designers, artists, scientists, and researchers. The NVIDIA RTX™ A5000 based on Ampere architecture perfectly balances power, performance, and memory to spearhead the future of innovation from your desktop.
Connect two RTX A5000s for 48 GB of combined GPU memory with NVIDIA NVLink, unlocking the ability to work with larger models, renders and scenes, tackle memory-intensive tasks like natural language processing, and run higher-fidelity simulations to enhance your product development process.

Data Center GPUs

NVIDIA H200

NVIDIA L4.png

The NVIDIA H200 Tensor Core GPU supercharges generative AI and high-performance computing (HPC) workloads with game-changing performance and memory capabilities. As the first GPU with HBM3e, the H200’s larger and faster memory fuels the acceleration of generative AI and large language models (LLMs) while advancing scientific computing for HPC workloads.

NVIDIA A16

NVIDIA L4.png

Take remote work to the next level with NVIDIA A16, the ideal GPU for high-density, graphics rich VDI. NVIDIA A16 has 4x the encoder throughput versus NVIDIA T4 to provide the best user experience on a single board. Based on the latest NVIDIA Ampere architecture, A16 is purpose-built to achieve the highest user density, with up to 64 concurrent users per board in a dual slot form factor.

NVIDIA H100 NVL

NVIDIA L4.png

Tap into exceptional performance, scalability, and security for every workload with the NVIDIA H100 NVL Tensor Core GPU. With 94GB of HBM3 memory per GPU, AI large language models are able to reach maximum deployment.

NVIDIA A2

NVIDIA L4.png

The NVIDIA A2 Tensor Core GPU provides entry-level inference with low power (40-60 watt), a small footprint (low profile PCIe Gen4), and high performance for intelligent video analytics (IVA) or NVIDIA® AI at the edge. Servers accelerated with A2 GPUs deliver up to 20X higher inference performance versus CPUs and 1.3x more efficient IVA deployments — all at an entry-level price point.

NVIDIA L40

NVIDIA L4.png

Based on the Ada Lovelace GPU architecture, the L40 features third-generation RT Cores that enhance real-time ray tracing capabilities, and fourth-generation Tensor Cores with support for the FP8 data format to deliver over a petaflop of inferencing performance.

NVIDIA L40S

NVIDIA L4.png

The NVIDIA L40S GPU, based on the Ada Lovelace architecture, is the most powerful universal GPU for the data center, delivering breakthrough multi-workload acceleration for large language model (LLM) inference and training, graphics, and video applications. As the premier platform for multi-modal generative AI, the L40S GPU provides end-to-end acceleration for inference, training, graphics, and video workflows to power the next generation of AI-enabled audio, speech, 2D, video, and 3D applications.

NVIDIA A10

NVIDIA L4.png

Built on the latest NVIDIA Ampere architecture, the NVIDIA A10 Tensor Core GPU combines second-generation RT Cores, third-generation Tensor Cores, and new streaming microprocessors with 24 gigabytes (GB) of GDDR6 memory – all in a 150W power envelope – for versatile graphics, rendering, AI, and compute performance.

NVIDIA T4

NVIDIA L4.png

Powered by NVIDIA Turing Tensor Cores, T4 provides multi-precision inference performance to accelerate the diverse applications of modern AI. T4 supports all AI frameworks and provides comprehensive tooling and integrations to the deployment of advanced AI.

NVIDIA L4

NVIDIA L4.png

L4 is optimized for video and inference at scale for a broad range of AI applications, including recommendations, voice-based AI avatar assistants, generative AI, visual search, and contact center automation. Servers equipped with L4 enable up to 120X higher AI Video performance over CPU solutions, while providing 2.7X more generative AI performance, and over 4X more graphics performance versus the previous generation.

bottom of page