nvidia
GPU
48GB VRAM
workstation

NVIDIA A40

Ampere workstation/datacenter hybrid. 48GB GDDR6.

Released 2020

Overview

Ampere workstation/datacenter hybrid. 48GB GDDR6.

Specs

VRAM48 GB
Power draw300 W
Released2020
MSRP$5500
Backends
CUDA

Models that fit

Open-weight models small enough to run on NVIDIA A40 with usable context.

Frequently asked

What models can NVIDIA A40 run?

With 48GB VRAM, the NVIDIA A40 runs 70B models in 4-bit quantization, plus everything smaller. See the model list below for tested combinations.

Does NVIDIA A40 support CUDA?

Yes — NVIDIA A40 is an NVIDIA card with full CUDA support, the most mature local-AI backend. llama.cpp, Ollama, vLLM, and ExLlamaV2 all run natively.

Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.