nvidia
GPU
48GB VRAM
workstation
NVIDIA A40
Ampere workstation/datacenter hybrid. 48GB GDDR6.
Released 2020
Overview
Ampere workstation/datacenter hybrid. 48GB GDDR6.
Specs
| VRAM | 48 GB |
| Power draw | 300 W |
| Released | 2020 |
| MSRP | $5500 |
| Backends | CUDA |
Models that fit
Open-weight models small enough to run on NVIDIA A40 with usable context.
Frequently asked
What models can NVIDIA A40 run?
With 48GB VRAM, the NVIDIA A40 runs 70B models in 4-bit quantization, plus everything smaller. See the model list below for tested combinations.
Does NVIDIA A40 support CUDA?
Yes — NVIDIA A40 is an NVIDIA card with full CUDA support, the most mature local-AI backend. llama.cpp, Ollama, vLLM, and ExLlamaV2 all run natively.
Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.