nvidia
GPU
16GB VRAM
high

NVIDIA GeForce RTX 4080

Original 4080. 16GB GDDR6X. Still capable for 14B–32B Q4 work.

Released 2022·~$1099 street

Overview

Original 4080. 16GB GDDR6X. Still capable for 14B–32B Q4 work.

Where to buy
Geo-routed to your region. Approx. $1099.

Some links above are affiliate links. We may earn a commission at no extra cost to you. How we make money.

Specs

VRAM16 GB
Power draw320 W
Released2022
MSRP$1199
Backends
CUDA
Vulkan

Models that fit

Open-weight models small enough to run on NVIDIA GeForce RTX 4080 with usable context.

Compare alternatives

Hardware worth comparing

Same VRAM tier and the one step above and below — so you can frame the buying decision against real options.

Same VRAM tier
Cards in the same memory band
Step down
Less VRAM — cheaper, more constrained
No verdicted hardware in the next tier down yet.

Frequently asked

What models can NVIDIA GeForce RTX 4080 run?

With 16GB VRAM, the NVIDIA GeForce RTX 4080 runs models up to 14B in 4-bit, or 7B at higher quantizations. See the model list below for tested combinations.

Does NVIDIA GeForce RTX 4080 support CUDA?

Yes — NVIDIA GeForce RTX 4080 is an NVIDIA card with full CUDA support, the most mature local-AI backend. llama.cpp, Ollama, vLLM, and ExLlamaV2 all run natively.

How much does NVIDIA GeForce RTX 4080 cost?

Current street price for NVIDIA GeForce RTX 4080 is around $1099 (MSRP $1199). Prices vary by region and supply.

Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.