NVIDIA RTX PRO 4000 Blackwell for local AI

NVIDIA RTX PRO 4000 Blackwell

NVDA · HARDWARE

NVIDIA RTX PRO 4000 Blackwell

No editorial image yet — generic vendor mark shown. Credentials in spec table below.

Single-slot 140W Blackwell workstation card with 24GB GDDR7. The low-power, compact entry to the RTX PRO Blackwell line — fits small workstations and dense multi-card builds for local inference.

Released 2025·672 GB/s memory bandwidth

What it does well

The RTX PRO 4000 Blackwell is the efficiency pick of the workstation line: 24GB CUDA VRAM in a single-slot, 140W card. That combination is rare and valuable — it drops into compact or SFF workstations, and its low power + single-slot width make it ideal for dense multi-card inference servers where a 4090/5090 would be too hot and wide. 24GB runs 32B-class models at Q4 and most diffusion workloads comfortably, with full CUDA and ECC.

Where it struggles

At ~$1,500 it's far pricier than a used RTX 3090 (also 24GB) or a new RTX 5070 Ti/5080-class card, and its 140W power budget caps raw throughput below those higher-wattage parts — you're paying for efficiency, single-slot density, ECC, and pro drivers, not speed. For a single hobbyist inference box, cheaper 24GB options give more tokens/sec/dollar.

Bottom line

The card to buy when you need 24GB of CUDA in a single slot at low power — compact workstations and multi-GPU inference racks. For a standalone budget 24GB build, a used 3090 remains the value king.

Frequently asked

What models can NVIDIA RTX PRO 4000 Blackwell run?

With 24GB VRAM, the NVIDIA RTX PRO 4000 Blackwell runs models up to ~32B in 4-bit, with room for context. See the model list below for tested combinations.

Does NVIDIA RTX PRO 4000 Blackwell support CUDA?

Yes — NVIDIA RTX PRO 4000 Blackwell is an NVIDIA card with full CUDA support, the most mature local-AI backend. llama.cpp, Ollama, vLLM, and ExLlamaV2 all run natively.

VRAM	24 GB
Power draw (peak)	140 W
Released	2025
MSRP	$1500
Backends	CUDA Vulkan

NVIDIA RTX PRO 4000 Blackwell

Our verdict

What it does well

Where it struggles

Bottom line

Overview

Specs

Models that fit

Frequently asked

What models can NVIDIA RTX PRO 4000 Blackwell run?

Does NVIDIA RTX PRO 4000 Blackwell support CUDA?

Where next?

Hardware worth comparing