AMD EPYC 9005 (Zen 5, Turin)
No editorial image yet — generic vendor mark shown. Credentials in spec table below.
AMD's Zen 5 server CPU: up to 192 cores, 12-channel DDR5-6400 (~614 GB/s per socket), supporting terabyte-scale RAM. Anchors the CPU-inference strategy — running the full DeepSeek-R1 671B entirely in system memory.
Sub-scores sum to 230 / 1000. Headline = 230 × 0.70 (Estimated-confidence discount) = 161. This is an algorithmic performance-tier score — distinct from, and often lower than, the editorial “Our verdict” below, which weighs value and real-world fit (especially for hardware we haven’t measured yet). How scoring works →
Extrapolated from 614 GB/s bandwidth — 61.4 tok/s estimated. No measured benchmarks yet.
Plain-English: Doesn't fit modern chat models usefully — vision models won't fit.
Verdicts extrapolated from catalog VRAM + bandwidth + ecosystem flags. Hover any chip for the rationale. Want measured numbers? Submit your own run with runlocalai-bench --submit.
What it is
EPYC 9005 'Turin' is AMD's Zen 5 server platform: up to 192 cores, 12-channel DDR5-6400 for ~614 GB/s of memory bandwidth per socket, and support for terabyte-plus RAM capacities. It's the anchor of the catalog's CPU-inference category (previously empty).
Relevance to local AI
CPU inference is a real, distinct local-AI strategy that GPUs can't match at the extreme high end: with 768GB-1.5TB of system RAM, an EPYC 9005 box runs the full DeepSeek-R1 671B (and other giant MoE models) entirely in memory — something no single GPU and even few unified-memory machines can do. The tradeoff is speed: token generation is bandwidth-bound and slow (single-digit to low-double-digit tok/s on the largest models), but for running models that simply don't fit anywhere else, or for batch/offline workloads, a big-RAM EPYC is the answer. MoE models especially benefit since only a fraction of params activate per token.
Bottom line
The go-to for fitting the absolute largest models (671B-class) locally via system RAM, where capacity beats speed. An enthusiast/datacenter pick — slow per-token but uniquely capable of models nothing else can hold.
Overview
AMD's Zen 5 server CPU: up to 192 cores, 12-channel DDR5-6400 (~614 GB/s per socket), supporting terabyte-scale RAM. Anchors the CPU-inference strategy — running the full DeepSeek-R1 671B entirely in system memory.
Search-fallback link — editorial hasn't yet curated a retailer URL for this card.
Some links above are affiliate links. We may earn a commission at no extra cost to you. How we make money.
Specs
| System RAM (typical) | 768 GB |
| Power draw (peak) | 400 W |
| Released | 2024 |
| Backends |
Models that fit
Open-weight models small enough to run on AMD EPYC 9005 (Zen 5, Turin) with usable context.
Frequently asked
Does AMD EPYC 9005 (Zen 5, Turin) support CUDA?
Where next?
Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.