Every catalog hardware unit ranked by a composite score (0–1000) built from four components: measured tok/s, VRAM fit, ecosystem support, and perf-per-watt. Only 2 of 154 ranks are anchored to a measured benchmark; the rest are flagged as extrapolated or estimated.
Methodology: /methodology · Run your own: curl -fsSL runlocalai.co/bench.mjs -o bench.mjs && node bench.mjs
53 units shown · sorted by score
| # | Hardware | Vendor | Class | VRAM | Tier | Score | Data |
|---|---|---|---|---|---|---|---|
| 1 | NVIDIA RTX A6000 (Ampere) | NVIDIA | workstation | 48 GB | C | 477 | Estimated |
| 2 | NVIDIA GeForce RTX 5070 Ti | NVIDIA | high | 16 GB | C | 477 | Estimated |
| 3 | NVIDIA RTX A5000 | NVIDIA | workstation | 24 GB | C | 468 | Estimated |
| 4 | NVIDIA A40 | NVIDIA | workstation | 48 GB | C | 458 | Estimated |
| 5 | Apple M4 Max | Apple | enthusiast | not listed | C | 457 | Estimated |
| 6 | NVIDIA GeForce RTX 3080 Ti | NVIDIA | enthusiast | 12 GB | C | 456 | Estimated |
| 7 | NVIDIA GeForce RTX 3080 12GB | NVIDIA | high | 12 GB | C | 456 | Estimated |
| 8 | NVIDIA RTX PRO 4000 Blackwell | NVIDIA | workstation | 24 GB | C | 455 | Estimated |
| 9 | MacBook Pro 16" M4 Max | Apple | enthusiast | not listed | C | 445 | Estimated |
| 10 | Apple Mac Studio (M4 Max) | Apple | enthusiast | not listed | C | 438 | Estimated |
| 11 | NVIDIA GeForce RTX 4080 Super | NVIDIA | high | 16 GB | C | 433 | Estimated |
| 12 | NVIDIA GeForce RTX 4080 | NVIDIA | high | 16 GB | C | 428 | Estimated |
| 13 | AMD Radeon RX 7900 XTX | AMD | enthusiast | 24 GB | C | 420 | Estimated |
| 14 | NVIDIA GeForce RTX 4070 Ti Super | NVIDIA | high | 16 GB | C | 418 | Estimated |
| 15 | NVIDIA RTX 5000 Ada Generation | NVIDIA | workstation | 32 GB | C | 414 | Estimated |
| 16 | NVIDIA RTX 2080 Ti 22GB (China-mod) | NVIDIA | mid | 22 GB | C | 405 | Estimated |
| 17 | Apple M1 Max | Apple | high | not listed | C | 404 | Estimated |
| 18 | Apple M2 Max | Apple | high | not listed | C | 400 | Estimated |
| 19 | NVIDIA GeForce RTX 4090 Mobile | NVIDIA | enthusiast | 16 GB | C | 400 | Estimated |
| 20 | NVIDIA GeForce RTX 5070 | NVIDIA | mid | 12 GB | C | 399 | Estimated |
| 21 | Apple M3 Max | Apple | enthusiast | not listed | C | 398 | Estimated |
| 22 | NVIDIA GeForce RTX 3080 10GB | NVIDIA | high | 10 GB | C | 397 | Estimated |
| 23 | AMD Radeon RX 7900 XT | AMD | enthusiast | 20 GB | C | 365 | Estimated |
| 24 | NVIDIA GeForce RTX 5060 Ti 16GB | NVIDIA | mid | 16 GB | C | 364 | Estimated |
| 25 | NVIDIA GeForce RTX 2080 Ti | NVIDIA | enthusiast | 11 GB | C | 363 | Estimated |
| 26 | NVIDIA L4 | NVIDIA | workstation | 24 GB | C | 360 | Estimated |
| 27 | NVIDIA GeForce RTX 3070 Ti | NVIDIA | high | 8 GB | C | 358 | Estimated |
| 28 | NVIDIA GeForce RTX 4070 | NVIDIA | mid | 12 GB | C | 356 | Estimated |
| 29 | NVIDIA GeForce RTX 4070 Super | NVIDIA | mid | 12 GB | C | 355 | Estimated |
| 30 | Apple M4 Pro | Apple | high | not listed | C | 351 | Estimated |
| 31 | NVIDIA GeForce RTX 4070 Ti | NVIDIA | high | 12 GB | C | 351 | Estimated |
| 32 | Apple Mac Mini (M4 Pro) | Apple | high | not listed | C | 340 | Estimated |
| 33 | AMD Radeon RX 9060 XT | AMD | mid | 16 GB | C | 339 | Estimated |
| 34 | NVIDIA GeForce RTX 5070 Laptop GPU | NVIDIA | high | 12 GB | C | 337 | Estimated |
| 35 | AMD Radeon RX 9070 XT | AMD | high | 16 GB | C | 332 | Estimated |
| 36 | AMD Radeon RX 9070 | AMD | high | 16 GB | C | 332 | Estimated |
| 37 | NVIDIA GeForce RTX 2080 Super | NVIDIA | high | 8 GB | C | 330 | Estimated |
| 38 | AMD Radeon RX 7800 XT | AMD | high | 16 GB | C | 329 | Estimated |
| 39 | NVIDIA GeForce GTX 1080 Ti | NVIDIA | high | 11 GB | C | 327 | Estimated |
| 40 | NVIDIA GeForce RTX 5060 | NVIDIA | entry | 8 GB | C | 326 | Estimated |
| 41 | NVIDIA GeForce RTX 2070 | NVIDIA | high | 8 GB | C | 323 | Estimated |
| 42 | NVIDIA GeForce RTX 2060 Super | NVIDIA | mid | 8 GB | C | 323 | Estimated |
| 43 | NVIDIA GeForce RTX 5060 Ti 8GB | NVIDIA | mid | 8 GB | C | 322 | Estimated |
| 44 | NVIDIA GeForce RTX 3060 Ti | NVIDIA | high | 8 GB | C | 321 | Estimated |
| 45 | NVIDIA GeForce RTX 4060 Ti 16GB | NVIDIA | mid | 16 GB | C | 320 | Estimated |
| 46 | NVIDIA GeForce RTX 3060 12GB | NVIDIA | mid | 12 GB | C | 319 | Estimated |
| 47 | NVIDIA GeForce RTX 3070 | NVIDIA | mid | 8 GB | C | 319 | Estimated |
| 48 | AMD Radeon RX 7900 GRE | AMD | high | 16 GB | C | 319 | Estimated |
| 49 | NVIDIA GeForce RTX 2070 Super | NVIDIA | high | 8 GB | C | 319 | Estimated |
| 50 | AMD Radeon RX 6950 XT | AMD | enthusiast | 16 GB | C | 316 | Estimated |
| 51 | AMD Radeon RX 6800 | AMD | high | 16 GB | C | 304 | Estimated |
| 52 | AMD Radeon RX 6800 XT | AMD | enthusiast | 16 GB | C | 302 | Estimated |
| 53 | AMD Radeon RX 6900 XT | AMD | enthusiast | 16 GB | C | 302 | Estimated |
Measured tok/s: steady-state tok/s on a representative 7B/8B Q4 model, taken from real benchmark rows or extrapolated from VRAM bandwidth × runtime-stack efficiency.
VRAM fit: how comfortably the rig holds 7B / 32B / 70B class models. Apple unified memory counts, as does system RAM on NPU/SoC devices.
Ecosystem support: CUDA / MLX / ROCm / Vulkan reach, meaning the real-world friction the operator hits when installing tools.
Perf-per-watt: tok/s per watt. Mobile / NPU class hardware scores well; dense desktop GPUs trade efficiency for absolute throughput.
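The throughput, memory-fit, and efficiency components above can be reasoned about with some back-of-envelope arithmetic. The Python sketch below is an illustration only, not the site's actual formula: it assumes token generation is memory-bandwidth-bound (each generated token reads every weight once), and the 4.5 bits per weight for a Q4-class quant, the 0.6 efficiency factor, the 1.5 GB overhead allowance, and all function names are hypothetical.

```python
def model_size_gb(params_billion: float, bits_per_weight: float = 4.5) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes).

    bits_per_weight=4.5 is an assumed average for a Q4-class quantization.
    """
    # (params_billion * 1e9 weights) * (bits / 8 bytes) / 1e9 bytes per GB
    return params_billion * bits_per_weight / 8.0


def extrapolated_tok_s(
    bandwidth_gb_s: float,
    params_billion: float,
    efficiency: float = 0.6,  # assumed runtime-stack efficiency, not a measured value
    bits_per_weight: float = 4.5,
) -> float:
    """Bandwidth-bound estimate: usable bandwidth divided by bytes read per token."""
    return bandwidth_gb_s * efficiency / model_size_gb(params_billion, bits_per_weight)


def fits(
    memory_gb: float,
    params_billion: float,
    overhead_gb: float = 1.5,  # assumed allowance for KV cache and runtime buffers
    bits_per_weight: float = 4.5,
) -> bool:
    """True if the quantized weights plus the overhead allowance fit in memory."""
    return model_size_gb(params_billion, bits_per_weight) + overhead_gb <= memory_gb


def tok_s_per_watt(tok_s: float, watts: float) -> float:
    """Efficiency component: throughput divided by power draw."""
    return tok_s / watts


if __name__ == "__main__":
    # Which model classes fit on a hypothetical 24 GB device?
    for params in (7, 32, 70):
        print(f"{params}B: ~{model_size_gb(params):.1f} GB, fits in 24 GB: {fits(24.0, params)}")
    # Hypothetical device with 900 GB/s of memory bandwidth running an 8B model.
    print(f"estimated tok/s: {extrapolated_tok_s(900.0, 8.0):.0f}")
```

Under these assumptions a 24 GB device holds the 7B and 32B classes but not 70B, which is why memory capacity and bandwidth can pull a ranking in different directions.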
A confidence multiplier (1.0 measured · 0.85 extrapolated · 0.7 estimated) discounts the headline score so we don't pretend to know more than we do. The score is recomputed on every page load against the latest catalog and benchmark data. To firm up the numbers, submit your own run with runlocalai-bench --submit --hardware your-rig.
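To make the discount concrete, here is a minimal Python sketch of applying the confidence multiplier. The three multiplier values come from the text above; the function name, the assumption that the undiscounted composite is already on the 0–1000 scale, and the rounding and clamping are illustrative choices, since the page does not say how the four components are weighted or combined.

```python
# Multipliers as stated on the page, keyed by the value in the Data column.
CONFIDENCE = {"measured": 1.0, "extrapolated": 0.85, "estimated": 0.7}


def headline_score(raw_score: float, data_quality: str) -> int:
    """Discount a raw composite (assumed 0-1000) by the data-quality multiplier."""
    multiplier = CONFIDENCE[data_quality.lower()]  # raises KeyError on unknown labels
    discounted = raw_score * multiplier
    # Clamping and rounding are assumptions made for this sketch.
    return round(max(0.0, min(1000.0, discounted)))


if __name__ == "__main__":
    # The same hypothetical raw composite under each data-quality label.
    for label in ("Measured", "Extrapolated", "Estimated"):
        print(label, headline_score(680.0, label))
```

One consequence of this scheme is that an entry marked Estimated cannot exceed 700, however strong its raw composite is.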