Build: Apple Mac Studio (M3 Ultra) + — + 32 GB RAM (windows)
Ranked by fit for agents use case + predicted speed. Click a row for VRAM breakdown.
ollama run hermes3:8bollama run dolphin-mistral:24bollama run qwen2.5:7bollama run mistral:7bollama run mistral-small:24bollama run llama3.1:8bTight VRAM, partial CPU offload, or context-limited.
ollama run qwen2.5:14bollama run qwen3:30bollama run qwen2.5-coder:32bollama run nemotron3:nanoollama run gemma4:26b-moeollama run qwen3:14bHypothetical scenarios. We re-ran the compatibility engine for each.
~$200–400 over base
On Apple Silicon, more unified memory is the only path forward — VRAM and system RAM are the same pool.
Some links above are affiliate links. We may earn a commission at no extra cost to you. How we make money.
Need more memory than you have. Shown for orientation.
Needs ~1024 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~256 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~160 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~420 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~192 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Want a specific benchmark we don't have? Email Contact support and we'll prioritize it.