Build: Apple Mac Studio (M3 Ultra) + 32 GB RAM (windows)
Ranked by fit for the long-context use case and predicted speed.
ollama run phi3.5:3.8b
ollama run gemma4:e4b
ollama run qwen2.5:7b
ollama run qwen3:8b

Tight VRAM, partial CPU offload, or context-limited.
ollama run nemotron3:nano
ollama run mistral-nemo:12b
ollama run qwen3:14b
ollama run qwen2.5:14b
ollama run gemma3:27b
ollama run qwen3:30b
ollama run phi4:14b

Hypothetical scenarios. We re-ran the compatibility engine for each.
~$200–400 over base
On Apple Silicon, more unified memory is the only path forward — VRAM and system RAM are the same pool.
Need more memory than you have. Shown for orientation.
Needs ~1024 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~256 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~160 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~420 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
Needs ~192 GB unified memory minimum at smallest quant; you have 24 GB available after OS overhead.
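The gap behind each line above is simple subtraction: the stated minimum minus the 24 GB left after OS overhead. A minimal sketch of that arithmetic follows, using only the figures listed on this page; the function name `shortfall_gb` is illustrative and not part of the compatibility engine.

```python
# Figures copied from the list above. Names here are illustrative
# assumptions, not the compatibility engine's actual code.
AVAILABLE_GB = 24  # unified memory available after OS overhead

# Minimum unified memory (GB) at smallest quant, in the order listed.
required_gb = [1024, 256, 160, 420, 192]


def shortfall_gb(required, available=AVAILABLE_GB):
    """Return how many GB short the machine is (0 if the model fits)."""
    return max(0, required - available)


shortfalls = [shortfall_gb(r) for r in required_gb]

for r, s in zip(required_gb, shortfalls):
    print(f"needs ~{r} GB, short by ~{s} GB")
```

Even the smallest entry is well over 100 GB short, which is why these models are shown for orientation only.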
Want a specific benchmark we don't have? Contact support and we'll prioritize it.