Identify your hardware. We compute which open-weight AI models you can run locally, how fast, and with what tradeoffs.
> Backed by real benchmarks when available; otherwise computed from memory bandwidth and model architecture. A confidence tier is shown next to every number.
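
To illustrate what "computed from memory bandwidth + model architecture" can mean, here is a minimal sketch of a common rule of thumb, not necessarily the formula used here. It assumes token generation is memory-bound, so each new token requires reading all active weights once. The function name, the `efficiency` factor, and the example numbers are hypothetical.

```python
def estimate_decode_tokens_per_second(
    bandwidth_gb_s: float,
    active_params_billion: float,
    bytes_per_param: float,
    efficiency: float = 0.7,
) -> float:
    """Rough decode-speed estimate for a memory-bound model.

    Assumption: every generated token streams all active weights from
    memory once, so speed is roughly usable bandwidth / weight size.
    `efficiency` is a hypothetical fudge factor for real-world overhead.
    """
    if bandwidth_gb_s <= 0 or active_params_billion <= 0 or bytes_per_param <= 0:
        raise ValueError("bandwidth, parameter count, and bytes per parameter must be positive")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in (0, 1]")

    # Billions of parameters times bytes per parameter gives gigabytes read per token.
    weight_gb = active_params_billion * bytes_per_param
    return bandwidth_gb_s * efficiency / weight_gb


# Hypothetical example: 1000 GB/s of bandwidth, an 8B-parameter model
# quantized to 4 bits (0.5 bytes per parameter), 50% efficiency.
print(estimate_decode_tokens_per_second(1000, 8, 0.5, efficiency=0.5))  # 125.0
```

An estimate like this ignores KV-cache reads, prompt processing, and multi-GPU communication, which is one reason a computed number would carry a lower confidence tier than a measured benchmark.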
Pick any GPUs (single/dual/mixed/multi-node), set CPU/RAM/OS, choose runtime + use case. We compute effective VRAM honestly, recommend a runtime, and tell you which models fit comfortably.
> all four fields required
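
The "effective VRAM" and "fits comfortably" checks above can be sketched as follows. This is an illustrative assumption, not the actual computation: the `Gpu` type, the per-GPU reserve, the `comfort_ratio` threshold, and the verdict labels are all hypothetical, and simply summing VRAM across GPUs ignores the cost of splitting a model across devices.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    name: str
    vram_gb: float


def effective_vram_gb(gpus: list[Gpu], reserve_gb_per_gpu: float = 1.0) -> float:
    """Sum usable VRAM, holding back a hypothetical per-GPU reserve
    for the driver, display, and runtime overhead."""
    return sum(max(gpu.vram_gb - reserve_gb_per_gpu, 0.0) for gpu in gpus)


def fit_verdict(
    model_gb: float,
    kv_cache_gb: float,
    available_gb: float,
    comfort_ratio: float = 0.85,
) -> str:
    """Classify a model as 'comfortable', 'tight', or 'does not fit'.

    Assumption: a model fits comfortably when weights plus KV cache use
    at most `comfort_ratio` of the effective VRAM.
    """
    needed_gb = model_gb + kv_cache_gb
    if needed_gb <= available_gb * comfort_ratio:
        return "comfortable"
    if needed_gb <= available_gb:
        return "tight"
    return "does not fit"


# Hypothetical mixed setup: one 24 GB card and one 16 GB card.
available = effective_vram_gb([Gpu("gpu-a", 24.0), Gpu("gpu-b", 16.0)])
print(available)                          # 38.0
print(fit_verdict(20.0, 4.0, available))  # comfortable
print(fit_verdict(30.0, 5.0, available))  # tight
print(fit_verdict(40.0, 4.0, available))  # does not fit
```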