The frontier of open-weight model releases
Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.
By Fredoline Eruo · Refreshed continuously from catalog seed
Filtered results (6)
Llama 4 70B
Meta · 2026-02-10
70B · datacenter
production self-hosted serving on 2x A100 / H100
Verdict
EVA Llama 3.3 70B
EVA-Unit-01 community · 2025-08-22
70B · datacenter
datacenter-tier creative / narrative generation
Verdict
Llama 3.1 Nemotron 70B Instruct
NVIDIA · 2024-10-15
70B · datacenter
NVIDIA-fine-tuned Llama 3.1 70B
Verdict
Llama 3.2 90B Vision
Meta · 2024-09-25
90B · datacenter
datacenter-tier multimodal serving
Verdict · Multimodal
Llama 3.2 90B Vision Instruct
Meta · 2024-09-25
90B · datacenter
datacenter vision-language Llama at the 90B class
Verdict · Multimodal
Llama 3.1 70B Instruct
Meta · 2024-07-23
70B · datacenter
production self-hosted serving at the 70B class
Verdict
Going deeper
- Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
- Execution stacks — recipes that combine models with runtimes + hardware.
- Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
- Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.