Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filter

Family

Any Qwen Llama DeepSeek Mistral Gemma Phi GLM OLMo

Deployment

Any Edge Consumer Workstation Datacenter Frontier

Modality

Any Multimodal Text-only

Coverage

Any L1.25 enriched Needs L1.25 Needs benchmark

Filtered results (6)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

Llama 4 Scout

Meta · 2026-04-05

109Bdatacenter

production multimodal serving — image + text at workstation-cluster scale

L1.25 enrichedVerdict

Llama 4 70B

Meta · 2026-02-10

70Bdatacenter

production self-hosted serving on 2x A100 / H100

Verdict

EVA Llama 3.3 70B

EVA-Unit-01 community · 2025-08-22

70Bdatacenter

datacenter-tier creative / narrative generation

Verdict

Llama 3.3 70B Instruct

Meta · 2024-12-06

70Bdatacenter

production self-hosted serving at the 70B class — when you need general-purpose capability above 32B but don't need frontier-tier

L1.25 enrichedVerdict

Llama 3.1 Nemotron 70B Instruct

NVIDIA · 2024-10-15

70Bdatacenter

NVIDIA-fine-tuned Llama 3.1 70B

Verdict

Llama 3.1 70B Instruct

Meta · 2024-07-23

70Bdatacenter

production self-hosted serving at the 70B class

Verdict

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.