The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filter

Family: Any · Qwen · Llama · DeepSeek · Mistral · Gemma · Phi · GLM · OLMo

Deployment: Any · Edge · Consumer · Workstation · Datacenter · Frontier

Modality: Any · Multimodal · Text-only

Coverage: Any · L1.25 enriched · Needs L1.25 · Needs benchmark

Filtered results (6)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

Llama 4 70B

Meta · 2026-02-10

70B · datacenter

production self-hosted serving on 2x A100 / H100

Verdict

EVA Llama 3.3 70B

EVA-Unit-01 community · 2025-08-22

70B · datacenter

datacenter-tier creative / narrative generation

Verdict

Llama 3.1 Nemotron 70B Instruct

NVIDIA · 2024-10-15

70B · datacenter

NVIDIA-fine-tuned Llama 3.1 70B

Verdict

Llama 3.2 90B Vision

Meta · 2024-09-25

90B · datacenter

datacenter-tier multimodal serving

Verdict · Multimodal

Llama 3.2 90B Vision Instruct

Meta · 2024-09-25

90B · datacenter

datacenter vision-language Llama at the 70B class

Verdict · Multimodal

Llama 3.1 70B Instruct

Meta · 2024-07-23

70B · datacenter

production self-hosted serving at the 70B class

Verdict

Going deeper

Ecosystem maps: structured landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks: recipes that combine models with runtimes and hardware.
Frontier index: a broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, and MCP.
Benchmarks: measured tokens-per-second and topology fields across hardware/model/runtime triples.