RUNLOCALAI · v38

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts: there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend.

© 2026 runlocalai.co · Independently operated
Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed
Filter
Family: Any · Qwen · Llama · DeepSeek · Mistral · Gemma · Phi · GLM · OLMo
Deployment: Any · Edge · Consumer · Workstation · Datacenter · Frontier
Modality: Any · Multimodal · Text-only
Coverage: Any · L1.25 enriched · Needs L1.25 · Needs benchmark

Filtered results (15)


Ring-2.6-1T

InclusionAI / Ant Group · 2026-05-14
1000B/32B-A · frontier

frontier reasoning at MoE serving cost

Verdict

Qwen 3.5 235B-A17B (MoE)

Alibaba · 2026-05-01
397B/17B-A · frontier

frontier-tier reasoning + multilingual serving on multi-machine clusters

L1.25 enriched · Verdict

Mistral Medium 3.5 (675B MoE)

Mistral AI · 2026-04-29
675B/41B-A · frontier

frontier MoE — Mistral's response to the open MoE wave

Verdict

DeepSeek V4 Pro (1.6T MoE)

DeepSeek · 2026-04-24
1600B/49B-A · frontier

frontier-tier coding + reasoning serving — currently the open-weight ceiling

L1.25 enriched · Verdict

DeepSeek V4

DeepSeek AI · 2026-03-15
745B/38B-A · frontier

frontier-tier reasoning on multi-machine clusters

Verdict

Kimi K2.6

Moonshot AI · 2026-03-10
1000B · frontier

Moonshot frontier MoE — long-context specialist

Verdict

Llama 4 405B

Meta · 2026-02-10
405B · frontier

frontier-tier serving on cluster hardware

Verdict

GLM-5

Zhipu AI (Z.AI) · 2026-02-05
200B · frontier

Zhipu GLM-5 frontier MoE

Verdict

Step-3

StepFun · 2025-09-30
1000B/38B-A · frontier

frontier-research workloads

Verdict

Qwen 3 235B-A22B

Alibaba · 2025-04-29
235B · frontier

Qwen 3 MoE flagship — pre-3.5 baseline

Verdict

Llama 3.1 Nemotron Ultra 253B

NVIDIA · 2025-04-08
253B · frontier

frontier-tier Nemotron-Llama

Verdict

DeepSeek R1 (671B reasoning)

DeepSeek · 2025-01-20
671B · frontier

frontier-tier reasoning research; cluster-only deployment

L1.25 enriched · Verdict

DeepSeek V3 (671B MoE)

DeepSeek · 2024-12-26
671B · frontier

frontier-tier MoE serving — pre-V4 baseline

Verdict

Hunyuan Large 389B MoE

Tencent · 2024-11-05
389B/52B-A · frontier

frontier-tier serving with Tencent license tolerance

Verdict

Jamba 1.5 Large

AI21 Labs · 2024-08-22
398B/94B-A · frontier

frontier-tier long-context

Verdict
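
The size labels on the cards above (for example 1000B/32B-A or 405B) are not defined on this page. A plausible reading, stated here as an assumption, is that the first number is total parameters in billions and the optional second number, marked with -A, is the active parameters per token for MoE models. Under that assumption, a minimal Python sketch for parsing a label and computing the active share might look like this. The names `SizeLabel`, `parse_size_label`, and `active_fraction` are hypothetical and not part of any RunLocalAI tooling.

```python
import re
from typing import NamedTuple, Optional


class SizeLabel(NamedTuple):
    """Parsed card size label (assumed meaning: billions of parameters)."""
    total_b: float
    active_b: Optional[float]  # None when the card shows no "-A" figure


# Matches "405B" or "1000B/32B-A"; the "/<n>B-A" part is optional.
_LABEL = re.compile(r"^(\d+(?:\.\d+)?)B(?:/(\d+(?:\.\d+)?)B-A)?$")


def parse_size_label(label: str) -> SizeLabel:
    """Split a label such as '1000B/32B-A' into total and active counts."""
    match = _LABEL.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized size label: {label!r}")
    total = float(match.group(1))
    active = float(match.group(2)) if match.group(2) else None
    return SizeLabel(total, active)


def active_fraction(size: SizeLabel) -> Optional[float]:
    """Share of parameters active per token, or None if not listed."""
    if size.active_b is None:
        return None
    return size.active_b / size.total_b


if __name__ == "__main__":
    for raw in ("1000B/32B-A", "1600B/49B-A", "405B"):
        parsed = parse_size_label(raw)
        print(raw, parsed, active_fraction(parsed))
```

A card without an active figure returns `None` for the fraction, so the sketch does not guess whether that model is dense or simply has no active count listed.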

Going deeper

  • Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
  • Execution stacks — recipes that combine models with runtimes + hardware.
  • Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
  • Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.