RUNLOCALAI · v38

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP · Fredoline Eruo
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts: there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend.

© 2026 runlocalai.co · Independently operated
Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed
Filter
Family: Any · Qwen · Llama · DeepSeek · Mistral · Gemma · Phi · GLM · OLMo
Deployment: Any · Edge · Consumer · Workstation · Datacenter · Frontier
Modality: Any · Multimodal · Text-only
Coverage: Any · L1.25 enriched · Needs L1.25 · Needs benchmark

Filtered results (32)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

DeepSeek V4 Flash (284B MoE)

DeepSeek · 2026-04-24
284B/13B-A · datacenter

datacenter MoE — V4 efficiency variant

Verdict

Llama 4 Scout

Meta · 2026-04-05
109B · datacenter

production multimodal serving — image + text at workstation-cluster scale

L1.25 enriched · Verdict

GLM-5 Pro

Zhipu AI · 2026-02-18
144B/16B-A · datacenter

Chinese-language enterprise serving

Verdict

Nemotron 3 Super (120B-A12B)

NVIDIA · 2026-02-15
120B/12B-A · datacenter

NVIDIA-tuned datacenter-tier reasoning

Verdict

Llama 4 70B

Meta · 2026-02-10
70B · datacenter

production self-hosted serving on 2x A100 / H100

Verdict

Hermes 4 Llama 3.3 70B

Nous Research · 2025-12-22
70B · datacenter

datacenter-tier instruction-tuned alternative to base Llama 3.3

Verdict

Kimi K1.5

Moonshot AI · 2025-12-01
200B · datacenter

deep math + reasoning research

Verdict

Dolphin 3 Llama 3.3 70B

Cognitive Computations · 2025-09-30
70B · datacenter

datacenter creative / less-restricted generation

Verdict

EVA Llama 3.3 70B

EVA-Unit-01 community · 2025-08-22
70B · datacenter

datacenter-tier creative / narrative generation

Verdict

Qwen 2.5-VL 72B

Alibaba · 2025-03-10
72B · datacenter

frontier-tier multimodal serving

Verdict · Multimodal

DeepSeek R1 Distill Llama 70B

DeepSeek · 2025-01-20
70B · datacenter

datacenter-tier reasoning

Verdict

Llama 3.3 70B Instruct

Meta · 2024-12-06
70B · datacenter

production self-hosted serving at the 70B class — when you need general-purpose capability above 32B but don't need frontier-tier

L1.25 enriched · Verdict

InternVL 2.5 78B

OpenGVLab · 2024-12-05
78B · datacenter

datacenter-tier permissive VLM

Verdict · Multimodal

Tulu 3 70B

Allen Institute (AI2) · 2024-11-21
70B · datacenter

datacenter-tier open-recipe instruct

Verdict

Llama 3.1 Nemotron 70B Instruct

NVIDIA · 2024-10-15
70B · datacenter

NVIDIA-fine-tuned Llama 3.1 70B

Verdict

Llama 3.2 90B Vision

Meta · 2024-09-25
90B · datacenter

datacenter-tier multimodal serving

Verdict · Multimodal

Llama 3.2 90B Vision Instruct

Meta · 2024-09-25
90B · datacenter

datacenter vision-language Llama at 70B-class

Verdict · Multimodal

Molmo 72B

Allen Institute (AI2) · 2024-09-25
72B · datacenter

datacenter-tier open VLM for agent UI

Verdict · Multimodal

Qwen 2.5 Math 72B

Alibaba · 2024-09-19
72B · datacenter

datacenter-tier math specialist

Verdict

Qwen 2.5 72B Instruct

Alibaba · 2024-09-19
72B · datacenter

production multilingual at 70B-class

Verdict

DeepSeek V2.5 236B

DeepSeek · 2024-09-05
236B/21B-A · datacenter

DeepSeek lineage reference — pre-V3

Verdict

Command R+ (Aug 2024)

Cohere · 2024-08-30
104B · datacenter

research / non-commercial RAG workflows

Verdict

Command R+ 104B

Cohere · 2024-08-30
104B · datacenter

datacenter RAG-tuned at 100B class

Verdict

Hermes 3 Llama 3.1 70B

Nous Research · 2024-08-15
70B · datacenter

datacenter-tier Hermes — instruction following

Verdict

Mistral Large 2 (123B)

Mistral AI · 2024-07-24
123B · datacenter

datacenter dense Mistral flagship — pre-Medium-3.5

Verdict

Llama 3.1 70B Instruct

Meta · 2024-07-23
70B · datacenter

production self-hosted serving at the 70B class

Verdict

DeepSeek Coder V2 236B

DeepSeek · 2024-06-17
236B/21B-A · datacenter

datacenter-tier MoE coding

Verdict

OpenBioLLM Llama 3 70B

Saama Technologies · 2024-04-26
70B · datacenter

medical / clinical NLP

Verdict

Mixtral 8x22B Instruct

Mistral AI · 2024-04-17
141B/39B-A · datacenter

datacenter MoE — 39B active, 141B total

Verdict

WizardLM-2 8x22B

Microsoft (WizardLM team) · 2024-04-15
141B · datacenter

Mixtral 8x22B fine-tune — reasoning-tuned

Verdict

DBRX Instruct

Databricks · 2024-03-27
132B/36B-A · datacenter

Databricks-native enterprise inference

Verdict

DBRX Base

Databricks · 2024-03-27
132B/36B-A · datacenter

MoE fine-tuning base

Verdict
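
Reading the size tags: several cards above carry a two-part tag such as 132B/36B-A or 236B/21B-A. A minimal sketch of how such a tag could be parsed, assuming the first number is total parameters and the "-A" number is active parameters per token (a reading consistent with the Mixtral 8x22B card, which describes itself as "39B active, 141B total"). The names `SizeTag`, `parse_size_tag`, and `active_fraction` are hypothetical helpers for illustration, not part of any RunLocalAI tooling.

```python
import re
from typing import NamedTuple, Optional


class SizeTag(NamedTuple):
    total_b: float             # total parameters, in billions
    active_b: Optional[float]  # active parameters per token, in billions (None if not listed)


# Matches "70B" or "132B/36B-A". Treating "-A" as "active" is an assumption.
_TAG = re.compile(r"^(\d+(?:\.\d+)?)B(?:/(\d+(?:\.\d+)?)B-A)?$")


def parse_size_tag(tag: str) -> SizeTag:
    """Split a catalog size tag into total and (optional) active parameter counts."""
    m = _TAG.match(tag.strip())
    if m is None:
        raise ValueError(f"unrecognized size tag: {tag!r}")
    total = float(m.group(1))
    active = float(m.group(2)) if m.group(2) else None
    return SizeTag(total, active)


def active_fraction(tag: str) -> Optional[float]:
    """Share of parameters active per token, or None when the tag lists no active count."""
    size = parse_size_tag(tag)
    if size.active_b is None:
        return None
    return size.active_b / size.total_b
```

For example, `parse_size_tag("132B/36B-A")` returns `SizeTag(total_b=132.0, active_b=36.0)`, while a dense-style tag such as `"70B"` yields `active_b=None`. Under the assumed reading, the total figure is the one that bounds weight storage, and the active figure is the one that relates to per-token compute.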

Going deeper

  • Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
  • Execution stacks — recipes that combine models with runtimes + hardware.
  • Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
  • Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.