Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend.

© 2026 runlocalai.co · Independently operated

Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filtered results (32)

DeepSeek V4 Flash (284B MoE)

DeepSeek · 2026-04-24

284B/13B-A · datacenter

datacenter MoE — V4 efficiency variant

Llama 4 Scout

Meta · 2026-04-05

production multimodal serving — image + text at workstation-cluster scale

L1.25 enriched · Verdict

GLM-5 Pro

Zhipu AI · 2026-02-18

144B/16B-A · datacenter

Chinese-language enterprise serving

Nemotron 3 Super (120B-A12B)

NVIDIA · 2026-02-15

NVIDIA-tuned datacenter-tier reasoning

Llama 4 70B

Meta · 2026-02-10

production self-hosted serving on 2x A100 / H100

Hermes 4 Llama 3.3 70B

Nous Research · 2025-12-22

datacenter-tier instruction-tuned alternative to base Llama 3.3

Kimi K1.5

Moonshot AI · 2025-12-01

deep math + reasoning research

Dolphin 3 Llama 3.3 70B

Cognitive Computations · 2025-09-30

datacenter creative / less-restricted generation

EVA Llama 3.3 70B

EVA-Unit-01 community · 2025-08-22

datacenter-tier creative / narrative generation

Qwen 2.5-VL 72B

Alibaba · 2025-03-10

frontier-tier multimodal serving

Verdict · Multimodal

DeepSeek R1 Distill Llama 70B

DeepSeek · 2025-01-20

datacenter-tier reasoning

Llama 3.3 70B Instruct

Meta · 2024-12-06

production self-hosted serving at the 70B class — when you need general-purpose capability above 32B but don't need frontier-tier

L1.25 enriched · Verdict

InternVL 2.5 78B

OpenGVLab · 2024-12-05

datacenter-tier permissive VLM

Verdict · Multimodal

Tulu 3 70B

Allen Institute (AI2) · 2024-11-21

datacenter-tier open-recipe instruct

Llama 3.1 Nemotron 70B Instruct

NVIDIA · 2024-10-15

NVIDIA-fine-tuned Llama 3.1 70B

Llama 3.2 90B Vision

Meta · 2024-09-25

datacenter-tier multimodal serving

Verdict · Multimodal

Llama 3.2 90B Vision Instruct

Meta · 2024-09-25

datacenter vision-language Llama at 70B-class

Verdict · Multimodal

Molmo 72B

Allen Institute (AI2) · 2024-09-25

datacenter-tier open VLM for agent UI

Verdict · Multimodal

Qwen 2.5 Math 72B

Alibaba · 2024-09-19

datacenter-tier math specialist

Qwen 2.5 72B Instruct

Alibaba · 2024-09-19

production multilingual at 70B-class

DeepSeek V2.5 236B

DeepSeek · 2024-09-05

236B/21B-A · datacenter

DeepSeek lineage reference — pre-V3

Command R+ (Aug 2024)

Cohere · 2024-08-30

research / non-commercial RAG workflows

Command R+ 104B

Cohere · 2024-08-30

datacenter RAG-tuned at 100B class

Hermes 3 Llama 3.1 70B

Nous Research · 2024-08-15

datacenter-tier Hermes — instruction following

Mistral Large 2 (123B)

Mistral AI · 2024-07-24

datacenter dense Mistral flagship — pre-Medium-3.5

Llama 3.1 70B Instruct

Meta · 2024-07-23

production self-hosted serving at the 70B class

DeepSeek Coder V2 236B

DeepSeek · 2024-06-17

236B/21B-A · datacenter

datacenter-tier MoE coding

OpenBioLLM Llama 3 70B

Saama Technologies · 2024-04-26

medical / clinical NLP

Mixtral 8x22B Instruct

Mistral AI · 2024-04-17

datacenter MoE — 39B active, 141B total

WizardLM-2 8x22B

Microsoft (WizardLM team) · 2024-04-15

Mixtral 8x22B fine-tune — reasoning-tuned

DBRX Instruct

Databricks · 2024-03-27

132B/36B-A · datacenter

Databricks-native enterprise inference

DBRX Base

Databricks · 2024-03-27

132B/36B-A · datacenter

MoE fine-tuning base

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.