RUNLOCALAI · v38

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts: there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend.

© 2026 runlocalai.co · Independently operated
Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed
Filtered results (48)

Mistral Medium 3 24B (dense)

Mistral AI · 2026-04-29
24B · consumer

research / non-commercial workstation deployments

Verdict

Granite 3.3 8B

IBM · 2026-03-12
8B · consumer

enterprise tool-calling on IBM stacks

Verdict

Mistral Small 3.2 24B

Mistral AI · 2026-03-08
24B · consumer

consumer-tier multilingual instruction-following

Verdict

Phi-4 Multimodal

Microsoft · 2026-02-25
14B · consumer

16GB-consumer multimodal Q&A

Verdict · Multimodal

Nemotron 3 Nano 9B

NVIDIA · 2026-01-22
9B · consumer

NVIDIA-stack tool-calling agents

Verdict

Nemotron 3 Nano (30B-A3B)

NVIDIA · 2026-01-15
30B · consumer

NVIDIA-tuned consumer-tier general

Verdict

DeepSeek V3 Lite (16B MoE)

DeepSeek AI · 2026-01-10
16B / 2.4B active · consumer

consumer-tier MoE inference

Verdict

EXAONE 3.5 8B

LG AI Research · 2025-11-10
7.8B · consumer

consumer-tier Korean workloads

Verdict

InternLM 3 8B

Shanghai AI Lab · 2025-10-05
8B · consumer

Chinese-language consumer workloads

Verdict

Devstral Small 2 24B

Mistral AI · 2025-09-25
24B · consumer

Apache 2.0 coding alternative to Qwen 2.5 Coder

Verdict

Yi Coder 9B

01.AI · 2025-09-20
9B · consumer

8GB-VRAM coding

Verdict

Qwen 3 7B

Alibaba · 2025-09-15
7B · consumer

consumer-tier reasoning on 8GB+ GPUs

Verdict

MiniCPM-V 3 8B

OpenBMB · 2025-08-14
8B · consumer

consumer multimodal document Q&A

Verdict · Multimodal

Qwen 3 Embedding 8B

Alibaba · 2025-06-05
8B · consumer

permissively-licensed embeddings at 8B

Verdict

Qwen 3 8B

Alibaba · 2025-04-29
8B · consumer

consumer-tier reasoning toggle

Verdict

Granite 3 MoE (3B active)

IBM · 2025-04-15
16B / 3B active · consumer

consumer-tier enterprise MoE

Verdict

Llama 3.3 8B Instruct

Meta · 2025-04-12
8B · consumer

consumer-tier chat — drop-in 3.1 8B replacement

Verdict

Llama 3.1 Nemotron Nano 8B

NVIDIA · 2025-04-08
8B · consumer

consumer-tier Nemotron-Llama

Verdict

DeepSeek R1 Distill Mistral 24B

Community (DeepSeek-derived) · 2025-03-18
24B · consumer

consumer-tier reasoning with Mistral instruction lineage

Verdict

Qwen 2.5-VL 7B

Alibaba · 2025-03-10
7B · consumer

consumer-tier OCR + image Q&A

Verdict · Multimodal

Granite 3.2 8B

IBM · 2025-02-25
8B · consumer

enterprise tool-calling on IBM stacks

Verdict

Mistral Saba 24B

Mistral AI · 2025-02-17
24B · consumer

Arabic / South-Asian multilingual

Verdict

Mistral Small 3 24B

Mistral AI · 2025-01-30
24B · consumer

consumer-tier multilingual instruction-following — Mistral's instruction-tuned baseline at 24B

L1.25 enriched · Verdict

Dolphin 3.0 Mistral 24B

Cognitive Computations · 2025-01-30
24B · consumer

consumer-tier creative / less-restricted generation

Verdict

Janus-Pro 7B

DeepSeek AI · 2025-01-29
7B · consumer

consumer multimodal with image-generation

Verdict · Multimodal

DeepSeek R1 Distill Qwen 14B

DeepSeek AI · 2025-01-20
14B · consumer

consumer-tier reasoning at 14B

Verdict

DeepSeek R1 Distill Llama 8B

DeepSeek AI · 2025-01-20
8B · consumer

consumer-tier reasoning on 8GB+ GPUs

Verdict

Falcon 3 10B

TII (Abu Dhabi) · 2024-12-17
10B · consumer

Arabic-language workloads

Verdict

Falcon 3 7B Instruct

TII (Abu Dhabi) · 2024-12-17
7B · consumer

consumer-tier multilingual

Verdict

Phi-4 14B

Microsoft · 2024-12-12
14B · consumer

16 GB VRAM tier reasoning + chat — the right pick when 32B-class doesn't fit

L1.25 enriched · Verdict

InternVL 2.5 26B

OpenGVLab · 2024-12-05
26B · consumer

permissively-licensed VLM at 24GB VRAM

Verdict · Multimodal

PaliGemma 2 10B

Google · 2024-12-05
10B · consumer

VLM fine-tuning at 24GB VRAM

Verdict · Multimodal

OLMo 2 13B

Allen Institute (AI2) · 2024-11-26
13B · consumer

reproducibility / academic research

Verdict

Tulu 3 8B

Allen Institute (AI2) · 2024-11-21
8B · consumer

fully-open instruction-following research baseline

Verdict

Qwen 2.5 Coder 7B Instruct

Alibaba · 2024-11-12
7B · consumer

consumer-tier coding at 8GB VRAM

Verdict

OpenCoder 8B

INFLY AI · 2024-11-09
8B · consumer

academic / reproducibility-sensitive coding research

Verdict

Baichuan 4 13B

Baichuan AI · 2024-10-30
13B · consumer

Chinese-language consumer workloads — alternative to GLM

Verdict

Granite 3.0 8B Instruct

IBM · 2024-10-21
8B · consumer

enterprise-friendly Apache 2.0 baseline

Verdict

Ministral 8B Instruct

Mistral AI · 2024-10-16
8B · consumer

consumer-tier long-context — research only

Verdict

Llama 3.2 11B Vision

Meta · 2024-09-25
11B · consumer

consumer-tier multimodal — Llama-ecosystem migration path for vision workflows

L1.25 enriched · Verdict · Multimodal

Molmo 7B-D

Allen Institute (AI2) · 2024-09-25
8B · consumer

open-research VLM with UI grounding

Verdict · Multimodal

Qwen 2.5 14B Instruct

Alibaba · 2024-09-19
14B · consumer

16GB-VRAM general chat with multilingual depth

Verdict

Qwen 2.5 Math 7B

Alibaba · 2024-09-19
7B · consumer

consumer-tier math problem solving

Verdict

Pixtral 12B

Mistral AI · 2024-09-17
12B · consumer

consumer-tier vision-language Mistral

Verdict · Multimodal

NV-Embed v2

NVIDIA · 2024-09-09
7.85B · consumer

research-grade embeddings

Verdict

MiniCPM-V 2.6 8B

OpenBMB · 2024-08-30
8B · consumer

consumer multimodal document Q&A

Verdict · Multimodal

Qwen 2-VL 7B

Alibaba · 2024-08-29
7B · consumer

consumer-tier multimodal — pre-2.5-VL baseline

Verdict · Multimodal

Falcon Mamba 7B

TII (Abu Dhabi) · 2024-08-12
7B · consumer

long-context inference where memory matters

Verdict
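
Reading the VRAM hints

Several cards above name a VRAM tier (8GB, 16GB, 24GB) alongside a parameter count. As a rough way to relate the two, the sketch below estimates the memory taken by the weights alone at a given bit width. It is a back-of-envelope calculation, not the method behind the catalog's verdicts: the function name weight_gib and the chosen bit widths are illustrative assumptions, and the estimate leaves out KV cache, activations, and runtime overhead, so real requirements are higher.

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GiB.

    Ignores KV cache, activations, and runtime overhead, so treat
    the result as a lower bound rather than a fit guarantee.
    """
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Parameter counts (in billions) copied from three cards in the list above.
cards = {"Yi Coder 9B": 9, "Phi-4 14B": 14, "InternVL 2.5 26B": 26}

for name, params in cards.items():
    estimates = ", ".join(
        f"{bits}-bit ~ {weight_gib(params, bits):.1f} GiB" for bits in (4, 8, 16)
    )
    print(f"{name}: {estimates}")
```

The gap between the 4-bit and 16-bit figures is why the same parameter count can land in different VRAM tiers depending on quantization.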

Going deeper

  • Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
  • Execution stacks — recipes that combine models with runtimes + hardware.
  • Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
  • Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.