The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI: recent additions, rising families, distill chains, and multimodal and reasoning waves. Each card links into the catalog and carries authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filtered results (11)

DeepSeek Coder V3

DeepSeek AI · 2026-02-08

33B · workstation

workstation coding alternative to Qwen 2.5 Coder

Verdict

DeepSeek V3 Lite (16B MoE)

DeepSeek AI · 2026-01-10

16B/2.4B-A · consumer

consumer-tier MoE inference

Verdict

DeepSeek R1 Distill Qwen 3 32B

DeepSeek AI · 2025-11-15

32B · workstation

workstation reasoning with Qwen 3 base improvements

Verdict

DeepSeek R1 Distill Mistral 24B

Community (DeepSeek-derived) · 2025-03-18

24B · consumer

consumer-tier reasoning with Mistral instruction lineage

Verdict

DeepSeek R1 Distill Llama 70B

DeepSeek · 2025-01-20

70B · datacenter

datacenter-tier reasoning

Verdict

DeepSeek R1 Distill Qwen 14B

DeepSeek · 2025-01-20

14B · consumer

consumer-tier reasoning at 14B

Verdict

DeepSeek R1 Distill Llama 8B

DeepSeek AI · 2025-01-20

8B · consumer

consumer-tier reasoning on 8GB+ GPUs

Verdict

DeepSeek R1 Distill Qwen 7B

DeepSeek · 2025-01-20

7B · consumer

consumer-tier reasoning at 7B

Benchmark · Verdict

DeepSeek Coder V2 Lite (16B)

DeepSeek · 2024-06-17

16B · consumer

consumer-tier coding MoE

Benchmark · Verdict

DeepSeek MoE 16B Base

DeepSeek AI · 2024-01-15

16B/2.4B-A · consumer

research / lineage reference

Verdict

DeepSeek V2 Lite Chat

DeepSeek

15.7B/2.4B-A

Workstation chat where MoE active-param efficiency matters more than total VRAM

Verdict
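
The size labels on these cards appear to follow a pattern: a plain label such as 33B gives total parameters, while a label such as 16B/2.4B-A seems to pair total parameters with the parameters active per token in an MoE model. As a minimal sketch under that assumption, the snippet below parses a label and estimates weight-only memory as parameters × bits per weight ÷ 8. The names parse_size_label and weight_gb are hypothetical, this is not RunLocalAI's own method, and the estimate ignores KV cache and runtime overhead, so real VRAM needs will be higher.

```python
import re
from typing import NamedTuple, Optional


class SizeLabel(NamedTuple):
    total_b: float             # total parameters, in billions
    active_b: Optional[float]  # active parameters per token (MoE), or None


# Assumed label grammar: "<total>B" or "<total>B/<active>B-A".
_LABEL = re.compile(r"^(\d+(?:\.\d+)?)B(?:/(\d+(?:\.\d+)?)B-A)?$")


def parse_size_label(label: str) -> SizeLabel:
    """Split a card size label into total and (optional) active parameters."""
    match = _LABEL.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized size label: {label!r}")
    total = float(match.group(1))
    active = float(match.group(2)) if match.group(2) else None
    return SizeLabel(total, active)


def weight_gb(params_b: float, bits_per_weight: float) -> float:
    """Weight-only memory in decimal GB: params * bits / 8 bytes."""
    return params_b * 1e9 * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for label in ("33B", "16B/2.4B-A", "15.7B/2.4B-A"):
        size = parse_size_label(label)
        # Memory is sized on total parameters, not the active subset.
        print(label, size, f"{weight_gb(size.total_b, 4):.2f} GB at 4-bit")
```

For an MoE label the sketch sizes memory on total parameters, because all experts typically have to be resident even though only the active subset is computed per token. The active figure is more relevant to per-token compute than to memory footprint.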

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.