Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filter

Family

Any Qwen Llama DeepSeek Mistral Gemma Phi GLM OLMo

Deployment

Any Edge Consumer Workstation Datacenter Frontier

Modality

Any Multimodal Text-only

Coverage

Any L1.25 enriched Needs L1.25 Needs benchmark

Filtered results (22)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

Qwen 3.6 35B-A3B (MTP)

Alibaba / Qwen team · 2026-05-11

35B/3B-Aworkstation

high-throughput MoE inference at workstation tier

Qwen 3.6 27B (MTP)

Alibaba / Qwen team · 2026-05-11

dense workstation model with throughput-acceleration

Qwen 3 Coder 32B

Alibaba · 2025-11-20

coding-specialized agent workloads

Qwen 3 7B

Alibaba · 2025-09-15

consumer-tier reasoning on 8GB+ GPUs

Qwen 3 Embedding 8B

Alibaba · 2025-06-05

permissively-licensed embeddings at 8B

Qwen 3 30B-A3B

Alibaba · 2025-04-29

workstation MoE — 3B active, 30B total

Qwen 3 8B

Alibaba · 2025-04-29

consumer-tier reasoning toggle

Qwen 2.5-VL 72B

Alibaba · 2025-03-10

frontier-tier multimodal serving

VerdictMultimodal

Qwen 2.5-VL 7B

Alibaba · 2025-03-10

consumer-tier OCR + image Q&A

VerdictMultimodal

QwQ 32B Preview

Alibaba · 2024-11-27

workstation-tier reasoning — Qwen team alternative to R1

Qwen 2.5 Coder 14B Instruct

Alibaba · 2024-11-12

16GB-VRAM coding

BenchmarkVerdict

Qwen 2.5 Coder 7B Instruct

Alibaba · 2024-11-12

consumer-tier coding at 8GB VRAM

Qwen 2.5 Math 72B

Alibaba · 2024-09-19

datacenter-tier math specialist

Qwen 2.5 72B Instruct

Alibaba · 2024-09-19

production multilingual at 70B-class

Qwen 2.5 32B Instruct

Alibaba · 2024-09-19

workstation-tier multilingual general chat

Qwen 2.5 14B Instruct

Alibaba · 2024-09-19

16GB-VRAM general chat with multilingual depth

Qwen 2.5 Math 7B

Alibaba · 2024-09-19

consumer-tier math problem solving

Qwen 2.5 7B Instruct

Alibaba · 2024-09-19

consumer-tier multilingual chat

BenchmarkVerdict

Qwen 2-VL 7B

Alibaba · 2024-08-29

consumer-tier multimodal — pre-2.5-VL baseline

VerdictMultimodal

CodeQwen 1.5 7B

Alibaba · 2024-04-16

historical reference — Qwen 2.5 Coder 7B is the modern pick

Qwen3 Swallow 32B RL v0.2

Japanese-English reasoning tasks: math, coding, structured analysis

Qwen3.5 9B Thai Law Base

Foundation for fine-tuning Thai legal NLP tools

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.