Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filter

Family

Any Qwen Llama DeepSeek Mistral Gemma Phi GLM OLMo

Deployment

Any Edge Consumer Workstation Datacenter Frontier

Modality

Any Multimodal Text-only

Coverage

Any L1.25 enriched Needs L1.25 Needs benchmark

Filtered results (9)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

Gemma 3 1B

Google · 2025-03-12

1Bedge

phone-tier Gemma — smallest practical Gemma 3

BenchmarkVerdict

Gemma 2 9B Instruct

Google · 2024-06-27

9Bconsumer

consumer-tier Gemma — pre-Gemma-3 baseline

BenchmarkVerdict

CodeGemma 7B

Google · 2024-04-09

7Bconsumer

Gemma-derived coding model

BenchmarkVerdict

Gemma 4 Turkish 26B (4B active)

esokullu

26B

YTU Turkish Gemma 9B v0.1

ytu-ce-cosmos

9.2Bconsumer

Turkish instruction following on 16GB GPUs

Benchmark

Turkish Gemma 9B T1

ytu-ce-cosmos

ColPali v1.3

ColPali team (Illuin Technology)

Visual-document retrieval for multi-page PDFs with charts, tables, and scans where OCR pipelines fail

L1.25 enrichedVerdict

Gemma 2 2B Instruct

Google

Consumer-GPU local chat with strong safety defaults

L1.25 enrichedVerdict

Gemma 3 270M

Google

0.27B

Fine-tuning base for sub-1W on-device classifiers and routers

L1.25 enrichedVerdict

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.