RUNLOCALAI · v38

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

© 2026 runlocalai.co · Independently operated
Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filtered results (20)

Gemma 4 31B Dense

Google · 2026-04-02
31B · workstation

workstation-tier multilingual chat with permissive license

L1.25 enriched · Verdict · Multimodal

Gemma 4 26B MoE

Google · 2026-04-02
26B · workstation

Gemma 4 MoE — workstation efficiency variant

Verdict · Multimodal

Gemma 4 E4B (Effective 4B)

Google · 2026-04-02
4B · edge

edge-tier Gemma 4 — laptop friendly

Benchmark · Verdict · Multimodal

Gemma 4 E2B (Effective 2B)

Google · 2026-04-02
2B · edge

phone-tier Gemma 4

Benchmark · Verdict · Multimodal

MedGemma 27B

Google · 2025-05-20
27B · workstation

medical-domain fine-tune of Gemma 3 27B

Verdict · Multimodal

Gemma 3 27B

Google · 2025-03-12
27B · workstation

Google's open-weight workstation-tier multilingual flagship — pre-Gemma-4 baseline

L1.25 enriched · Verdict · Multimodal

Gemma 3 12B

Google · 2025-03-12
12B · consumer

consumer-tier multilingual chat with vision support in 'it' variant

Benchmark · Verdict · Multimodal

Gemma 3 4B

Google · 2025-03-12
4B · edge

edge-tier chat — Apple Silicon laptop friendly

Benchmark · Verdict · Multimodal

Gemma 3 1B

Google · 2025-03-12
1B · edge

phone-tier Gemma — smallest practical Gemma 3

Benchmark · Verdict

PaliGemma 2 10B

Google · 2024-12-05
10B · consumer

VLM fine-tuning at 24GB VRAM

Verdict · Multimodal

PaliGemma 2 3B

Google · 2024-12-05
3B · consumer

task-specific VLM fine-tuning base

Verdict · Multimodal

Gemma 2 9B Instruct

Google · 2024-06-27
9B · consumer

consumer-tier Gemma — pre-Gemma-3 baseline

Benchmark · Verdict

CodeGemma 7B

Google · 2024-04-09
7B · consumer

Gemma-derived coding model

Benchmark · Verdict

Gemma 4 Turkish 26B (4B active)

esokullu
26B

Trendyol LLM Asure 12B

Trendyol
11.8B · consumer

Turkish business workflow assistants

Benchmark · Multimodal

YTU Turkish Gemma 9B v0.1

ytu-ce-cosmos
9.2B · consumer

Turkish instruction following on 16GB GPUs

Benchmark

Turkish Gemma 9B T1

ytu-ce-cosmos
9B

ColPali v1.3

ColPali team (Illuin Technology)
3B

Visual-document retrieval for multi-page PDFs with charts, tables, and scans where OCR pipelines fail

L1.25 enriched · Verdict

Gemma 2 2B Instruct

Google
2B

Consumer-GPU local chat with strong safety defaults

L1.25 enriched · Verdict

Gemma 3 270M

Google
0.27B

Fine-tuning base for sub-1W on-device classifiers and routers

L1.25 enriched · Verdict

Going deeper

  • Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
  • Execution stacks — recipes that combine models with runtimes + hardware.
  • Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
  • Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.