->Will it run?Best GPU Compare Troubleshoot Start Learn Pulse Models Hardware Tools Bench

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo

DIR

Models
Hardware
Tools
Benchmarks

TOOLS

Will it run?
Compare hardware
Cost vs cloud
Choose my GPU
Prompting kits
Quick answers

REF

All buyer guides
Learn local AI
Methodology
Glossary
Errors KB
Trust

EDITOR

About
Author
How we make money
Editorial policy
Contact

LEGAL

Privacy
Terms
Sitemap

MAIL · MONTHLY DIGEST

Get monthly local AI changes

Monthly recap. No spam.

Email address

DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →

© 2026 runlocalai.coIndependently operated

RUNLOCALAI · v38

>
Home
Frontier
Models

Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filter

Family

Any Qwen Llama DeepSeek Mistral Gemma Phi GLM OLMo

Deployment

Any Edge Consumer Workstation Datacenter Frontier

Modality

Any Multimodal Text-only

Coverage

Any L1.25 enriched Needs L1.25 Needs benchmark

Filtered results (36)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

Llama 4 Maverick

Meta · 2026-04-05

frontier-tier multimodal serving on multi-machine clusters

L1.25 enrichedVerdictMultimodal

Gemma 4 31B Dense

Google · 2026-04-02

workstation-tier multilingual chat with permissive license

L1.25 enrichedVerdictMultimodal

Gemma 4 26B MoE

Google · 2026-04-02

Gemma 4 MoE — workstation efficiency variant

VerdictMultimodal

Gemma 4 E4B (Effective 4B)

Google · 2026-04-02

edge-tier Gemma 4 — laptop friendly

BenchmarkVerdictMultimodal

Gemma 4 E2B (Effective 2B)

Google · 2026-04-02

phone-tier Gemma 4

BenchmarkVerdictMultimodal

Phi-4 Multimodal

Microsoft · 2026-02-25

16GB-consumer multimodal Q&A

VerdictMultimodal

MiniCPM-V 3 8B

OpenBMB · 2025-08-14

consumer multimodal document Q&A

VerdictMultimodal

MedGemma 27B

Google · 2025-05-20

medical-domain fine-tune of Gemma 3 27B

VerdictMultimodal

Gemma 3 27B

Google · 2025-03-12

Google's open-weight workstation-tier multilingual flagship — pre-Gemma-4 baseline

L1.25 enrichedVerdictMultimodal

Gemma 3 12B

Google · 2025-03-12

consumer-tier multilingual chat with vision support in 'it' variant

BenchmarkVerdictMultimodal

Gemma 3 4B

Google · 2025-03-12

edge-tier chat — Apple Silicon laptop friendly

BenchmarkVerdictMultimodal

Qwen 2.5-VL 72B

Alibaba · 2025-03-10

frontier-tier multimodal serving

VerdictMultimodal

Qwen 2.5-VL 7B

Alibaba · 2025-03-10

consumer-tier OCR + image Q&A

VerdictMultimodal

Janus-Pro 7B

DeepSeek AI · 2025-01-29

consumer multimodal with image-generation

VerdictMultimodal

Qwen 2.5-VL 3B

Alibaba · 2025-01-26

edge-tier multimodal

VerdictMultimodal

InternVL 2.5 78B

OpenGVLab · 2024-12-05

datacenter-tier permissive VLM

VerdictMultimodal

InternVL 2.5 26B

OpenGVLab · 2024-12-05

permissively-licensed VLM at 24GB VRAM

VerdictMultimodal

PaliGemma 2 10B

Google · 2024-12-05

VLM fine-tuning at 24GB VRAM

VerdictMultimodal

PaliGemma 2 3B

Google · 2024-12-05

task-specific VLM fine-tuning base

VerdictMultimodal

Whisper Large v3 Turbo

OpenAI · 2024-10-01

real-time / batch transcription

VerdictMultimodal

Llama 3.2 90B Vision Instruct

Meta · 2024-09-25

datacenter vision-language Llama at 70B-class

VerdictMultimodal

Llama 3.2 90B Vision

Meta · 2024-09-25

datacenter-tier multimodal serving

VerdictMultimodal

Molmo 72B

Allen Institute (AI2) · 2024-09-25

datacenter-tier open VLM for agent UI

VerdictMultimodal

Llama 3.2 11B Vision

Meta · 2024-09-25

consumer-tier multimodal — Llama-ecosystem migration path for vision workflows

L1.25 enrichedVerdictMultimodal

Llama 3.2 11B Vision Instruct

Meta · 2024-09-25

consumer-tier vision-language Llama

BenchmarkVerdictMultimodal

Molmo 7B-D

Allen Institute (AI2) · 2024-09-25

open-research VLM with UI grounding

VerdictMultimodal

Pixtral 12B

Mistral AI · 2024-09-17

consumer-tier vision-language Mistral

VerdictMultimodal

MiniCPM-V 2.6 8B

OpenBMB · 2024-08-30

consumer multimodal document Q&A

VerdictMultimodal

Qwen 2-VL 7B

Alibaba · 2024-08-29

consumer-tier multimodal — pre-2.5-VL baseline

VerdictMultimodal

Phi-3.5 Vision

Microsoft · 2024-08-20

edge-tier vision-language Phi

VerdictMultimodal

LLaVA-OneVision 7B

LLaVA Team · 2024-08-06

permissively-licensed multi-image / video VLM

VerdictMultimodal

Moondream 2

vikhyat (community) · 2024-07-22

edge / phone-tier vision Q&A

VerdictMultimodal

GLM-4V 9B

Zhipu AI · 2024-06-04

Chinese document VLM

VerdictMultimodal

LLaVA 1.6 Mistral 7B

LLaVA Team · 2024-01-30

consumer-tier vision-language with permissive license

VerdictMultimodal

Whisper Large v3

OpenAI · 2023-11-06

open speech-to-text baseline

VerdictMultimodal

Trendyol LLM Asure 12B

Turkish business workflow assistants

BenchmarkMultimodal

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.