->Will it run?Best GPU Compare Troubleshoot Start Learn Pulse Models Hardware Tools Bench

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo

DIR

Models
Hardware
Tools
Benchmarks

TOOLS

Will it run?
Compare hardware
Cost vs cloud
Choose my GPU
Prompting kits
Quick answers

REF

All buyer guides
Learn local AI
Methodology
Glossary
Errors KB
Trust

EDITOR

About
Author
How we make money
Editorial policy
Contact

LEGAL

Privacy
Terms
Sitemap

MAIL · MONTHLY DIGEST

Get monthly local AI changes

Monthly recap. No spam.

Email address

DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →

© 2026 runlocalai.coIndependently operated

RUNLOCALAI · v38

>
Home
Frontier
Models

Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filter

Family

Any Qwen Llama DeepSeek Mistral Gemma Phi GLM OLMo

Deployment

Any Edge Consumer Workstation Datacenter Frontier

Modality

Any Multimodal Text-only

Coverage

Any L1.25 enriched Needs L1.25 Needs benchmark

Filtered results (27)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

Phi-4 Reasoning Mini 4B

Microsoft · 2026-04-08

edge-tier reasoning

Phi-4 Mini 4B

Microsoft · 2026-02-25

edge / embedded reasoning

SmolLM 3 3B

HuggingFace · 2025-11-04

edge-tier reasoning

Qwen 3 4B

Alibaba · 2025-04-29

edge-tier Qwen 3 — Apple Silicon laptop friendly

BenchmarkVerdict

Gemma 3 1B

Google · 2025-03-12

phone-tier Gemma — smallest practical Gemma 3

BenchmarkVerdict

RWKV 7 'Goose' 1.5B

RWKV community · 2025-02-15

long-context edge inference where memory matters more than quality

DeepSeek R1 Distill Qwen 1.5B

DeepSeek AI · 2025-01-20

edge-tier reasoning

Dolphin 3.0 Llama 3.2 3B

Cognitive Computations · 2024-12-15

creative / less-restricted generation at edge tier

EXAONE 3.5 2.4B

LG AI Research · 2024-12-09

edge-tier Korean chat

Qwen 2.5 Coder 3B

Alibaba · 2024-11-12

Apple Silicon laptop coding autocomplete

Qwen 2.5 Coder 1.5B

Alibaba · 2024-11-12

IDE autocomplete on integrated GPUs

SmolLM 2 1.7B Instruct

Hugging Face · 2024-11-01

edge-tier Apache 2.0 baseline

SmolLM 2 360M Instruct

Hugging Face · 2024-11-01

phone / Pi-class chat

Granite 3.0 2B Instruct

IBM · 2024-10-21

edge-tier IBM Granite

Ministral 3B Instruct

Mistral AI · 2024-10-16

edge-tier long-context — research only

Hermes 3 Llama 3.2 3B

Nous Research · 2024-10-15

edge-tier instruction following

Llama 3.2 3B Instruct

Meta · 2024-09-25

battery-powered laptop chat tier

Llama 3.2 1B Instruct

Meta · 2024-09-25

edge / phone-tier chat — smallest practical Llama

BenchmarkVerdict

Qwen 2.5 3B Instruct

Alibaba · 2024-09-19

edge-tier Qwen 2.5 chat

Qwen 2.5 1.5B Instruct

Alibaba · 2024-09-19

edge-tier Apache 2.0 chat

Qwen 2.5 0.5B Instruct

Alibaba · 2024-09-19

phone-tier Qwen baseline

Nemotron Mini 4B Instruct

NVIDIA · 2024-09-13

edge-tier role-play / chat

MiniCPM 3 4B

OpenBMB · 2024-09-12

phone / embedded inference

Phi-3.5 Mini Instruct

Microsoft · 2024-08-20

edge-tier Phi

BenchmarkVerdict

BGE Reranker v2 M3

BAAI · 2024-04-15

RAG reranker

StarCoder 2 3B

BigCode · 2024-02-28

edge-tier code completion

BGE M3

BAAI · 2024-01-30

multilingual RAG embeddings

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.