
Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo

DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts: there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend.

© 2026 runlocalai.co · Independently operated

RUNLOCALAI · v38

Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed

Filtered results (22)

Qwen 3.6 35B-A3B (MTP)

Alibaba / Qwen team · 2026-05-11

35B/3B-A · workstation

high-throughput MoE inference at workstation tier

Qwen 3.6 27B (MTP)

Alibaba / Qwen team · 2026-05-11

dense workstation model with throughput-acceleration

OLMo 2 32B

AI2 (Allen AI) · 2026-04-12

fully-open AI2 OLMo 2 — research provenance flagship

Gemma 4 26B MoE

Google · 2026-04-02

Gemma 4 MoE — workstation efficiency variant

Verdict · Multimodal

DeepSeek Coder V3

DeepSeek AI · 2026-02-08

workstation coding alternative to Qwen 2.5 Coder

Nemotron 3 Super 49B

NVIDIA · 2026-01-22

32GB-VRAM enterprise deployments

Magistral 32B

Mistral AI · 2025-12-15

research / non-commercial reasoning at 32B scale

Qwen 3 Coder 32B

Alibaba · 2025-11-20

coding-specialized agent workloads

DeepSeek R1 Distill Qwen 3 32B

DeepSeek AI · 2025-11-15

workstation reasoning with Qwen 3 base improvements

EXAONE 3.5 32B

LG AI Research · 2025-11-10

Korean / Japanese / CJK workloads

MedGemma 27B

Google · 2025-05-20

medical-domain fine-tune of Gemma 3 27B

Verdict · Multimodal

Qwen 3 30B-A3B

Alibaba · 2025-04-29

workstation MoE — 3B active, 30B total

QwQ 32B Preview

Alibaba · 2024-11-27

workstation-tier reasoning — Qwen team alternative to R1

Aya Expanse 32B

Cohere · 2024-10-22

research / non-commercial multilingual workflows

Qwen 2.5 32B Instruct

Alibaba · 2024-09-19

workstation-tier multilingual general chat

Jamba 1.5 Mini

AI21 Labs · 2024-08-22

52B/12B-A · workstation

workstation long-context with hybrid SSM throughput

Codestral 22B

Mistral AI · 2024-05-29

workstation coding at 22B class

Aya 23 35B

Cohere For AI · 2024-05-23

multilingual research at workstation tier

Yi 1.5 34B

01.AI · 2024-05-12

workstation-tier multilingual

Command R 35B

Cohere · 2024-03-11

workstation-tier RAG-tuned

Mixtral 8x7B Instruct

Mistral AI · 2023-12-11

workstation MoE — 13B active, 47B total

Phind CodeLlama 34B v2

Phind · 2023-09-01

historical reference for Llama 2 coder lineage
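
Several cards above quote two parameter counts for mixture-of-experts models, total and active (for example "3B active, 30B total"). As a rough illustration of how to read those pairs, the sketch below computes the share of parameters that is active per token, using only the figures printed on the cards. The dictionary name `MOE_CARDS` and the helper `active_fraction` are hypothetical names chosen for this example, not part of the catalog. The ratio is only a hint at per-token compute; it says nothing about memory, since the full set of weights generally still has to be loaded.

```python
# Total and active parameter counts, in billions, as quoted on the cards above.
# MOE_CARDS and active_fraction are illustrative names, not a catalog API.
MOE_CARDS = {
    "Qwen 3.6 35B-A3B (MTP)": (35, 3),
    "Qwen 3 30B-A3B": (30, 3),
    "Jamba 1.5 Mini": (52, 12),
    "Mixtral 8x7B Instruct": (47, 13),
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters that is active per token (0 to 1)."""
    if total_b <= 0 or not 0 < active_b <= total_b:
        raise ValueError("expected 0 < active_b <= total_b")
    return active_b / total_b


if __name__ == "__main__":
    for name, (total_b, active_b) in MOE_CARDS.items():
        share = active_fraction(total_b, active_b)
        print(f"{name}: {active_b}B of {total_b}B active ({share:.1%})")
```

Dense models in the list, such as Qwen 3.6 27B, have no separate active count on their cards, so they are left out of the table rather than assigned a guessed value.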

Going deeper

Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
Execution stacks — recipes that combine models with runtimes + hardware.
Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.