RUNLOCALAI · v38

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo

© 2026 runlocalai.co · Independently operated
Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed
Filter
Family: Any · Qwen · Llama · DeepSeek · Mistral · Gemma · Phi · GLM · OLMo
Deployment: Any · Edge · Consumer · Workstation · Datacenter · Frontier
Modality: Any · Multimodal · Text-only
Coverage: Any · L1.25 enriched · Needs L1.25 · Needs benchmark

Filtered results (31)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.
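
As a rough illustration of how the filter rows narrow the list, here is a minimal Python sketch. The `ModelCard` type, its field names, and the `filter_cards` helper are hypothetical and are not the site's actual implementation; the three sample entries are copied from cards on this page, and assigning them to the "Llama" family is an assumption based on their names. Passing `None` for a filter plays the role of "Any".

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ModelCard:
    """Hypothetical record for one catalog card (field names are assumptions)."""
    name: str
    family: str
    size_b: float
    deployment: Optional[str] = None   # None when the card shows no tier
    multimodal: bool = False
    badges: frozenset = frozenset()


def matches(card: ModelCard,
            family: Optional[str] = None,
            deployment: Optional[str] = None,
            multimodal: Optional[bool] = None) -> bool:
    """Return True when the card passes every filter that is not None ("Any")."""
    if family is not None and card.family != family:
        return False
    if deployment is not None and card.deployment != deployment:
        return False
    if multimodal is not None and card.multimodal != multimodal:
        return False
    return True


def filter_cards(cards: List[ModelCard], **filters) -> List[ModelCard]:
    """Keep the cards that satisfy all active filters, preserving order."""
    return [card for card in cards if matches(card, **filters)]


# Sample entries taken from cards listed on this page.
CARDS = [
    ModelCard("Llama 3.2 11B Vision Instruct", "Llama", 11, "consumer",
              True, frozenset({"Benchmark", "Verdict"})),
    ModelCard("Llama 3.1 8B Instruct", "Llama", 8, "consumer",
              False, frozenset({"Benchmark", "Verdict"})),
    ModelCard("Llama 3.1 70B Instruct", "Llama", 70, "datacenter",
              False, frozenset({"Verdict"})),
]

if __name__ == "__main__":
    for card in filter_cards(CARDS, deployment="consumer", multimodal=True):
        print(card.name)
```

Calling `filter_cards(CARDS)` with no arguments leaves every filter at "Any" and returns the full list.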

Llama 4 70B

Meta · 2026-02-10
70B · datacenter

production self-hosted serving on 2x A100 / H100

Verdict

EVA Llama 3.3 70B

EVA-Unit-01 community · 2025-08-22
70B · datacenter

datacenter-tier creative / narrative generation

Verdict

Llama 3.3 8B Instruct

Meta · 2025-04-12
8B · consumer

consumer-tier chat — drop-in 3.1 8B replacement

Verdict

Llama 3.1 Nemotron Nano 8B

NVIDIA · 2025-04-08
8B · consumer

consumer-tier Nemotron-Llama

Verdict

Llama 3.1 Nemotron 70B Instruct

NVIDIA · 2024-10-15
70B · datacenter

NVIDIA-fine-tuned Llama 3.1 70B

Verdict

Llama 3.2 90B Vision

Meta · 2024-09-25
90B · datacenter

datacenter-tier multimodal serving

Verdict · Multimodal

Llama 3.2 90B Vision Instruct

Meta · 2024-09-25
90B · datacenter

datacenter vision-language Llama at 70B-class

Verdict · Multimodal

Llama 3.2 11B Vision Instruct

Meta · 2024-09-25
11B · consumer

consumer-tier vision-language Llama

Benchmark · Verdict · Multimodal

Llama 3.1 70B Instruct

Meta · 2024-07-23
70B · datacenter

production self-hosted serving at the 70B class

Verdict

Llama 3.1 8B Instruct

Meta · 2024-07-23
8B · consumer

consumer-tier general chat — the default 8B baseline

Benchmark · Verdict

Phind CodeLlama 34B v2

Phind · 2023-09-01
34B · workstation

historical reference for Llama 2 coder lineage

Verdict

Hermes 4 70B FP8

NousResearch
70B

English/multilingual STEM reasoning and structured data extraction

Verdict

ALIA 40b instruct 2601

BSC-LT
40B

Spanish-region enterprise document processing and multilingual Iberian assistant apps

Verdict

OpenThaiGPT 1.0.0 Beta 13B Chat

OpenThaiGPT
13B

Basic Thai-language instruction following and Q&A

Verdict

Bielik-11B v3.0 Instruct FP8 Dynamic

speakleash
11B

Polish-language instruction following and chat on constrained GPU hardware

Verdict

Bielik 11B v3.0 Instruct GGUF

SpeakLeash
11B

Polish-language instruction following and document Q&A

Verdict

SOLAR 10.7B v1.0

upstage
10.7B

Foundation for custom fine-tuning pipelines

Verdict

Saiga Llama3 8B GGUF

IlyaGusev
8B

Russian conversational assistant on consumer hardware

Verdict

Turkish Llama 8B Instruct v0.1

ytu-ce-cosmos
8B

Cosmos Llama 3 8B Turkish

ytu-ce-cosmos
8B

LLM-jp 4 8B Instruct

llm-jp
8B

Japanese/English bilingual document summarization and extraction

Verdict

LLM-jp 4 8B Thinking

llm-jp
8B

Internal Japanese-English reasoning and document processing pipelines

Verdict

RefinedNeuro RN TR R1

RefinedNeuro
8B · consumer

compact local reasoning baseline

Benchmark

RefinedNeuro RN TR R2

RefinedNeuro
8B · consumer

compact local reasoning baseline

Benchmark

Gervásio 8B PTPT

PORTULAN
8B

European Portuguese text tasks where occasional Brazilian Portuguese (PT-BR) output is acceptable

Verdict

Swallow 7B

tokyotech-llm
7B

Japanese-language fine-tuning base or research

Verdict

Salamandra 7B Instruct

BSC-LT
7B

Spanish and European multilingual chat prototyping

Verdict

Trendyol LLM 7B Chat v0.1

Trendyol
7B

OpenThaiGPT 7B 1.0.0 Chat

openthaigpt
7B

Thai-language instruction-following and chat

Verdict

Trendyol LLM 7B Base v0.1

Trendyol
7B

Salamandra 7B

BSC-LT
7B

Spanish/Catalan fine-tuning base for custom NLP pipelines

Verdict
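
For a sense of why a 70B card such as Llama 4 70B is tagged for serving on 2x A100 / H100, here is a back-of-the-envelope sketch of weight memory. It is an estimate under stated assumptions, not the catalog's sizing method: the cards do not state a precision or a GPU memory size, so the 2 bytes per parameter (FP16/BF16) and 80 GB per GPU figures are assumptions, and the function names are hypothetical. The estimate counts weights only and ignores KV cache, activations, and runtime overhead.

```python
def weight_memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes).

    params_billion * 1e9 parameters * bytes_per_param / 1e9 bytes per GB
    simplifies to params_billion * bytes_per_param.
    """
    return params_billion * bytes_per_param


def weights_fit(params_billion: float, bytes_per_param: float,
                gpu_count: int, gpu_memory_gb: float) -> bool:
    """True when the weights alone fit in the combined GPU memory.

    Weights only: KV cache, activations, and runtime overhead are NOT included,
    so a True result is necessary but not sufficient for real serving.
    """
    return weight_memory_gb(params_billion, bytes_per_param) <= gpu_count * gpu_memory_gb


if __name__ == "__main__":
    # Assumed: 70B parameters at 2 bytes each (FP16/BF16), 80 GB per GPU.
    print(weight_memory_gb(70, 2))      # 140 GB of weights
    print(weights_fit(70, 2, 1, 80))    # one 80 GB GPU
    print(weights_fit(70, 2, 2, 80))    # two 80 GB GPUs
```

Under these assumptions the weights come to about 140 GB, which exceeds a single 80 GB GPU but fits in the 160 GB of a pair, leaving roughly 20 GB for everything else.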

Going deeper

  • Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
  • Execution stacks — recipes that combine models with runtimes + hardware.
  • Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
  • Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.