RUNLOCALAIv38
->Will it run?Best GPUCompareTroubleshootStartLearnPulseModelsHardwareToolsBench
Run check
RUNLOCALAI

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo
DIR
  • Models
  • Hardware
  • Tools
  • Benchmarks
TOOLS
  • Will it run?
  • Compare hardware
  • Cost vs cloud
  • Choose my GPU
  • Prompting kits
  • Quick answers
REF
  • All buyer guides
  • Learn local AI
  • Methodology
  • Glossary
  • Errors KB
  • Trust
EDITOR
  • About
  • Author
  • How we make money
  • Editorial policy
  • Contact
LEGAL
  • Privacy
  • Terms
  • Sitemap
MAIL · MONTHLY DIGEST
Get monthly local AI changes
Monthly recap. No spam.
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →

© 2026 runlocalai.coIndependently operated
RUNLOCALAI · v38
  1. >
  2. Home
  3. /Frontier
  4. /Models
Frontier zone · Model releases

The frontier of open-weight model releases

Open-weight model releases tracked by RunLocalAI — recent additions, rising families, distill chains, multimodal and reasoning waves. Each card links into the catalog with authority badges (L1.25 enriched · benchmark-backed · verdict) so you can scan editorial coverage at a glance.

By Fredoline Eruo · Refreshed continuously from catalog seed
Filter
Family
AnyQwenLlamaDeepSeekMistralGemmaPhiGLMOLMo
Deployment
AnyEdgeConsumerWorkstationDatacenterFrontier
Modality
AnyMultimodalText-only
Coverage
AnyL1.25 enrichedNeeds L1.25Needs benchmark

Filtered results (26)

Models matching your filters. Clear filters by clicking “Any” on each row above, or remove individual filters via the URL.

Llama 4 70B

Meta · 2026-02-10
70Bdatacenter

production self-hosted serving on 2x A100 / H100

Verdict

EVA Llama 3.3 70B

EVA-Unit-01 community · 2025-08-22
70Bdatacenter

datacenter-tier creative / narrative generation

Verdict

Llama 3.3 8B Instruct

Meta · 2025-04-12
8Bconsumer

consumer-tier chat — drop-in 3.1 8B replacement

Verdict

Llama 3.1 Nemotron Nano 8B

NVIDIA · 2025-04-08
8Bconsumer

consumer-tier Nemotron-Llama

Verdict

Llama 3.3 70B Instruct

Meta · 2024-12-06
70Bdatacenter

production self-hosted serving at the 70B class — when you need general-purpose capability above 32B but don't need frontier-tier

L1.25 enrichedVerdict

Llama 3.1 Nemotron 70B Instruct

NVIDIA · 2024-10-15
70Bdatacenter

NVIDIA-fine-tuned Llama 3.1 70B

Verdict

Llama 3.1 70B Instruct

Meta · 2024-07-23
70Bdatacenter

production self-hosted serving at the 70B class

Verdict

Phind CodeLlama 34B v2

Phind · 2023-09-01
34Bworkstation

historical reference for Llama 2 coder lineage

Verdict

Hermes 4 70B FP8

NousResearch
70B

English/multilingual STEM reasoning and structured data extraction

Verdict

ALIA 40b instruct 2601

BSC-LT
40B

Spanish-region enterprise document processing and multilingual Iberian assistant apps

Verdict

OpenThaiGPT 1.0.0 Beta 13B Chat

OpenThaiGPT
13B

Basic Thai-language instruction following and Q&A

Verdict

Bielik 11B v3.0 Instruct GGUF

SpeakLeash
11B

Polish-language instruction following and document Q&A

Verdict

Bielik-11B v3.0 Instruct FP8 Dynamic

speakleash
11B

Polish-language instruction following and chat on constrained GPU hardware

Verdict

SOLAR 10.7B v1.0

upstage
10.7B

Foundation for custom fine-tuning pipelines

Verdict

Cosmos Llama 3 8B Turkish

ytu-ce-cosmos
8B

Saiga Llama3 8B GGUF

IlyaGusev
8B

Russian conversational assistant on consumer hardware

Verdict

LLM-jp 4 8B Instruct

llm-jp
8B

Japanese/English bilingual document summarization and extraction

Verdict

Gervásio 8B PTPT

PORTULAN
8B

European Portuguese text tasks where PT-BR is acceptable collateral

Verdict

LLM-jp 4 8B Thinking

llm-jp
8B

Internal Japanese-English reasoning and document processing pipelines

Verdict

Turkish Llama 8B Instruct v0.1

ytu-ce-cosmos
8B

Trendyol LLM 7B Base v0.1

Trendyol
7B

Trendyol LLM 7B Chat v0.1

Trendyol
7B

Salamandra 7B Instruct

BSC-LT
7B

Spanish and European multilingual chat prototyping

Verdict

Swallow 7B

tokyotech-llm
7B

Japanese-language fine-tuning base or research

Verdict

Salamandra 7B

BSC-LT
7B

Spanish/Catalan fine-tuning base for custom NLP pipelines

Verdict

OpenThaiGPT 7B 1.0.0 Chat

openthaigpt
7B

Thai-language instruction-following and chat

Verdict

Going deeper

  • Ecosystem maps — structured-landscape views (memory frameworks, inference runtimes, MCP, coding agents).
  • Execution stacks — recipes that combine models with runtimes + hardware.
  • Frontier index — broader ecosystem-momentum view across coding agents, inference runtimes, memory systems, MCP.
  • Benchmarks — measured tokens-per-second + topology fields across hardware/model/runtime triples.