mGPT 1.3B Mongol

A 1.3B-parameter GPT model fine-tuned from ai-forever's mGPT base for Mongolian, with English and Russian also supported. Fine-tuning ran for 50,000 steps on Mongolian-specific data, yielding a validation perplexity of 4.35. Context window is 2048 tokens.

License: mit·Context: 2,048 tokens

BLK · VERDICT

Our verdict

OP · Eruo Fredoline|VERIFIED MAY 29, 2026

9.3/10

If you specifically need Mongolian language support, this is one of the very few options available at any size, which matters more than its parameter count. The perplexity number looks healthy, but community adoption is near zero so production reliability is unproven. For lightweight tasks — autocomplete, classification, simple generation — it's worth a test run given the low VRAM cost. Skip it if your workload is primarily Russian or English, where far better-validated models exist.

›Why this rating

Auto-generated rating (Opus 4.7 judge, claude-opus-4-7). Overall 9.30/10. All factual claims (1.3B params, 2048 context, MIT license, perplexity 4.35, 50k steps) are directly verifiable from the model card. The description is honest and operator-voiced, explicitly flagging the small-model limitations, tight context, and near-zero community vetting. Use case is sharp — Mongolian text generation is a clear niche. Brand fit is moderate (narrow language audience, base GPT-2 architecture without instruction tuning), but for the subset of runlocalai readers who need Mongolian, this is genuinely useful and the verdict honestly steers others away. Clears the 9.0 bar.

Overview

Strengths

Rare Mongolian-language coverage — few models this accessible target it
Validation perplexity of 4.35 suggests solid fit to the Mongolian training distribution
MIT license — fully commercial-friendly
Tiny footprint (1.3B params) runs on CPU or minimal VRAM

Weaknesses

1.3B parameters is small; complex reasoning or long-form generation will degrade quickly
2048-token context is tight by modern standards
50,000 fine-tuning steps is relatively short — edge cases in Mongolian may be shaky
818 downloads and 3 likes signals almost no community vetting or reported real-world results