Turkish open-weight LLMs

English-trained LLMs handle Turkish poorly because of how BPE tokenization interacts with Turkish’s agglutinative morphology. A single Turkish word (e.g., evlerimizdekilerden) can fragment into 8+ tokens on a Llama 3 tokenizer, versus ~3 on Qwen 3 (which was trained on 119 languages) and ~1 on a purpose-built Turkish tokenizer. More tokens per sentence means more compute, more VRAM, and degraded attention quality, because the same logical sentence is spread across a longer sequence.
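One rough way to compare tokenizers on this point is tokens per word, sometimes called fertility. The sketch below is illustrative only: `fertility` and `toy_chunk_tokenizer` are hypothetical helpers written for this example, and the toy tokenizer just cuts a word into fixed-size character chunks, so its counts are not the Llama 3 or Qwen 3 figures quoted above. To measure a real tokenizer, pass its own tokenize function in place of the toy one.

```python
from typing import Callable, List


def fertility(tokenize: Callable[[str], List[str]], text: str) -> float:
    """Average number of tokens per whitespace-separated word.

    Each word is tokenized on its own, which is an approximation: real
    subword tokenizers may treat a word differently in running text.
    """
    words = text.split()
    if not words:
        return 0.0
    total_tokens = sum(len(tokenize(word)) for word in words)
    return total_tokens / len(words)


def toy_chunk_tokenizer(chunk_size: int) -> Callable[[str], List[str]]:
    """Hypothetical stand-in for a real tokenizer (assumption, not BPE).

    A small chunk size mimics a vocabulary with short pieces for Turkish;
    a large chunk size mimics one that covers long Turkish morphemes.
    """
    def tokenize(word: str) -> List[str]:
        return [word[i:i + chunk_size] for i in range(0, len(word), chunk_size)]
    return tokenize


word = "evlerimizdekilerden"
coarse = toy_chunk_tokenizer(8)  # stand-in for a Turkish-aware vocabulary
fine = toy_chunk_tokenizer(2)    # stand-in for an English-centric vocabulary

print("coarse pieces:", coarse(word))
print("fine pieces:", fine(word))
print("fertility (fine):", fertility(fine, "ev evler " + word))
```

With a real tokenizer, the same `fertility` function can be run over a sample of Turkish text to see how much longer sequences get from one model family to another.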

That’s why a model trained from scratch on Turkish (Kanarya), or a model adapted to Turkish through continued pretraining and fine-tuning (the YTU CE COSMOS family, Trendyol’s flagship), often outperforms a larger English-base model on Turkish-language tasks despite having a fraction of the parameters. The Turkish-language community has its own ecosystem, and the catalog below covers the active releases.

Each entry links to the model page on /models, with the same editorial depth we give every model. Where we’ve run benchmarks (TurkishMMLU specifically), the score appears next to the model. Where we haven’t yet, it says “not yet tested”; we don’t fake numbers.

Why a separate Turkish catalog?

Other / from-scratch

Mistral-based

Gemma-based

Llama-based

Qwen-based

We’re running TurkishMMLU on every model above

The Turkish AI community moves fast. If we missed a release, tell us.