Typhoon S ThaiLLM 8B Instruct Research Preview

An instruction-tuned 8B Thai language model from typhoon-ai, built on ThaiLLM using supervised fine-tuning and on-policy distillation. Training ran on a single H100 node for two days using an academic budget. Full training data, code, and a technical report are publicly available under Apache 2.0.

License: apache-2.0 · Context: 32,768 tokens

Auto-generated rating (Opus 4.7 judge, claude-opus-4-7). Overall 9.20/10. License is explicitly Apache 2.0 on the card and tags, matching the row. Metadata (8B params, 32K context, Qwen3 family, ThaiLLM base) is all verifiable from the card. The description is concrete and operator-voiced, citing the academic budget, SFT+OPD method, and openness honestly. Weaknesses are appropriately blunt about the 'research preview' label, thin safety, and low traction. Use case is specifically scoped to Thai NLP research, which is sharp. Brand fit is slightly weaker since it's explicitly research-preview rather than production-ready, but the verdict handles that hedge well.

Flags:
- arxiv:2601.18129 is a future-dated, likely placeholder arXiv ID; worth a sanity check that the technical report actually resolves.
- The license link on the card points to Qwen3 LICENSE files, not a typhoon-ai LICENSE. The Apache 2.0 claim is consistent but inherited; commercial use is defensible, but readers should verify the base model (ThaiLLM) license chain.

Quantization	File size	VRAM required
Q4_K_M	4.4 GB	6 GB

Frequently asked

What's the minimum VRAM to run Typhoon S ThaiLLM 8B Instruct Research Preview?

6 GB of VRAM is enough to run Typhoon S ThaiLLM 8B Instruct Research Preview at the Q4_K_M quantization (file size 4.4 GB). Higher-precision quantizations produce larger files and need more VRAM.
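
The figures above leave some room beyond the weights file itself. A minimal Python sketch, assuming the listed 6 GB requirement is the 4.4 GB file plus working memory for the KV cache and runtime buffers (this page does not break the number down). The `QUANTS` table and both function names are illustrative, not part of any tool:

```python
# Figures copied from the quantization table on this page (in GB).
QUANTS = {"Q4_K_M": {"file_gb": 4.4, "vram_gb": 6.0}}


def fits(quant: str, available_vram_gb: float) -> bool:
    """Return True if the listed VRAM requirement fits in the given GPU memory."""
    return available_vram_gb >= QUANTS[quant]["vram_gb"]


def headroom_gb(quant: str) -> float:
    """Listed VRAM requirement minus the weights file size.

    Assumption: this difference covers KV cache and runtime buffers.
    """
    q = QUANTS[quant]
    return round(q["vram_gb"] - q["file_gb"], 1)


if __name__ == "__main__":
    print(fits("Q4_K_M", 8.0))   # check an 8 GB card
    print(fits("Q4_K_M", 4.0))   # check a 4 GB card
    print(headroom_gb("Q4_K_M"))
```

Actual memory use depends on the runtime and on how much of the context window is filled, so treat the listed requirement as a floor rather than a guarantee.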

Can I use Typhoon S ThaiLLM 8B Instruct Research Preview commercially?

Yes. Typhoon S ThaiLLM 8B Instruct Research Preview ships under the Apache 2.0 license, which permits commercial use. Always read the license text before deployment.

What's the context length of Typhoon S ThaiLLM 8B Instruct Research Preview?

Typhoon S ThaiLLM 8B Instruct Research Preview supports a context window of 32,768 tokens (32K, i.e. 32 × 1,024).
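
In practice the window has to hold both the prompt and the generated reply. A small Python sketch of that budgeting, assuming prompt and output tokens share the same 32,768-token window (typical for decoder-only models, but not stated on this page). The helper name `max_new_tokens` is hypothetical:

```python
CONTEXT_WINDOW = 32_768  # tokens, from this page (equals 2**15)


def max_new_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for generation after the prompt.

    Returns 0 if the prompt already fills or exceeds the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_window - prompt_tokens, 0)


if __name__ == "__main__":
    print(max_new_tokens(30_000))  # room left after a long prompt
    print(max_new_tokens(40_000))  # prompt larger than the window
```

Token counts depend on the model's tokenizer, so the prompt length should be measured with that tokenizer rather than estimated from characters or words.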
