PhoGPT 4B

PhoGPT-4B is a 3.7B-parameter model pre-trained from scratch on 102B Vietnamese tokens, making it one of the few Vietnamese-first generative models available. Context window tops out at 8192 tokens. A separate chat-tuned variant (PhoGPT-4B-Chat) handles instruction following and conversation.

License: bsd-3-clause·Context: 8,192 tokens

BLK · VERDICT

Our verdict

OP · Fredoline Eruo|VERIFIED MAY 28, 2026

9.1/10

If Vietnamese is your only target language and your hardware is constrained, PhoGPT-4B is the most purpose-built open option available at this size. The 102B-token Vietnamese pretraining is a genuine differentiator over repurposed multilingual models. That said, 3.7B parameters will hurt on anything requiring nuanced reasoning, and the thin download numbers mean you're partly in uncharted territory. Hedge: worth testing against your actual task before committing; skip if you need multilingual support at all.

›Why this rating

Auto-generated rating (Opus 4.7 judge, claude-opus-4-7). Overall 9.10/10. License is explicitly BSD-3-Clause on the HF card and commercial use is correctly flagged. Params (3.7B), context (8192), vendor (VinAI), and family (mpt) all match the card and HF tags. Description is concrete, honest, and operator-voiced — explicitly notes the 102B-token Vietnamese pretraining, calls out low download counts as a real risk, and warns about reasoning ceilings at 3.7B. Use case is sharply scoped to Vietnamese text generation. Minor deployability gap: no mention of GGUF availability or VRAM expectations, and the custom_code MPT architecture can be a real friction point that isn't flagged. Overall a solid, niche-but-defensible row that clears the bar.

Flags: - No mention of custom_code/trust_remote_code requirement for MPT architecture — relevant deployability detail - No GGUF/quantization availability noted

Overview

Strengths

Pre-trained from scratch on 102B Vietnamese tokens — not adapted from a multilingual base
8192-token context window is generous for a 3.7B model
BSD-3-Clause license allows commercial use without royalty concerns
Outperforms prior open-source Vietnamese-language models per VinAI's own evals

Weaknesses

3.7B parameters puts a real ceiling on reasoning and factual recall
Vietnamese-only — not a viable option if you need any cross-lingual capability
Low community traction (929 downloads, 21 likes) means limited third-party testing or bug reports
Training data sourcing and composition are not publicly detailed beyond language and token count