GPT-NeoX 20B

GPT-NeoX 20B

GPT-NeoX-20B is a 20B-parameter English autoregressive model from EleutherAI, trained on the 825 GiB Pile dataset. It uses a GPT-3-style transformer architecture and ships under Apache 2.0. There is no instruction tuning or chat fine-tuning — this is a raw base model.

License: apache-2.0·Context: 2,048 tokens

Auto-generated rating (Opus 4.7 judge, claude-opus-4-7). Overall 9.10/10. License (Apache 2.0) is explicitly verified in the HF card with commercial use permitted. Metadata (20B params, 2048 context, EleutherAI vendor, GPT-NeoX architecture) all match the model card precisely. The description and verdict are honest, operator-voiced, and correctly flag this as a base model with no Korean support and outdated context length. The useCases tag listing 'korean' is contradictory with the row's own honest assessment that it has zero Korean capability — this is a real concern but the verdict explicitly warns Korean-hub readers away, which is the right editorial call. Brand fit is moderate: a 2022-era 40GB English base model is niche for local-AI builders, but the row honestly scopes it to fine-tuning research.

Flags: - useCases includes 'korean' which directly contradicts the row's own weakness ('English-only — no Korean language capability') — should be removed - Narrow practical audience: most runlocalai readers won't fine-tune a 20B base model; verdict appropriately hedges this

Quantization	File size	VRAM required
Q4_K_M	11.0 GB	14 GB

Quantization

File size

VRAM required

Q4_K_M

11.0 GB

14 GB

Frequently asked

What's the minimum VRAM to run GPT-NeoX 20B?

14GB of VRAM is enough to run GPT-NeoX 20B at the Q4_K_M quantization (file size 11.0 GB). Higher-quality quantizations need more.

Can I use GPT-NeoX 20B commercially?

Yes — GPT-NeoX 20B ships under the apache-2.0, which permits commercial use. Always read the license text before deployment.

What's the context length of GPT-NeoX 20B?

GPT-NeoX 20B supports a context window of 2,048 tokens (about 2K).

Our verdict

Overview

Strengths

Weaknesses

Quantization variants

Get the model

HuggingFace

Hardware that runs this

Models worth comparing

Frequently asked

What's the minimum VRAM to run GPT-NeoX 20B?

Can I use GPT-NeoX 20B commercially?

What's the context length of GPT-NeoX 20B?

Related — keep moving