internlm

8B parameters

Restricted

Reviewed May 2026

InternLM 3 8B

Shanghai AI Lab's open-research line. InternLM 3 at 8B; strong on Chinese-language tasks.

License: InternLM License·Released Oct 5, 2025·Context: 32,768 tokens

Overview

Shanghai AI Lab's open-research line. InternLM 3 at 8B; strong on Chinese-language tasks.

Strengths

Chinese-language strength
Active research lineage

Weaknesses

Commercial use restricted

Quantization variants

Each quantization trades model quality for file size and VRAM. Q4_K_M is the most popular starting point.

Quantization	File size	VRAM required
Q4_K_M	4.7 GB	6 GB

Get the model

HuggingFace

Original weights

huggingface.co/internlm/internlm3-8b-instruct

Source repository — direct quantization required.

Hardware that runs this

Cards with enough VRAM for at least one quantization of InternLM 3 8B.

NVIDIA GB200 NVL72

13824GB · nvidia

AMD Instinct MI355X

AMD Instinct MI325X

AMD Instinct MI300X

192GB · nvidia

NVIDIA H100 NVL

188GB · nvidia

141GB · nvidia

Frequently asked

What's the minimum VRAM to run InternLM 3 8B?

6GB of VRAM is enough to run InternLM 3 8B at the Q4_K_M quantization (file size 4.7 GB). Higher-quality quantizations need more.

Can I use InternLM 3 8B commercially?

InternLM 3 8B is released under the InternLM License, which has restrictions for commercial use. Review the license terms before using it in a product.

What's the context length of InternLM 3 8B?

InternLM 3 8B supports a context window of 32,768 tokens (about 33K).

Source: huggingface.co/internlm/internlm3-8b-instruct

Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify model claims.

Related — keep moving

Compare hardware

4060 Ti 16 GB vs 4070 Ti Super →
Arc B580 vs 4060 Ti 16 GB →

Buyer guides

When it doesn't work

Recommended hardware

Before you buy

Verify InternLM 3 8B runs on your specific hardware before committing money.

Will it run on my hardware? →Custom hardware comparison →GPU recommender (4 questions) →