dolphin

3B parameters

Commercial OK

Reviewed May 2026

Dolphin 3.0 Llama 3.2 3B

Eric Hartford's Dolphin fine-tune at 3B. Less-censored than the base Llama; popular for unconstrained-generation use cases.

License: Llama Community License·Released Dec 15, 2024·Context: 131,072 tokens

Overview

Eric Hartford's Dolphin fine-tune at 3B. Less-censored than the base Llama; popular for unconstrained-generation use cases.

Family & lineage

How this model relates to others in its lineage. Family members share architecture and training-data roots; parent / children edges record direct distillation or fine-tune relationships.

Parent / base model

Llama 3.2 3B Instruct3B

Edge

Family siblings (dolphin-3)

Dolphin 3.0 Llama 3.2 3B3B

You are here

Dolphin 3.0 Mistral 24B24B

Consumer

Dolphin 3 Llama 3.3 70B70B

Datacenter

Strengths

Less censored than base
Strong creative-writing benchmarks

Weaknesses

Smaller community than Llama base

Quantization variants

Each quantization trades model quality for file size and VRAM. Q4_K_M is the most popular starting point.

Quantization	File size	VRAM required
Q4_K_M	1.8 GB	3 GB

Get the model

HuggingFace

Original weights

huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.2-3B

Source repository — direct quantization required.

Hardware that runs this

Cards with enough VRAM for at least one quantization of Dolphin 3.0 Llama 3.2 3B.

Compare alternatives

Models worth comparing

Same parameter band, plus what's one tier above and below — so you can decide what actually fits your hardware.

Same tier

Models in the same parameter band as this one

Step up

More capable — bigger memory footprint

Step down

Smaller — faster, runs on weaker hardware

Frequently asked

What's the minimum VRAM to run Dolphin 3.0 Llama 3.2 3B?

3GB of VRAM is enough to run Dolphin 3.0 Llama 3.2 3B at the Q4_K_M quantization (file size 1.8 GB). Higher-quality quantizations need more.

Can I use Dolphin 3.0 Llama 3.2 3B commercially?

Yes — Dolphin 3.0 Llama 3.2 3B ships under the Llama Community License, which permits commercial use. Always read the license text before deployment.

What's the context length of Dolphin 3.0 Llama 3.2 3B?

Dolphin 3.0 Llama 3.2 3B supports a context window of 131,072 tokens (about 131K).

Source: huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.2-3B

Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify model claims.

Related — keep moving

Compare hardware

Buyer guides

When it doesn't work

Recommended hardware

Alternatives

Dolphin 3.0 Mistral 24B Dolphin 3 Llama 3.3 70B

Before you buy

Verify Dolphin 3.0 Llama 3.2 3B runs on your specific hardware before committing money.

Will it run on my hardware? →Custom hardware comparison →GPU recommender (4 questions) →