1000B parameters · Commercial OK
Kimi K2.6
License: Kimi Open Weights License · Released Mar 10, 2026 · Context: 2,000,000 tokens
Overview
Moonshot's long-context, agent-oriented MoE. Optimized for stability under tool use and multi-step coding/planning workflows.
Strengths
- Agent-tuned
- Stable tool use
- Long context
Weaknesses
- Datacenter-class hardware required
Quantization variants
Each quantization trades some model quality for a smaller file size and lower VRAM use. Q4_K_M is the most popular starting point.
| Quantization | File size | VRAM required |
|---|---|---|
| Q4_K_M | 600.0 GB | 700 GB |
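As a rule of thumb, the weights file must fit in VRAM with headroom left for the KV cache and activations. A minimal sketch of that fit check, where the 15% overhead figure is an illustrative assumption (real overhead depends on context length, batch size, and runtime), not a published spec:

```python
def fits_in_vram(file_size_gb: float, vram_gb: float, overhead_frac: float = 0.15) -> bool:
    """Rough fit check: weights plus ~15% headroom for KV cache and activations.

    overhead_frac is an illustrative assumption; actual overhead varies with
    context length, batch size, and the inference runtime used.
    """
    return file_size_gb * (1 + overhead_frac) <= vram_gb

# Q4_K_M weights are ~600 GB; the table lists 700 GB of VRAM.
print(fits_in_vram(600.0, 700.0))  # True: ~690 GB needed, 700 GB available
```

By this estimate, 700 GB is a tight but workable budget for the 600 GB Q4_K_M file, which matches the table's figure.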
Get the model
HuggingFace
Original weights
huggingface.co/moonshotai/Kimi-K2.6
Source repository with the original weights; you must quantize them yourself.
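If you quantize the original weights yourself, you can estimate the resulting file size from the parameter count and the average bits per weight. The ~4.8 bits/weight figure below is an illustrative average for Q4_K_M, not an official number:

```python
def quantized_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate quantized file size in decimal GB.

    params_billion * 1e9 weights * bits / 8 bits-per-byte / 1e9 bytes-per-GB
    simplifies to params_billion * bits_per_weight / 8. Ignores file metadata
    and any layers a mixed-precision scheme keeps at higher bit widths.
    """
    return params_billion * bits_per_weight / 8

# ~4.8 bits/weight for Q4_K_M is an assumed average for illustration.
print(round(quantized_size_gb(1000, 4.8), 1))  # 600.0
```

A 1000B-parameter model at roughly 4.8 bits/weight lands near the 600 GB listed in the quantization table above.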
Hardware that runs this
Cards with enough VRAM for at least one quantization of Kimi K2.6.
Compare alternatives
Models worth comparing
Models in the same parameter band, plus one tier above and below, so you can decide what actually fits your hardware.
Same tier
Models in the same parameter band as this one
Step up
More capable — bigger memory footprint
No reviewed models in the next tier up yet.
Step down
Smaller — faster, runs on weaker hardware
Frequently asked
What's the minimum VRAM to run Kimi K2.6?
700GB of VRAM is enough to run Kimi K2.6 at the Q4_K_M quantization (file size 600.0 GB). Higher-quality quantizations need more.
Can I use Kimi K2.6 commercially?
Yes — Kimi K2.6 ships under the Kimi Open Weights License, which permits commercial use. Always read the license text before deployment.
What's the context length of Kimi K2.6?
Kimi K2.6 supports a context window of 2,000,000 tokens (2M).
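To gauge whether a document fits in that window before sending it, a crude character-count heuristic is often enough. The ~4 characters-per-token ratio is an assumption that holds loosely for English prose; exact counts require the model's own tokenizer:

```python
CONTEXT_WINDOW = 2_000_000  # tokens, per the model card

def rough_token_count(text: str) -> int:
    # Crude heuristic: ~4 characters per token for English prose.
    # Exact counts require the model's actual tokenizer.
    return max(1, len(text) // 4)

def fits_in_context(text: str, reserve_for_output: int = 4_096) -> bool:
    """Check whether a prompt plausibly fits, leaving room for the reply."""
    return rough_token_count(text) + reserve_for_output <= CONTEXT_WINDOW
```

At ~4 characters per token, 2M tokens corresponds to roughly 8 million characters of English text.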
Source: huggingface.co/moonshotai/Kimi-K2.6
Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify model claims.