runner

Open source

free

4.4/5

Llamafile

Mozilla's single-binary llama.cpp distribution. Download one file, run on any OS without dependencies.

By Fredoline Eruo · Last verified Jun 12, 2026 · 22,000 GitHub stars

Overview

Llamafile is Mozilla's single-binary distribution of llama.cpp. You download one executable file and run it on macOS, Linux, or Windows without installing a runtime or any other dependencies.
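As a rough illustration of what "download one file and run it" involves in practice, the sketch below prepares a downloaded file for launch. It assumes that Windows only executes the file once it carries an .exe suffix and that macOS and Linux need the executable bit set; the filename model.llamafile and both helper names are hypothetical, not part of Llamafile itself.

```python
import stat
import sys
from pathlib import Path


def runnable_name(filename: str, platform: str = sys.platform) -> str:
    """Return the filename the OS should agree to execute.

    Assumption: Windows needs an .exe suffix; macOS and Linux do not.
    """
    if platform.startswith("win") and not filename.lower().endswith(".exe"):
        return filename + ".exe"
    return filename


def prepare(path: Path, platform: str = sys.platform) -> Path:
    """Rename (Windows) or mark executable (macOS/Linux) a downloaded file."""
    target = path.with_name(runnable_name(path.name, platform))
    if target != path:
        path.rename(target)
    if not platform.startswith("win"):
        # Equivalent to `chmod u+x` on the downloaded file.
        target.chmod(target.stat().st_mode | stat.S_IXUSR)
    return target


if __name__ == "__main__":
    # "model.llamafile" is a placeholder name, not a real download.
    print(runnable_name("model.llamafile"))
```

After this step the file can be started directly from a terminal. Check the project's README for the exact launch options, since they are not covered on this page.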

Stack & relationships

How Llamafile relates to other entries in the catalog — recommended pairings, alternatives, dependencies, and edges to avoid. Each edge carries a one-line operator note from our editorial team.

Llamafile ↔ ecosystem

Lifecycle

Forked from
llama.cpp
Mozilla's single-binary distribution of llama.cpp, packaged with Cosmopolitan Libc so that one file runs across operating systems. Same engine, zero-install delivery.

Pros

Zero install — single executable
Cross-platform
No runtime deps

Cons

Lags behind upstream llama.cpp on the newest features

Compatibility

Operating systems	macOS, Linux, Windows
GPU backends	NVIDIA CUDA, Apple Metal, CPU
License	Open source · free

Runtime health

Operator-grade signals on how actively Llamafile is being maintained, how fresh its measurements are, and what failure classes operators have flagged. Every label below is anchored to a real date or count — we never infer maintainer activity we can't show.

Release cadence

Derived from the most recent editorial signal on this row.

Active

Updated Jun 12, 2026

8 days since last refresh · source: lastUpdated

Benchmark freshness

How recent the editorial measurements on this runtime are.

0 editorial benchmarks

No editorial benchmarks for this runtime yet.

Community reproduction

Submissions that match an editorial measurement on similar hardware.

0 reproduced reports

No community reproductions on file yet.

Ecosystem stability

Editorial rating from RunLocalAI — qualitative, not measured.

4.4/5 · Editorial

Get Llamafile

GitHub

https://github.com/Mozilla-Ocho/llamafile

Frequently asked

Is Llamafile free?

Yes — Llamafile is free to use and open-source.

What operating systems does Llamafile support?

Llamafile supports macOS, Linux, and Windows.

Which GPUs work with Llamafile?

Llamafile supports NVIDIA GPUs through CUDA and Apple GPUs through Metal. CPU-only operation is also possible but typically slower.

Reviewed by RunLocalAI Editorial. See our editorial policy for how we evaluate tools.

Alternatives

MLX-LM, ExLlamaV2, IPEX-LLM, Intel OpenVINO, DirectML, llama-cpp-python, Aphrodite Engine, ONNX Runtime Mobile

Before you buy

Verify Llamafile runs on your specific hardware before committing money.
