runner
Open source
free
4.4/5

Llamafile

Mozilla's single-binary llama.cpp distribution. Download one file, run on any OS without dependencies.

By Fredoline Eruo · Last verified Jun 12, 2026 · 22,000 GitHub stars

Stack & relationships

How Llamafile relates to other entries in the catalog — recommended pairings, alternatives, dependencies, and edges to avoid. Each edge carries a one-line operator note from our editorial team.

Llamafile ↔ ecosystem

Lifecycle

  • Forked from
    llama.cpp

    Mozilla's single-binary distribution of llama.cpp, packaged with Cosmopolitan Libc so that one executable runs on multiple operating systems. Same engine, zero-install delivery.
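
As a rough illustration of what "zero-install" means in practice, the sketch below prepares a downloaded file so the host OS will execute it. It assumes the usual pattern for this kind of binary: set the execute bits on macOS and Linux, and add an `.exe` extension on Windows. The function name `prepare_llamafile` and the file name `model.llamafile` are hypothetical and used only for illustration.

```python
import stat
from pathlib import Path


def prepare_llamafile(path: Path, system: str) -> Path:
    """Return a path the given OS can execute directly.

    `system` is a value such as platform.system() returns
    ("Windows", "Linux", "Darwin"). Hypothetical helper, not part of Llamafile.
    """
    if system == "Windows":
        # Windows decides executability by extension, so append ".exe".
        if path.suffix.lower() != ".exe":
            target = path.with_name(path.name + ".exe")
            path.rename(target)
            return target
        return path
    # macOS / Linux: add the execute bits (the equivalent of `chmod +x`).
    mode = path.stat().st_mode
    path.chmod(mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
    return path
```

After this step, the returned path can be launched like any other program, for example `subprocess.run([str(prepare_llamafile(Path("model.llamafile"), platform.system()))])` with `platform` and `subprocess` imported.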

Pros

  • Zero install — single executable
  • Cross-platform
  • No runtime deps

Cons

  • Lags upstream llama.cpp on the newest features

Compatibility

Operating systems: macOS, Linux, Windows
GPU backends: NVIDIA CUDA, Apple Metal, CPU
License: Open source · free

Runtime health

Operator-grade signals on how actively Llamafile is being maintained, how fresh its measurements are, and what failure classes operators have flagged. Every label below is anchored to a real date or count — we never infer maintainer activity we can't show.

Release cadence

Derived from the most recent editorial signal on this row.

Active
Updated Jun 12, 2026

8 days since last refresh · source: lastUpdated

Benchmark freshness

How recent the editorial measurements on this runtime are.

0 editorial benchmarks

No editorial benchmarks for this runtime yet.

Community reproduction

Submissions that match an editorial measurement on similar hardware.

0 reproduced reports

No community reproductions on file yet.

Ecosystem stability

Editorial rating from RunLocalAI — qualitative, not measured.

4.4/5 · Editorial

Frequently asked

Is Llamafile free?

Yes — Llamafile is free to use and open-source.

What operating systems does Llamafile support?

Llamafile supports macOS, Linux, and Windows.

Which GPUs work with Llamafile?

Llamafile supports the NVIDIA CUDA and Apple Metal GPU backends. CPU-only operation is also possible but typically slower.
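
To get a rough idea of which of these backends a given machine is likely to use, a heuristic along the following lines may help. This is only a sketch under stated assumptions: it treats Apple Silicon as the Metal case, treats the presence of the `nvidia-smi` tool on `PATH` as a sign of a usable NVIDIA driver, and falls back to CPU otherwise. It does not query Llamafile itself, and the function names are hypothetical.

```python
import platform
import shutil


def guess_backend(system: str, machine: str, has_nvidia_smi: bool) -> str:
    """Map basic host facts to one of the backends listed on this page."""
    # Assumption: Apple Silicon Macs are the Metal case.
    if system == "Darwin" and machine == "arm64":
        return "Apple Metal"
    # Assumption: nvidia-smi on PATH implies an installed NVIDIA driver.
    if has_nvidia_smi:
        return "NVIDIA CUDA"
    return "CPU"


def guess_local_backend() -> str:
    """Apply the heuristic to the machine running this script."""
    return guess_backend(
        platform.system(),
        platform.machine(),
        shutil.which("nvidia-smi") is not None,
    )
```

A result of "CPU" only means the heuristic found no obvious GPU path; it is not proof that acceleration is unavailable.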

Reviewed by RunLocalAI Editorial. See our editorial policy for how we evaluate tools.

Before you buy

Verify Llamafile runs on your specific hardware before committing money.
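
One lightweight way to do that check is a smoke test: start the binary with a harmless argument and confirm it exits cleanly. The sketch below is generic and makes no claims about Llamafile's own options. The helper name `smoke_test` is hypothetical, and the `--version` argument in the usage note is an assumption that should be confirmed against the tool's help output.

```python
import subprocess


def smoke_test(command: list[str], timeout_s: float = 30.0) -> bool:
    """Return True if the command starts and exits with status 0."""
    try:
        result = subprocess.run(
            command,
            capture_output=True,  # keep the child's output off the console
            timeout=timeout_s,    # give up if the process hangs
            check=False,          # inspect the return code ourselves
        )
    except (OSError, subprocess.TimeoutExpired):
        # OSError covers "file not found" and "cannot execute on this platform".
        return False
    return result.returncode == 0
```

For example, `smoke_test(["./model.llamafile", "--version"])` would report whether a (hypothetically named) download can execute at all on the current machine, before any model is loaded.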