What plugs into your local AI runtime. 37 curated apps across 12 categories — chat UIs, coding agents, RAG pipelines, voice, image, browser extensions, editor plugins, mobile + desktop, agent frameworks, productivity, SDK wrappers.
Each entry carries an honest editorial verdict — pros, cons, the runtime it works against, the minimum VRAM, and the privacy posture. Filter to your stack, jump to the detail page, ship.
URL updates as you change filters — share or bookmark a result. All filters are server-rendered, so the page works without JS.
3 of 37 apps matching your filters
Air-gappable RAG over your docs. The OG offline-RAG project, now mature and team-friendly.
“Best when air-gap compliance is the requirement. Less polished than AnythingLLM, more configurable.”
Self-hosted AI assistant for your notes, emails, docs. Web + mobile + desktop, all local-first.
“Best 'AI second brain' app. Self-hosted, local-first, works against Obsidian.”
Weaviate's open-source RAG demo turned production. Strong defaults, opinionated stack.
“Best for 'don't make me choose chunking strategy' teams. Opinionated stack works.”
We curate this directory editorially — same review queue as the benchmarks feed. Open an issue with the project link and your one-line pitch for why it belongs.
Editorial review applies — same standards as the rest of the site. We won't list apps that don't actually work against a local runtime, regardless of marketing claims.
The runtime layer: Ollama, vLLM, llama.cpp, MLX, LM Studio server, ComfyUI. What apps in this directory talk to.
Tell us your use case, get a full rig recipe — runtime + models + the apps from this directory that fit your stack.
Match an app's minimum-VRAM requirement to real hardware with our price/perf comparison.
Real operator submissions on the model × hardware × app combos that work. The proof behind the editorial picks.