Apps directory

What plugs into your local AI runtime. 39 curated apps across 12 categories — chat UIs, coding agents, RAG pipelines, voice, image, browser extensions, editor plugins, mobile + desktop, agent frameworks, productivity, SDK wrappers.

Each entry carries an honest editorial verdict — pros, cons, the runtime it works against, the minimum VRAM, and the privacy posture. Filter to your stack, jump to the detail page, ship.

Filter the directory

URL updates as you change filters — share or bookmark a result. All filters are server-rendered, so the page works without JS.

§ Category

§ Runtime

§ OS / platform

§ Privacy posture

§ Max VRAM I have

§ Sort by

3 of 39 apps matching your filters

Obsidian Copilot

Editor plugin

★ 4.4

Hybrid (offline or cloud)

Obsidian plugin that wires Ollama / OpenAI into your notes. Inline chat, summarize, prompt-templates.

“Best Obsidian plugin for local LLM in your notes. Pair with Smart Connections for RAG.”

ollamaopenai-compat

Free· 4GB+ VRAM

★ 3.0k

Smart Connections (Obsidian)

Editor plugin

★ 4.4

Fully offline

Local semantic search across all your Obsidian notes. Embed-once, query-fast, fully offline.

“Best local semantic search for personal notes. Foundational layer for Obsidian RAG.”

ollamaopenai-compat

Freemium· 4GB+ VRAM

★ 3.0k

Codeium (with local backend)

Editor plugin

★ 4.0

Fully offline

Codeium self-hosted enterprise backend lets the popular IDE plugin run fully on your hardware.

“Best 'enterprise Copilot' replacement when self-hosting is mandatory. Paid tier.”

custom

Paid· 24GB+ VRAM

Missing an app? Suggest one.

We curate this directory editorially — same review queue as the benchmarks feed. Open an issue with the project link and your one-line pitch for why it belongs.

Open a GitHub issue →Or contribute a benchmark →

Editorial review applies — same standards as the rest of the site. We won't list apps that don't actually work against a local runtime, regardless of marketing claims.

Where to go from here

Runtimes (/tools) →

The runtime layer: Ollama, vLLM, llama.cpp, MLX, LM Studio server, ComfyUI. What apps in this directory talk to.

Stack Builder →

Tell us your use case, get a full rig recipe — runtime + models + the apps from this directory that fit your stack.

GPU chooser →

Match an app's minimum-VRAM requirement to real hardware with our price/perf comparison.

Community benchmarks →

Real operator submissions on the model × hardware × app combos that work. The proof behind the editorial picks.