Apps directory

What plugs into your local AI runtime. 39 curated apps across 12 categories — chat UIs, coding agents, RAG pipelines, voice, image, browser extensions, editor plugins, mobile + desktop, agent frameworks, productivity, SDK wrappers.

Each entry carries an honest editorial verdict — pros, cons, the runtime it works against, the minimum VRAM, and the privacy posture. Filter to your stack, jump to the detail page, ship.

Filter the directory

URL updates as you change filters — share or bookmark a result. All filters are server-rendered, so the page works without JS.

§ Category

§ Runtime

§ OS / platform

§ Privacy posture

§ Max VRAM I have

§ Sort by

4 of 39 apps matching your filters

LiteLLM

SDK / proxy

★ 4.5

Hybrid (offline or cloud)

Drop-in OpenAI-compatible proxy across 100+ providers. Route to local Ollama or cloud, same code.

“Best universal LLM proxy. Foundational layer for multi-provider deployments.”

ollamallama-cppopenai-compatanthropic+2

Free tier

★ 15.0k

Ollama Python SDK

SDK / proxy

★ 4.5

Fully offline

Official Python SDK for Ollama. Async, streaming, typed — the right primitive for scripts.

“Foundational primitive for Python scripts against Ollama. Official, maintained, typed.”

ollama

Free

★ 5.5k

Ollama JS / TS SDK

SDK / proxy

★ 4.4

Fully offline

Official Node + browser SDK for Ollama. ESM-first, typed, streaming.

“Foundational primitive for Node + browser apps against Ollama. ESM-native, typed.”

ollama

Free

★ 4.5k

Claudin.io

SDK / proxy

Cloud required

Cloud LLM router pitching 'unlimited' inference from $10/month — proxies your requests across a pool of upstream models.

“Cloud-only LLM router. Useful category, novel pricing. The 'unlimited' math has a known failure mode at heavy usage.”

Paid

Missing an app? Suggest one.

We curate this directory editorially — same review queue as the benchmarks feed. Open an issue with the project link and your one-line pitch for why it belongs.

Open a GitHub issue →Or contribute a benchmark →

Editorial review applies — same standards as the rest of the site. We won't list apps that don't actually work against a local runtime, regardless of marketing claims.

Where to go from here

Runtimes (/tools) →

The runtime layer: Ollama, vLLM, llama.cpp, MLX, LM Studio server, ComfyUI. What apps in this directory talk to.

Stack Builder →

Tell us your use case, get a full rig recipe — runtime + models + the apps from this directory that fit your stack.

GPU chooser →

Match an app's minimum-VRAM requirement to real hardware with our price/perf comparison.

Community benchmarks →

Real operator submissions on the model × hardware × app combos that work. The proof behind the editorial picks.