Open WebUI

Fully offline

The default chat UI for solo Ollama users. Multi-model, built-in RAG, web search, Docker-friendly.

Editorial verdict: “Best default chat UI for solo Ollama users. Pick this first; switch only if you outgrow it.”

Chat UI

Free

MIT

★ 4.8 / 5

GitHub ★ 56,000

↗ Homepage ↗ GitHub ↗ Docs

Compatibility at a glance

Which runtime + OS combos this app works against. Source of truth for "will it run on my setup?"

§ Runtimes supported

ollamaopenai-compat

§ OS / platform

macoslinuxwindowsweb

§ Hardware + model hint

Minimum VRAM

4 GB

Recommended starter model

Llama 3.1 8B Q4_K_M

→ Build the rest of the stack with /stack-builder → Pick a GPU for this app

What it is

Open WebUI started life as 'Ollama WebUI' and now speaks to Ollama, OpenAI-compatible endpoints, and external API providers via a single config. Out of the box: multi-conversation, persistent chats, built-in RAG against uploaded files, web search hooks, image-gen integration via ComfyUI. The most popular path from 'I installed Ollama' to 'I'm using a real chat UI' for 2024-2026.

✓ Strengths

+ChatGPT-style UX with no learning curve
+Multi-user with workspaces (good for small teams)
+Built-in RAG against uploaded files
+Active development, weekly releases

△ Caveats

−Docker is the only blessed install path; bare-metal install is unsupported
−Some advanced features (image gen, voice) require additional services

About the Chat UI category

Web or desktop chat client that connects to your local runtime.

§ Other chat ui apps

Jan

Best one-binary desktop chat. Curated catalog removes 'which model?' decision paralysis.

AnythingLLM

Best fast-RAG app. Workspace model is the right abstraction for doc-corpora chat.

LibreChat

Best if you mix local + cloud models in the same workflow. Strong team features.

Where to go from here

Stack Builder →

Pre-filled with this app's recommended use case + budget tier. Get the full rig + runtime + model picks.

Back to /apps →

The full directory — filter by category, runtime, OS, privacy posture, or VRAM.

Runtimes (/tools) →

What this app talks to: Ollama, vLLM, llama.cpp, MLX, LM Studio. The upstream layer.

Community benchmarks →

Did this app work for you on a specific rig? Submit the benchmark — it powers the model + hardware pages.