Open-source clone of the ChatGPT UI with multi-provider routing. Local + cloud in one interface.
Editorial verdict: “Best if you mix local + cloud models in the same workflow. Strong team features.”
Which runtime + OS combos this app works against. Source of truth for "will it run on my setup?"
LibreChat is for teams or power users who want a single interface that bridges local Ollama models with cloud APIs like Anthropic and OpenAI. It replicates the ChatGPT UX closely—same prompt library, custom instructions, chat folders—so the learning curve is shallow if you’re coming from OpenAI’s web app. Setup is heavier than Open WebUI: you’ll edit YAML to configure local runtimes, and the 4 GB VRAM floor means a Llama 3.1 8B Q4_K_M is the practical entry point. The payoff is per-conversation model routing, letting you switch between a private local inference and a cloud call without leaving the chat. Works on Linux, macOS, and Windows, but the hybrid privacy posture means you’re trusting both your local hardware and whichever cloud provider you plug in.
Web or desktop chat client that connects to your local runtime.
Best fast-RAG app. Workspace model is the right abstraction for doc-corpora chat.
Best one-binary desktop chat. Curated catalog removes 'which model?' decision paralysis.
Best default chat UI for solo Ollama users. Pick this first; switch only if you outgrow it.
The most visible on-ramp to local AI yet — its hardware-aware Cookbook makes it a genuine beginner pick, but it's young and 'janky' by its own README; treat Agent mode's shell access with caution.
Pre-filled with this app's recommended use case + budget tier. Get the full rig + runtime + model picks.
The full directory — filter by category, runtime, OS, privacy posture, or VRAM.
What this app talks to: Ollama, vLLM, llama.cpp, MLX, LM Studio. The upstream layer.
Did this app work for you on a specific rig? Submit the benchmark — it powers the model + hardware pages.