Obsidian plugin that wires Ollama / OpenAI into your notes. Inline chat, summarize, prompt-templates.
Editorial verdict: “Best Obsidian plugin for local LLM in your notes. Pair with Smart Connections for RAG.”
Which runtime + OS combos this app works against. Source of truth for "will it run on my setup?"
Obsidian Copilot is the plugin to pick if you keep your notes in Obsidian and want a local LLM wired directly into your writing flow. It bridges to Ollama or any OpenAI-compatible endpoint, so you can run a model like Llama 3.1 8B Q4_K_M on your own hardware and keep everything offline. Inline chat, selection summarization, and vault-wide search-and-ask all work without leaving your editor. Pair it with Smart Connections for RAG over your full note corpus. The catch: mobile use requires a reachable LAN endpoint, and the best vault-context features lean on Obsidian Sync or a self-hosted server.
Plugin for VS Code, JetBrains, Vim, etc.
Pre-filled with this app's recommended use case + budget tier. Get the full rig + runtime + model picks.
The full directory — filter by category, runtime, OS, privacy posture, or VRAM.
What this app talks to: Ollama, vLLM, llama.cpp, MLX, LM Studio. The upstream layer.
Did this app work for you on a specific rig? Submit the benchmark — it powers the model + hardware pages.