PewDiePie's self-hosted AI workspace: chat, agents, deep research, and a hardware-aware model Cookbook in one local-first dashboard.
Editorial verdict: “The most visible on-ramp to local AI yet — its hardware-aware Cookbook makes it a genuine beginner pick, but it's young and 'janky' by its own README; treat Agent mode's shell access with caution.”
Which runtime + OS combos this app works against. Source of truth for "will it run on my setup?"
Odysseus is a free, open-source, self-hosted AI workspace from Felix Kjellberg (PewDiePie), released May 31, 2026 under the MIT license. It is **not a model or a chatbot** — it is a single dashboard for talking to language models you run on your own hardware: chat, autonomous agents, deep research, email/calendar, notes, image generation, and model serving. The standout piece for newcomers is the **Cookbook**: it scans your hardware and recommends compatible models (VRAM-aware, GGUF/FP8/AWQ, fit-scored), then downloads and serves them via llama.cpp or vLLM — directly addressing the "will it run on my GPU?" problem. **Agent mode** (built on opencode) hands the model tools — MCP, web, files, shell, skills, memory — to run whole tasks. **Deep Research** does multi-step gather-read-synthesize runs; **Compare** does blind multi-model testing. Stack: Python 3.11, FastAPI, SQLite, ChromaDB, Docker. Runs locally against Ollama, llama.cpp, and vLLM, and can also connect OpenAI / OpenRouter / GitHub Copilot APIs. Quick start: clone the repo, `cp .env.example .env`, `docker compose up -d --build`; default UI at http://localhost:7000. It exploded on launch — 30,000+ GitHub stars in the first 48 hours — and its real significance is reach: someone with 110M+ subscribers pushing local, self-hosted AI into mainstream culture. **The privacy caveat that matters:** Odysseus is private *only when you use local models*. Connect a cloud API (OpenAI, Anthropic) and your data leaves for those servers exactly like ChatGPT — the local-first promise applies to the local path only. **Security note:** Agent mode with shell + MCP is a full remote-code-execution surface; don't point it at production systems or sensitive repos.
Web or desktop chat client that connects to your local runtime.
Best fast-RAG app. Workspace model is the right abstraction for doc-corpora chat.
Best if you mix local + cloud models in the same workflow. Strong team features.
Best one-binary desktop chat. Curated catalog removes 'which model?' decision paralysis.
Best default chat UI for solo Ollama users. Pick this first; switch only if you outgrow it.
Pre-filled with this app's recommended use case + budget tier. Get the full rig + runtime + model picks.
The full directory — filter by category, runtime, OS, privacy posture, or VRAM.
What this app talks to: Ollama, vLLM, llama.cpp, MLX, LM Studio. The upstream layer.
Did this app work for you on a specific rig? Submit the benchmark — it powers the model + hardware pages.