RUNLOCALAIv38
->Will it run?Best GPUCompareTroubleshootStartLearnPulseModelsHardwareToolsBench
Run check
RUNLOCALAI

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo
DIR
  • Models
  • Hardware
  • Tools
  • Benchmarks
TOOLS
  • Will it run?
  • Compare hardware
  • Cost vs cloud
  • Choose my GPU
  • Prompting kits
  • Quick answers
REF
  • All buyer guides
  • Learn local AI
  • Methodology
  • Glossary
  • Errors KB
  • Trust
EDITOR
  • About
  • Author
  • How we make money
  • Editorial policy
  • Contact
LEGAL
  • Privacy
  • Terms
  • Sitemap
MAIL · MONTHLY DIGEST
Get monthly local AI changes
Monthly recap. No spam.
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →

© 2026 runlocalai.coIndependently operated
RUNLOCALAI · v38
← Home·/apps·Productivity

Khoj CLI

Fully offline

Terminal entry into Khoj's local AI assistant. Use grep, get answers, never leave the shell.

Editorial verdict: “Best terminal companion for note-summarization workflows. Pipe-friendly.”

Productivity
Free
AGPL-3.0
★ 4.2 / 5
GitHub ★ 18,000
↗ Homepage↗ GitHub↗ Docs

Compatibility at a glance

Which runtime + OS combos this app works against. Source of truth for "will it run on my setup?"

§ Runtimes supported
ollamallama-cppopenai-compat
§ OS / platform
macoslinuxwindows
§ Hardware + model hint
Minimum VRAM
4 GB
Recommended starter model
Llama 3.1 8B Q4_K_M
→ Build the rest of the stack with /stack-builder→ Pick a GPU for this app

What it is

For solo Ollama users who live in the terminal, Khoj CLI turns your shell into a local document assistant. Pipe any text—grep output, log files, notes—into `khoj 'summarize this'` and get answers without leaving the command line. It bridges to Ollama, llama.cpp, or any OpenAI-compatible runtime, and works offline with models as small as Llama 3.1 8B Q4_K_M on 4 GB VRAM. The trade-off: you need the Khoj server running in the background, which adds a dependency not everyone wants. Best for note-summarization workflows where speed and pipe compatibility matter more than a GUI.

✓ Strengths

  • +Pipe-friendly — slots into shell workflows naturally
  • +Same local-runtime backing as Khoj proper
  • +Open-source

△ Caveats

  • −Needs Khoj server running
  • −Less discoverable than GUI

About the Productivity category

Note-taking, knowledge management, or workflow apps with AI.

§ Other productivity apps
Reor

Best AI-first note app that's actually local. Niche but well-executed.

Where to go from here

Stack Builder →

Pre-filled with this app's recommended use case + budget tier. Get the full rig + runtime + model picks.

Back to /apps →

The full directory — filter by category, runtime, OS, privacy posture, or VRAM.

Runtimes (/tools) →

What this app talks to: Ollama, vLLM, llama.cpp, MLX, LM Studio. The upstream layer.

Community benchmarks →

Did this app work for you on a specific rig? Submit the benchmark — it powers the model + hardware pages.