RUNLOCALAIv38
->Will it run?Best GPUCompareTroubleshootStartLearnPulseModelsHardwareToolsBench
Run check
RUNLOCALAI

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo
DIR
  • Models
  • Hardware
  • Tools
  • Benchmarks
TOOLS
  • Will it run?
  • Compare hardware
  • Cost vs cloud
  • Choose my GPU
  • Prompting kits
  • Quick answers
REF
  • All buyer guides
  • Learn local AI
  • Methodology
  • Glossary
  • Errors KB
  • Trust
EDITOR
  • About
  • Author
  • How we make money
  • Editorial policy
  • Contact
LEGAL
  • Privacy
  • Terms
  • Sitemap
MAIL · MONTHLY DIGEST
Get monthly local AI changes
Monthly recap. No spam.
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →

© 2026 runlocalai.coIndependently operated
RUNLOCALAI · v38
  1. >
  2. Home
  3. /Learn
  4. /Courses
  5. /Local AI on Windows
  6. /Ch. 1
Local AI on Windows

01. Introduction to Local AI on Windows

Chapter 1 of 15 · 10 min
KEY INSIGHT

Windows AI setup is a three-layer problem (Windows host, WSL2 layer, and tool runtime) and most failures occur at the boundaries between layers.

Windows is not Linux. That sounds obvious, but it creates specific problems when you try to run local AI tooling that was originally built for Unix environments. Most AI runtimes—Ollama, llama.cpp, text-generation-webui—are primarily distributed as Linux binaries. On Windows you have three paths: WSL2 (Windows Subsystem for Linux 2), native Windows builds, or Docker containers. Each has tradeoffs.

WSL2 gives you a full Linux kernel running as a lightweight VM. GPU passthrough works through NVIDIA's WSL2 driver, so CUDA workloads run at near-native speed. The main inconvenience is that you are constantly switching between Windows and Linux file systems: /home/user/ inside WSL2 is not C:\Users\user\ on the Windows side, and the two sides have different line endings, path separators, and permission models.

Native Windows builds exist for some tools—Ollama ships a Windows installer, LM Studio has a Windows binary. These are easier to manage from File Explorer but harder to integrate with Docker and often miss Linux-specific optimizations. Docker Desktop on Windows runs inside a WSL2 backend by default, which means containers see the Linux environment, not the Windows environment.

The Windows-specific failure modes that bite most often: antivirus scans blocking executable files in %LOCALAPPDATA%, Windows Defender interfering with model file downloads, Hyper-V conflicting with WSL2, and memory overcommit when WSL2 and Docker both try to use 16 GB of RAM. We address all of these in subsequent chapters.

Local verification checkpoint

Run the smallest example from this chapter in a local workspace and record the package version, runtime, data path, and observed output. If the result depends on model size, vector count, CPU/GPU backend, or available memory, note that constraint beside the exercise so the lesson remains reproducible.

EXERCISE

Run wsl --status and wsl --list --verbose in PowerShell as Administrator. Note the default distribution and kernel version. If WSL is not installed, run wsl --install and reboot before continuing.

← Overview
Local AI on Windows
Chapter 2 →
WSL2 Installation and Configuration