20. A/B Testing Prompts

Chapter 20 of 25 · 15 min

A/B testing compares prompt variants on live production traffic. Unlike an ad-hoc side-by-side comparison, a production A/B test must establish statistical significance, control how traffic is split between variants, and include guardrails that catch regressions.
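One common way to handle the traffic split is to hash a stable identifier so that each user always sees the same variant. The sketch below is illustrative only: the function name `assign_variant`, the experiment label `support-prompt-test`, and the 50/50 default split are assumptions, not part of this chapter's material.

```python
import hashlib


def assign_variant(user_id: str,
                   experiment: str = "support-prompt-test",  # hypothetical label
                   treatment_share: float = 0.5) -> str:
    """Deterministically map a user to 'A' (control) or 'B' (treatment)."""
    # Salting with the experiment name keeps assignments independent
    # across different experiments that share the same user IDs.
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode("utf-8")).digest()
    # Interpret the first 8 bytes as an integer and scale it into the
    # unit interval, giving an approximately uniform bucket value.
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return "B" if bucket < treatment_share else "A"


if __name__ == "__main__":
    for uid in ("user-1", "user-2", "user-3"):
        print(uid, assign_variant(uid))
```

Because the assignment depends only on the experiment label and the user ID, it needs no stored state, and lowering `treatment_share` is one simple way to limit exposure while guardrail metrics are still being watched.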

EXERCISE

Design an A/B test comparing two prompt variants for a customer support task. Calculate the required sample size for detecting a 5% improvement in resolution rate (state whether you treat this as an absolute or a relative change). Simulate running the test in Python and analyze when significance is reached.
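As a possible starting point, the sketch below uses the standard normal-approximation formula for comparing two proportions and a pooled two-sided z-test. Several values are assumptions chosen for illustration and do not come from the exercise: a baseline resolution rate of 60%, the 5% improvement read as an absolute lift to 65%, a significance level of 0.05, and 80% power. It uses only the Python standard library.

```python
import math
import random
from statistics import NormalDist


def sample_size_per_arm(p_control, p_treatment, alpha=0.05, power=0.80):
    """Per-arm sample size for a two-sided, two-proportion z-test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    p_bar = (p_control + p_treatment) / 2
    root_null = math.sqrt(2 * p_bar * (1 - p_bar))
    root_alt = math.sqrt(p_control * (1 - p_control)
                         + p_treatment * (1 - p_treatment))
    numerator = (z_alpha * root_null + z_beta * root_alt) ** 2
    return math.ceil(numerator / (p_treatment - p_control) ** 2)


def two_proportion_p_value(success_a, n_a, success_b, n_b):
    """Two-sided p-value from a pooled two-proportion z-test."""
    p_pool = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    if se == 0:
        return 1.0  # no variation at all, so no evidence of a difference
    z = (success_b / n_b - success_a / n_a) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))


def simulate(p_control, p_treatment, n_per_arm, seed=0):
    """Draw simulated resolved/unresolved outcomes for both arms."""
    rng = random.Random(seed)
    resolved_a = sum(rng.random() < p_control for _ in range(n_per_arm))
    resolved_b = sum(rng.random() < p_treatment for _ in range(n_per_arm))
    return resolved_a, resolved_b


if __name__ == "__main__":
    # Assumed rates: 60% baseline, 65% for the new prompt (absolute lift).
    n = sample_size_per_arm(0.60, 0.65)
    resolved_a, resolved_b = simulate(0.60, 0.65, n)
    p_value = two_proportion_p_value(resolved_a, n, resolved_b, n)
    print(f"required per arm: {n}")
    print(f"A: {resolved_a}/{n}  B: {resolved_b}/{n}  p-value: {p_value:.4f}")
```

Under these assumptions the formula calls for roughly 1,470 conversations per arm. The test here is evaluated once, at the planned sample size. Checking the p-value repeatedly as data arrives and stopping at the first significant result inflates the false-positive rate, which is worth keeping in mind when analyzing when significance is reached.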