07. Rate Limiting
Chapter 7 of 18 · 20 min
Local verification checkpoint
Run the smallest example from this chapter in a local workspace and record the package version, runtime, data path, and observed output. If the result depends on model size, vector count, CPU/GPU backend, or available memory, note that constraint beside the exercise so the lesson remains reproducible.
EXERCISE
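Implement a rate limiter that allows 10 requests per minute per API key. Verify that the first 10 requests from a single key within one minute succeed and that the 11th receives an HTTP 429 (Too Many Requests) response. Because the limit is per key, also confirm that a request from a different API key still succeeds.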
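One possible starting point for this exercise is a sliding-window log kept in memory. The sketch below is not a reference solution: the class name `SlidingWindowLimiter`, the `check` method, and the injectable `clock` parameter are all hypothetical names chosen for illustration, and the limiter returns a bare status code instead of plugging into a particular web framework. It assumes a single process and no concurrent access. A fixed-window counter or a token bucket would also satisfy the exercise, with different behavior at window boundaries.

```python
import time
from collections import defaultdict, deque
from typing import Callable, Deque, Dict


class SlidingWindowLimiter:
    """Hypothetical in-memory limiter: at most `limit` requests per window per key."""

    def __init__(
        self,
        limit: int = 10,
        window_seconds: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.limit = limit
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so tests do not have to sleep
        self._hits: Dict[str, Deque[float]] = defaultdict(deque)

    def check(self, api_key: str) -> int:
        """Return 200 if the request is allowed, 429 if the key is over its limit."""
        now = self.clock()
        hits = self._hits[api_key]
        # Discard timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.limit:
            return 429
        hits.append(now)
        return 200


# Drive the limiter with a fake clock so the check is deterministic.
fake_now = [0.0]
limiter = SlidingWindowLimiter(clock=lambda: fake_now[0])

statuses = [limiter.check("key-a") for _ in range(11)]
other_key_status = limiter.check("key-b")  # a separate key has its own budget

fake_now[0] = 60.0  # one full window later, the earlier hits have expired
after_window_status = limiter.check("key-a")
```

Here `statuses` holds ten 200s followed by a single 429, while `other_key_status` and `after_window_status` are both 200. Passing the clock as a parameter is a design choice for testability. In a real service the default `time.monotonic` would apply, and shared state would need a lock or an external store if several workers handle requests.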