16. Caching Layer
Chapter 16 of 18 · 15 min
Local verification checkpoint
Run the smallest example from this chapter in a local workspace and record the package version, runtime, data path, and observed output. If the result depends on model size, vector count, CPU/GPU backend, or available memory, note that constraint beside the exercise so the lesson remains reproducible.
EXERCISE
Implement a cache invalidation mechanism that removes entries for a specific model. Add a DELETE /v1/cache/{model} endpoint that clears all cached completions for the specified model.