05. Streaming with SSE
Chapter 5 of 18 · 20 min
Local verification checkpoint
Run the smallest example from this chapter in a local workspace and record the package version, runtime, data path, and observed output. If the result depends on model size, vector count, CPU/GPU backend, or available memory, note that constraint beside the exercise so the lesson remains reproducible.
EXERCISE
Implement a streaming endpoint that yields chunks with a 100ms delay between each word of a static sentence. Use curl or a browser to verify the chunks arrive progressively rather than all at once.