05. Visual Question Answering
Chapter 5 of 18 · 15 min
Local verification checkpoint
Run the smallest example from this chapter in a local workspace and record the package version, runtime, data path, and observed output. If the result depends on model size, vector count, CPU/GPU backend, or available memory, note that constraint beside the exercise so the lesson remains reproducible.
EXERCISE
Build an interactive VQA script that maintains conversation history and allows multi-turn dialogue on a single image. Test edge cases like yes/no questions, counting, and spatial relationships.