10. Prune-Distill-Quantize Pipeline

Chapter 10 of 18 · 20 min
EXERCISE

Implement a three-stage compression pipeline: apply magnitude pruning at 50% sparsity, then distill from the unpruned teacher, then quantize to int8. Measure the cumulative accuracy loss at each stage.