Phase 1 MVP
Lesson
Create a small but representative dataset you can trust for regressions.
## Benchmarks
Curated edge cases beat large but noisy evaluation sets.