Phase 1 MVP

Build career momentum with visible, repeatable progress.

Single-user private mode

Exercise

Calculate judge agreement on evaluation labels

Implement this task with explicit validation, predictable output shape, and enough error handling that it could survive reuse in a real AI workflow.

Evaluation · hard

Attempt history

Recent submissions

Use notes to make weaknesses explicit and repeatable.

No attempts yet.