How a one-afternoon lab polish turned into two working days of autograder integration, repo-split engineering, and an insight about why a failing test is actually the proof it works.