Data Labeling QA Playbook Cheat Sheet

Last Updated: November 21, 2025

Focus Areas

Focus
`Document instructions and examples for each label`
`Audit labelers using golden data and consensus scoring`


         ./labeling/run_quality_checks.sh

Run QA scripts


         python scripts/compare_labels.py --batch 12

Score label agreement


         labelstudio export --task-id 1234

Export labeled artifacts

Standardize instructions, watch for drift, and feed QA insights back into models.

💡 Pro Tip: Lock instructions, include golden sets, and rotate validators.