Guides ยท Business
Data Quality Incident Playbook
Respond to bad data fast
When data breaks, triage severity/blast radius, pause downstream jobs if needed, communicate status, fix root cause, backfill or correct data, and add tests/alerts to prevent repeats.
- data quality
- incidents
- backfill
- triage
- communications
Triage
Assess blast radius, freshness, and impacted consumers.
Communicate
Notify owners/consumers; set update cadence.
Fix and Backfill
Repair cause, backfill/correct data, and add guards/tests.