SRE On-call Cheat Sheet

Rotations, alerts, and documentation

Last Updated: November 21, 2025

Rotation Basics

Topic Advice
Rotation length 1-2 weeks
Handoff 20-min overlap
Escalations Defined responders

Alert Hygiene

Set severity threshold
Avoid noisy alerts
PagerDuty rules
Handle retries
Auto-resolve
Use CI to close alerts

Playbooks

Link runbooks inside alerts, document rollback steps, and store knowledge in searchable docs.

💡 Pro Tip: Keep on-call rotation short and document runbooks next to services.
← Back to DevOps & Cloud | Browse all categories | View all cheat sheets