AI Ops Cheat Sheet

Last Updated: November 21, 2025

Anomaly Signals

Signal             Use
Time-series drift  Detect shifting baselines
Log entropy        Notice unusual verbosity
Topology changes   Spot new dependencies failing
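
A minimal sketch of how each signal might be scored, using only the standard library. The function names, the z-score style drift measure, and the token-level entropy are illustrative assumptions, not a prescribed method.

```python
import math
import statistics
from collections import Counter


def drift_score(baseline, recent):
    """How many baseline standard deviations the recent mean has moved."""
    mu = statistics.fmean(baseline)
    sigma = statistics.pstdev(baseline)
    recent_mu = statistics.fmean(recent)
    if sigma == 0:
        # Flat baseline: any movement at all counts as unbounded drift.
        return 0.0 if recent_mu == mu else math.inf
    return abs(recent_mu - mu) / sigma


def token_entropy(log_lines):
    """Shannon entropy (bits) of the whitespace-separated token distribution."""
    counts = Counter(tok for line in log_lines for tok in line.split())
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def new_dependencies(known_edges, current_edges):
    """Edges (caller, callee) present now but absent from the known topology."""
    return set(current_edges) - set(known_edges)
```

A rising drift_score, a jump in token_entropy against its usual range, or a non-empty new_dependencies result would each be a candidate anomaly to check against service health.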

Automation Tasks

runbook trigger  Kick off mitigation workflow
auto-remediate   Apply known fix (restart, redeploy)
notify-telegram  Inform SRE channel with context
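
One way the three tasks could be wired together is a small dispatch table, sketched below. The handler bodies, the alert fields (service, symptom, summary), and the crash-loop rule are hypothetical placeholders; real handlers would call a workflow engine, an orchestrator, and a messaging API.

```python
from typing import Callable, Dict, List

Alert = Dict[str, str]


def runbook_trigger(alert: Alert) -> str:
    # Placeholder: a real handler would start the mitigation workflow.
    return f"runbook started for {alert['service']}"


def auto_remediate(alert: Alert) -> str:
    # Placeholder rule: a crash loop maps to a restart, anything else to a redeploy.
    action = "restart" if alert["symptom"] == "crash-loop" else "redeploy"
    return f"{action} {alert['service']}"


def notify_telegram(alert: Alert) -> str:
    # Placeholder: builds the message text only; sending is left out.
    return f"[{alert['service']}] {alert['symptom']}: {alert['summary']}"


# Task names follow the list above; the mapping itself is an assumption.
TASKS: Dict[str, Callable[[Alert], str]] = {
    "runbook trigger": runbook_trigger,
    "auto-remediate": auto_remediate,
    "notify-telegram": notify_telegram,
}


def run_tasks(alert: Alert, task_names: List[str]) -> List[str]:
    """Run the named tasks in order and collect their results."""
    return [TASKS[name](alert) for name in task_names]
```

Keeping the handlers behind a name lookup lets a detection rule list the tasks it wants without knowing how each one is carried out.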

Knowledge Capture

Record incidents with root causes and remediation steps to train the next model.
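
A hedged sketch of what a captured incident could look like as one JSON line per record. The IncidentRecord fields are assumptions chosen to mirror the sentence above (signal, root cause, remediation steps); adapt them to whatever the training pipeline expects.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class IncidentRecord:
    incident_id: str
    signal: str                      # which anomaly signal fired
    root_cause: str
    remediation_steps: List[str] = field(default_factory=list)


def to_jsonl(record: IncidentRecord) -> str:
    """Serialize one incident as a single JSON line with stable key order."""
    return json.dumps(asdict(record), sort_keys=True)


def from_jsonl(line: str) -> IncidentRecord:
    """Rebuild an incident from a line produced by to_jsonl."""
    return IncidentRecord(**json.loads(line))
```

Appending one such line per incident to a file gives a dataset that is easy to diff, audit, and load later.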

💡 Pro Tip: Blend deterministic thresholds with ML models to reduce false positives.
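
One possible way to blend the two, sketched under assumptions: a hard limit always alerts, while a softer limit alerts only when a model's anomaly score agrees. AlertPolicy, its field names, and the 0 to 1 score range are hypothetical; the score can come from any model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlertPolicy:
    soft_limit: float    # deterministic threshold that needs model agreement
    hard_limit: float    # deterministic threshold that always alerts
    score_cutoff: float  # minimum model anomaly score, assumed to lie in [0, 1]


def should_alert(value: float, anomaly_score: float, policy: AlertPolicy) -> bool:
    """Alert on a hard breach, or on a soft breach the model also flags."""
    if value >= policy.hard_limit:
        return True
    return value >= policy.soft_limit and anomaly_score >= policy.score_cutoff
```

With AlertPolicy(soft_limit=80, hard_limit=95, score_cutoff=0.7), a reading of 85 with a low model score stays quiet, which is where the false-positive reduction comes from, while a reading of 96 alerts regardless of the score.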