DataOps Orchestration Cheat Sheet

ETL pipelines, testing, and observability

Last Updated: November 21, 2025

Pipeline Stages

Stage Focus
Extract Collect from sources and CDC
Transform Clean, unify, enrich
Load Push to warehouse/lake
Test Assertions + data quality

Tool Commands

airflow dags list
List DAG inventory
dagster schedule up
Activate schedules
prefect deployment build
Package flows

Observability

Track job duration, row counts, and schema drift alerts.

💡 Pro Tip: Automate schema checks and surface lineage before prod updates.
← Back to Data Science & ML | Browse all categories | View all cheat sheets