Data Engineering Pipelines Cheat Sheet

Last Updated: November 21, 2025

ETL/ELT Stages

Stage Focus
Extract Capture from APIs, CDC, or logs
Transform Clean, normalize, join
Load Push into warehouse/lake

Orchestration Commands

airflow dags list
Show DAG inventory
dagster ui
Launch development UI
prefect deployment build
Package a flow for deployment

Monitoring Signals

Track job duration, success rates, row counts, and schema drift alerts.

💡 Pro Tip: Codify schema checks post-load and emit row counts for drift detection.
← Back to Data Science & ML | Browse all categories | View all cheat sheets