Last Updated: November 21, 2025
Focus Areas
| Focus |
|---|
| Detect node health thrashes via health agents |
| Use taints/tolerations to keep replacements stable |
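
As a rough illustration of the first focus area, nodes that flap between Ready and NotReady can be spotted by counting `NodeNotReady` events per node. This is a stand-in sketch built on plain Kubernetes events, not a health agent; the helper name `flapping_nodes` and its default threshold of 3 are assumptions made for the example. Events are only retained for a short time (one hour by default in upstream Kubernetes), so this covers recent history only.

```shell
#!/usr/bin/env bash
# Hypothetical helper: list nodes with at least THRESHOLD NodeNotReady
# occurrences in the events the API server still retains.
flapping_nodes() {
  local threshold="${1:-3}"
  # Emit one "<node-name> <count>" line per event; count may be empty.
  kubectl get events --all-namespaces \
    --field-selector involvedObject.kind=Node,reason=NodeNotReady \
    -o jsonpath='{range .items[*]}{.involvedObject.name}{" "}{.count}{"\n"}{end}' |
    awk -v t="$threshold" '
      # Treat a missing count as a single occurrence.
      { n[$1] += ($2 == "" ? 1 : $2) }
      END { for (k in n) if (n[k] >= t) print k }
    ' | sort
}
```

Calling `flapping_nodes 5` would print only the nodes that went NotReady five or more times in the retained window.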
Commands & Queries
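| Command | Purpose |
|---|---|
| `gcloud container node-pools update pool --cluster my-cluster --enable-autorepair` | Enable auto-repair on the node pool (add `--zone` or `--region` if no default location is configured) |
| `kubectl describe node my-node` | Inspect the node's conditions and events to find the failure |
| `gcloud compute operations list --filter="operationType~repair"` | Review repair history |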
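
The commands above can be wrapped into small helpers that enable auto-repair and then confirm the setting took effect. This is a minimal sketch: the function names and the pool, cluster, and zone arguments are placeholders introduced for illustration, and regional clusters would need `--region` in place of `--zone`.

```shell
#!/usr/bin/env bash
# Hypothetical helpers; arguments are <pool> <cluster> <zone>.

# Turn on auto-repair for an existing node pool.
enable_autorepair() {
  local pool="$1" cluster="$2" zone="$3"
  gcloud container node-pools update "$pool" \
    --cluster "$cluster" --zone "$zone" --enable-autorepair
}

# Print the node pool's current auto-repair setting.
autorepair_enabled() {
  local pool="$1" cluster="$2" zone="$3"
  gcloud container node-pools describe "$pool" \
    --cluster "$cluster" --zone "$zone" \
    --format='value(management.autoRepair)'
}
```

For example, `enable_autorepair pool my-cluster us-central1-a && autorepair_enabled pool my-cluster us-central1-a` applies the change and then reads it back.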
Summary
With auto-repair enabled, GKE monitors node health and recreates nodes that stay unhealthy, so teams avoid manual remediation.
💡 Pro Tip:
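Define taints and labels on the node pool rather than on individual nodes so that replacement nodes inherit them, and give critical pods matching tolerations so they can still be scheduled.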
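
One way to apply this tip is to declare the taint and label on the node pool itself: GKE reapplies node-pool-level taints and labels to recreated nodes, whereas taints added with `kubectl taint` are lost when a node is replaced. The sketch below assumes a hypothetical `dedicated=critical` key/value, a placeholder pod named `critical-app`, and illustrative function names; none of these come from the original.

```shell
#!/usr/bin/env bash
# Sketch only: pool/cluster/zone arguments and the dedicated=critical
# key/value are hypothetical placeholders.

# Create a node pool whose nodes always carry the label and taint,
# including nodes recreated by auto-repair.
create_critical_pool() {
  local pool="$1" cluster="$2" zone="$3"
  gcloud container node-pools create "$pool" \
    --cluster "$cluster" --zone "$zone" \
    --enable-autorepair \
    --node-labels dedicated=critical \
    --node-taints dedicated=critical:NoSchedule
}

# Apply a pod that tolerates the taint and selects the labelled nodes.
apply_critical_pod() {
  kubectl apply -f - <<'EOF'
apiVersion: v1
kind: Pod
metadata:
  name: critical-app
spec:
  nodeSelector:
    dedicated: critical
  tolerations:
    - key: dedicated
      operator: Equal
      value: critical
      effect: NoSchedule
  containers:
    - name: app
      image: registry.k8s.io/pause:3.9
EOF
}
```

The `nodeSelector` keeps the pod on the dedicated pool, while the toleration lets it land there despite the `NoSchedule` taint that keeps other workloads off.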