Node upgrades, autoscaler scale-downs, and spot reclaims all drain nodes. Without PDBs they can take all your replicas at once. The budgets, probes, and graceful-shutdown handling that keep voluntary disruptions invisible to users.

On this page

Kubernetes Pod Disruption Budgets — Surviving Node Drains Without an Outage

The first time a cluster upgrade took down a "highly available" service, we learned what PodDisruptionBudgets are for. Three replicas, spread across three nodes, looked redundant — until the node-pool upgrade drained all three nodes in quick succession and every replica went down at once. The deployment said 3/3; reality said 0 serving. Pod Disruption Budgets are how you tell Kubernetes "you may disrupt my pods, but not all of them at once."

Voluntary vs involuntary disruption #

PDBs protect against voluntary disruptions — the ones Kubernetes initiates and can be asked to slow down:

kubectl drain during node upgrades
Cluster Autoscaler / Karpenter scaling a node down
Spot/preemptible node reclamation handled gracefully

They do not protect against involuntary disruptions — a kernel panic, a hardware failure, an OOM kill. Those just happen. PDBs are a contract with the eviction API, and only voluntary disruptions go through it.

The budget #

yaml.yaml

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: web-pdb
spec:
  minAvailable: 2          # never let fewer than 2 pods be available
  selector:
    matchLabels:
      app: web

When something tries to drain a node, the eviction API checks the PDB. If evicting a web pod would drop available replicas below 2, the eviction is refused and the drain blocks until a replacement pod is Ready elsewhere. The drain proceeds one pod at a time, waiting for recovery between each — exactly the rolling behavior you wanted.

minAvailable vs maxUnavailable #

Two ways to express the same budget; pick by what stays stable as you scale:

yaml.yaml

minAvailable: 2        # absolute floor — but means 50% at 4 replicas, 20% at 10
# vs
maxUnavailable: 1      # at most 1 down at a time, regardless of replica count

Use maxUnavailable: 1 for most stateless services — it scales naturally and clearly says "drain one at a time."
Use minAvailable as a percentage (minAvailable: 80%) when you need a capacity floor to handle load, not just availability.

Never set minAvailable equal to the replica count. minAvailable: 3 on a 3-replica deployment means no pod can ever be voluntarily evicted — the drain blocks forever and your node upgrade hangs. We did this once and wondered why a cluster upgrade stalled for an hour.

PDBs are necessary, not sufficient #

A PDB controls how many pods go down at once. It does nothing about whether each individual pod shutdown is graceful. You also need:

Readiness probes that mean it. The PDB counts a pod as "available" when it's Ready. If your readiness probe goes green before the app can actually serve, the PDB lets the next eviction proceed into a pod that isn't really ready. The budget is only as honest as the probe.

Graceful shutdown. On eviction, Kubernetes sends SIGTERM, waits terminationGracePeriodSeconds, then SIGKILL. The pod must use that window: stop accepting new connections, drain in-flight requests, then exit.

yaml.yaml

lifecycle:
  preStop:
    exec:
      command: ["sh", "-c", "sleep 5"]   # let endpoints propagate removal
terminationGracePeriodSeconds: 30

The preStop sleep matters more than it looks: pod termination and Service endpoint removal are concurrent, not ordered. Without the brief sleep, the pod can receive SIGTERM and start shutting down while the load balancer still routes new requests to it — connections refused, errors to users. The sleep holds the pod alive long enough for endpoint removal to propagate.

The gotcha: PDBs can block legitimate operations #

A PDB that's too strict blocks the very operations it's meant to make safe. If a deployment is already degraded (one pod crashlooping) and the PDB requires minAvailable: 3 of 3, a node drain can't make progress — you're stuck. Leave headroom: run enough replicas that the PDB permits at least one eviction even during a partial outage. Our rule: replicas ≥ minAvailable + 1, always, so there's room to drain even when something's already wrong.

What it bought us #

Cluster and node-pool upgrades became boring — pods relocate one at a time, users see nothing.
Cluster Autoscaler scale-downs stopped causing latency blips during off-peak consolidation.
Spot reclaim (with a 2-minute notice handler that cordons and drains) stopped dropping requests.

PDBs are cheap to add and easy to get subtly wrong. The pattern that works: maxUnavailable: 1 (or a percentage floor), honest readiness probes, a preStop drain delay, and always one more replica than the budget requires. Then voluntary disruptions — which happen constantly in a healthy cluster — stay invisible to the people using your service.

Kubernetes Pod Disruption Budgets — Surviving Node Drains Without an Outage

Kubernetes Pod Disruption Budgets — Surviving Node Drains Without an Outage

Voluntary vs involuntary disruption #

The budget #

minAvailable vs maxUnavailable #

PDBs are necessary, not sufficient #

The gotcha: PDBs can block legitimate operations #

What it bought us #

Stay Updated

Terraform Drift Detection in CI — Catching Out-of-Band Changes Before They Bite

Linux Memory Pressure — Reading PSI Before the OOM Killer Reads You

More from DevOps

Alert on Symptoms, Not Causes — SLO Burn-Rate Alerting in Practice

CI Pipeline Caching That Actually Pays Off

Kubernetes NetworkPolicies in Practice

Alert on Symptoms, Not Causes — SLO Burn-Rate Alerting in Practice

CI Pipeline Caching That Actually Pays Off

Kubernetes NetworkPolicies in Practice

Incident Post-Mortems That Drive Change (Not Theater)

Prompt Caching for Production LLM Apps — Cutting Cost and Latency at the Token Layer

LLM Output Validation — Schema-Constrained Generation in Production

About Kiril Urbonas

You might have missed

Prompt Engineering Best Practices: Maximizing LLM Performance

Process Management and Monitoring in Linux

AI Agents in DevOps: From Copilots to Autonomous Automation in 2025

Kubernetes Pod Disruption Budgets — Surviving Node Drains Without an Outage

Voluntary vs involuntary disruption#

The budget#

minAvailable vs maxUnavailable#

PDBs are necessary, not sufficient#

The gotcha: PDBs can block legitimate operations#

What it bought us#

Stay Updated

Terraform Drift Detection in CI — Catching Out-of-Band Changes Before They Bite

Linux Memory Pressure — Reading PSI Before the OOM Killer Reads You

More from DevOps

Alert on Symptoms, Not Causes — SLO Burn-Rate Alerting in Practice

CI Pipeline Caching That Actually Pays Off

Kubernetes NetworkPolicies in Practice

About Kiril Urbonas

You might have missed

Prompt Engineering Best Practices: Maximizing LLM Performance

Process Management and Monitoring in Linux

AI Agents in DevOps: From Copilots to Autonomous Automation in 2025

Voluntary vs involuntary disruption #

The budget #

minAvailable vs maxUnavailable #

PDBs are necessary, not sufficient #

The gotcha: PDBs can block legitimate operations #

What it bought us #