On-Call Without Burnout: Rotations, Runbooks, and Escalation

Our best engineer quit, citing on-call. We rebuilt the whole thing: saner rotations, runbooks that actually help at 3am, and escalation that doesn't punish asking for help.

DevOps, SRE, On-Call, Reliability

Kiril Urbonas, DevOps Engineer

Jul 7, 2026
