Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

Category: infrastructureClear filters

Monitoring That Actually Helps On-Call: Alerts, Dashboards, and Runbooks

We were drowning in 200 alerts a week. Most got ignored. After a quarter of triage and rework, we're at about 15 — and on-call actually responds to them.

Kiril Urbonas·11

Read article

••3 months ago

Terraform Modules Done Right: Lessons from Managing 50+ Services

Practical patterns for Terraform modules at scale: versioning, composition, testing, and avoiding the monolith trap.

Kiril Urbonas·10

Read article

••3 months ago

Terraform Module Version Pinning: How One Platform Team Stopped Surprise Breakage

A real-world Terraform module version pinning guide for platform teams that want safer upgrades, clearer ownership, and fewer broken pipelines after shared module releases.

Kiril Urbonas·6

Read article

••3 months ago

Terraform State Isolation by Environment: How We Stopped One Change from Hitting Prod

A practical Terraform state isolation guide built from a real environment-mixing incident, with patterns for safer backends, clearer ownership, and lower blast radius.

Kiril Urbonas·11

Read article

••3 months ago

Infrastructure Documentation as Code: How One Platform Team Reduced Audit Fire Drills

This infrastructure documentation as code guide shows how a platform team moved runbooks, ownership maps, and architecture decisions into versioned workflows that people actually trusted.

Kiril Urbonas·8

Read article