Blog

2 weeks ago

Terraform Drift Detection in CI — Catching Out-of-Band Changes Before They Bite

State drift is silent until a deploy fails or an outage reveals it. A scheduled plan-and-diff pipeline surfaces console hotfixes and manual edits while they're still cheap to reconcile.

Kiril Urbonas·5

last month

Terraform Module Versioning and Shared Registries

Version-pinned modules across many repos: the release process, semver discipline, and breaking-change communication that keep a shared registry sane.

Kiril Urbonas·3

Terraform Tutorial — Your First Infrastructure-as-Code Project

Provision real cloud resources with Terraform — a VPC, an S3 bucket, and an EC2 instance — using the standard init/plan/apply workflow.

Kiril Urbonas·9

Pulumi vs Terraform: What 18 Months of Production Taught Us

We ran Pulumi in TypeScript and Terraform in HCL side by side across 60+ services. Each won different categories of work. Here's the breakdown.

Kiril Urbonas·6

GCP Workload Identity Federation: Replacing Service Account Keys

We deleted every static GCP service account key in our org over six weeks. Here's the migration plan, the gotchas, and the policies we now enforce.

Kiril Urbonas·78

EKS Auto Mode: What Worked, What Broke in Our Migration

We moved a 60-node production EKS cluster to Auto Mode. Some pain points evaporated, others got harder. The cost picture is more nuanced than the marketing suggests.

Kiril Urbonas·10

Zero Trust on AWS: Lessons From Implementing IAM Identity Center

We replaced 14 long-lived IAM users with SSO and temporary credentials. Here's the migration plan, the gotchas, and the policies we now enforce.

Kiril Urbonas·7

Database Migrations Without Downtime: Patterns From Three Real Cutovers

How we shipped three schema migrations with zero customer impact. Expand-then-contract, dual-writes, and the rollback plan we never had to use — but tested anyway.

Kiril Urbonas·9

Monitoring That Actually Helps On-Call: Alerts, Dashboards, and Runbooks

We were drowning in 200 alerts a week. Most got ignored. After a quarter of triage and rework, we're at about 15 — and on-call actually responds to them.

Kiril Urbonas·12

Secrets Management in Practice: From .env Files to Vault

We had .env files in three repos, AWS keys in Slack DMs, and a Postgres password etched into a Confluence page. Cleaning it up took a sprint and changed how we think about secrets.

Kiril Urbonas·10

Terraform Modules Done Right: Lessons from Managing 50+ Services

Practical patterns for Terraform modules at scale: versioning, composition, testing, and avoiding the monolith trap.

Kiril Urbonas·10