Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.
A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.
How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.
We cut our AWS bill by 38% in a quarter. The specific changes that moved the bill, ranked by impact, with what we'd do first.
We run mostly on AWS but use GCP for specific workloads. The honest cost-benefit analysis of multi-cloud, plus the patterns that make it not awful.
A different angle on DR: the planning process — RTO/RPO conversations, dependency mapping, and what we learned about prioritizing what to recover.
Embed cost ownership in engineering: tags, budgets, and showback.
We cut our largest playbook's runtime from 14 minutes to 4 minutes. The specific changes that mattered, plus the ones that didn't.
We tried Pulumi for a quarter and went back to Terraform. Both are real options. Why we picked one and what would change our mind.