Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.
We cut LLM inference cost 47% over a quarter while improving p95 latency. Six changes, ranked by what each one actually delivered.
A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.
How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.
Blue/green sounds simple until your green cluster has a memory leak and you've already sent 50% of traffic there. The guardrails are what make it safe.