_d
devops/ness
Blog
Reading ListAbout
Subscribe

Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

Tag: #reliabilityClear filters
Infrastructure Documentation as Code: How One Platform Team Reduced Audit Fire Drills
••last month

Infrastructure Documentation as Code: How One Platform Team Reduced Audit Fire Drills

This infrastructure documentation as code guide shows how a platform team moved runbooks, ownership maps, and architecture decisions into versioned workflows that people actually trusted.

KU
Kiril urbonas
Read article
Linux Patch Management for Production Fleets: A Real-World Maintenance Workflow
••last month

Linux Patch Management for Production Fleets: A Real-World Maintenance Workflow

A production-tested Linux patch management workflow for teams that need security fixes without turning every maintenance window into a gamble.

KU
Kiril urbonas
Read article
GitHub Actions Monorepo CI: How We Cut Build Times Without Breaking Main
••last month

GitHub Actions Monorepo CI: How We Cut Build Times Without Breaking Main

A practical GitHub Actions monorepo CI guide built around a real scaling problem: long queues, noisy failures, and developers waiting 40 minutes for feedback.

KU
Kiril urbonas
Read article
End-of-Week Engineering: Why Smart Tech Teams Don’t Ship Major Changes on Friday
••last month

End-of-Week Engineering: Why Smart Tech Teams Don’t Ship Major Changes on Friday

A practical risk-management framework for release timing, Friday deployment policies, progressive delivery, and how elite teams protect reliability and people.

KU
Kiril Urbonas
Read article
SRE Error Budgets in Practice: Shipping Fast Without Burning Reliability
••last month

SRE Error Budgets in Practice: Shipping Fast Without Burning Reliability

A practical way to define SLOs and error budgets, connect them to release decisions, and avoid reliability debates without data.

KU
Kiril Urbonas
Read article
Page 2 of 2 · 17 posts
Previous
12