Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

June 24, 2025

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

Kiril Urbonas·6

June 22, 2025

What We Learned Running Weekly Game Days on Our CI/CD Pipeline

Practical game day scenarios for CI/CD: broken rollbacks, permission issues, and slow feedback loops—and how we fixed them.

Kiril Urbonas·7

June 21, 2025

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·4

June 20, 2025

How We Stopped Terraform Drift from Surprising On-Call

A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.

Kiril Urbonas·3

June 18, 2025

Systemd Tricks We Use to Keep Services Boring

Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.

Kiril Urbonas·4

June 17, 2025

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

Kiril Urbonas·5

June 16, 2025

What We Learned Running Weekly Game Days on Our CI/CD Pipeline

Practical game day scenarios for CI/CD: broken rollbacks, permission issues, and slow feedback loops—and how we fixed them.

Kiril Urbonas·2

June 14, 2025

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·5

June 13, 2025

How We Stopped Terraform Drift from Surprising On-Call

A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.

Kiril Urbonas·2

June 12, 2025

Best Practices: Kernel and Package Patch Management

We had four different patch cadences across our fleet and routinely missed CVEs by weeks. Here is the unified workflow that finally let us catch up.

Kiril Urbonas·7

June 11, 2025

Docker Security Best Practices: Images, Runtime, and Supply Chain

How to harden container images and the runtime: image scanning, minimal base images, and supply chain security.

Kiril Urbonas·12

June 10, 2025

Systemd Tricks We Use to Keep Services Boring

Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.

Kiril Urbonas·7
