Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

Cloud Cost Monitoring: Tracking and Optimizing AWS Spending

Building visibility into cloud costs that actually drives action. The dashboards we look at, the alerts that fire, and the queries we run.

Kiril Urbonas·5

Read article

••7 months ago

Systemd Tricks We Use to Keep Services Boring

Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.

Kiril Urbonas·5

Read article

••7 months ago

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

Kiril Urbonas·8

Read article

••8 months ago

Multi-Region Deployment: Building Resilient Cloud Applications

We run our app in two AWS regions for failover. The hard parts aren't the deployment — they're data consistency, traffic shifting, and the assumptions that break when "primary" is suddenly the wrong region.

Kiril Urbonas·13

Read article

••8 months ago

How We Stopped Terraform Drift from Surprising On-Call

A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.

Kiril Urbonas·4

Read article

••8 months ago

AWS Lambda Optimization: Reducing Costs and Improving Performance

We run ~200 Lambda functions. Cold starts, memory tuning, and the cost-vs-latency trade-offs that actually move the bill.

Kiril Urbonas·7

Read article

••8 months ago

Systemd Tricks We Use to Keep Services Boring

Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.

Kiril Urbonas·3

Read article

••8 months ago

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

Kiril Urbonas·2

Read article

••8 months ago

Multi-Region Resilience: Failover, Data, and DNS

Design for region failure. Active/passive and active/active, data replication, and failover testing.

Kiril Urbonas·11

Read article

••8 months ago

How We Stopped Terraform Drift from Surprising On-Call

A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.

Kiril Urbonas·4

Read article

••8 months ago

Systemd Tricks We Use to Keep Services Boring

Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.

Kiril Urbonas·5

Read article

••8 months ago

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

Kiril Urbonas·4

Read article

Page 7 of 16 · 188 posts