Blog

last week

Cloud IAM Least-Privilege Without Breaking Everything

Least privilege fails when it's a one-time audit that locks things down until something breaks, then gets reverted. The iterative, log-driven approach that tightens permissions safely — and the policies we stopped writing by hand.

2 weeks ago

Edge Caching with Stale-While-Revalidate — Fast and Fresh at the CDN

The Cache-Control directives most teams under-use. How stale-while-revalidate and stale-if-error turned our CDN from a freshness liability into a latency and resilience win, with the gotchas.

3 weeks ago

Multi-Region — Active-Active vs Active-Passive, And What We Actually Run

The architectural choice is presented as binary; the practical answer is "depends on the workload." The patterns that earn their place and the failure modes we've hit.

3 weeks ago

AWS Reserved Instances vs Savings Plans vs Spot — When Each Fits

Three discounting mechanisms, three different commitments. The rules of thumb we use to pick, and the mistakes we made before settling on them.

Kiril Urbonas·2

Caching Patterns — Read-Through, Write-Through, Cache-Aside in Practice

Three caching patterns, three failure modes. The one we use most, the one that bit us, and the rule that decides which pattern fits which workload.

Kiril Urbonas·10

Kubernetes Resource Requests — Right-Sizing Without Guessing

Bad resource requests waste money or trigger OOMs. The methodology we use to right-size requests based on actual usage, and the gotchas the autoscalers don't fix.

Kiril Urbonas·2

Edge Databases for Low-Latency Apps — D1, Turso, Neon Serverless

Edge compute is useless without an edge data layer. Three serverless databases that put data within milliseconds of your edge functions, with the tradeoffs that aren't on the marketing pages.

Kiril Urbonas·40

Cross-Cloud Identity Federation — Patterns That Replaced Our Long-Lived Keys

OIDC federation between AWS, GCP, and CI providers let us delete every long-lived cloud credential we had. The setup, the gotchas, and the trust-relationship discipline.

Kiril Urbonas·1

CDN Cache Invalidation — Strategies That Don't Break in Production

"There are two hard problems in computer science." We've worked on the cache-invalidation one for a while. The patterns that hold up at scale and the ones that look clean and aren't.

2 months ago

AWS Step Functions for Workflow Orchestration

We use Step Functions for batch processing, document ingestion, and a few agentic workflows. The patterns that work, the limits we hit, and where we'd reach for something else.

Kiril Urbonas·5

2 months ago

Karpenter — Node Provisioning Patterns at Scale

After two years of running Karpenter on production EKS clusters, the NodePool patterns that survived, the ones we replaced, and the tuning that matters.

Kiril Urbonas·5