Blog

Multi-Region — Active-Active vs Active-Passive, And What We Actually Run

The architectural choice is presented as binary; the practical answer is "depends on the workload." The patterns that earn their place and the failure modes we've hit.

RAG vs Fine-Tuning — Picking the Right Tool, Honestly

They solve different problems. RAG injects knowledge; fine-tuning changes behavior. The decision criteria, the hybrid pattern, and what we'd do over.

Kiril Urbonas·7

Kubernetes NetworkPolicies in Practice

Default-deny, namespace isolation, egress control — the patterns we use, the gotchas around DNS, and where Cilium changed our calculus.

Database Sharding — The Choices We Wish We'd Made Earlier

Sharding isn't just "split the table" — the shard key choice cascades through queries, joins, rebalancing, and operations. The decisions that pay off and the ones we redid.

Incident Post-Mortems That Drive Change (Not Theater)

Most post-mortems produce a document and no follow-through. The format, the discipline, and the cultural moves that actually convert incidents into engineering improvements.

Kiril Urbonas·4

AWS Reserved Instances vs Savings Plans vs Spot — When Each Fits

Three discounting mechanisms, three different commitments. The rules of thumb we use to pick, and the mistakes we made before settling on them.

Kiril Urbonas·2

Linux Network Debugging — tcpdump, ss, and eBPF in Anger

When the service is slow and the network is suspect, these are the tools we reach for, in this order, with the exact flags that find the answer.

Kiril Urbonas·5

••0 months ago

LLM Cost Optimization in Production — What Actually Moves the Bill

Token caching, model routing, prompt compression, and the boring discipline of measuring. The levers that cut our LLM bill 60% without touching feature scope.

Kiril Urbonas·2

••0 months ago

Postgres Logical Replication for Zero-Downtime Major Upgrades

pg_upgrade is fast but takes downtime; logical replication lets you cut over while the old DB still serves traffic. The runbook, the gotchas, and the post-cutover checklist.

••last month

Kubernetes HPA and VPA — Tuning From Production Pain

Horizontal and vertical autoscalers solve different problems and break in different ways. The thresholds, cooldowns, and conflicts we learned the hard way.

Kiril Urbonas·6

••last month

MLOps — Model Registry vs MLflow Tracking, And When You Need Both

Tracking experiments and shipping models are different problems. The MLOps tooling assumes one solution; production splits them. The patterns we use.

Kiril Urbonas·9