Blog

Canary Releases: Gradual Rollout Strategy

We've run canary deploys on most services for two years. The mechanics are easy; the metrics that decide "promote or roll back" are where the design is.

Kiril Urbonas·14

Blue-Green Deployments: Zero-Downtime Releases

We use blue-green for stateful services where canary doesn't fit. The actual mechanics, the data-layer subtleties, and when blue-green isn't the right answer.

Kiril Urbonas·7

Log Aggregation Strategies: Centralizing Your Logs

We collect ~800GB of logs per day across our fleet. The shape of our logging stack, what we keep, what we drop, and what we'd build differently.

Kiril Urbonas·8

Infrastructure Monitoring with Prometheus: Complete Setup Guide

A working Prometheus stack for a 40-node cluster: what we deploy, what we tune, and what we wish we'd known about cardinality two years ago.

Kiril Urbonas·12

Docker Multi-Stage Builds: Optimizing Image Size

A focused look at the techniques that shrink container images: which actually pay off, which are folklore, and the discipline that keeps images small over time.

Kiril Urbonas·10

Kubernetes Backup Strategies: Protecting Your Cluster Data

We've had to restore a Kubernetes cluster from backup twice. Once it worked. Once it took 14 hours. Here's the strategy we run now.

Kiril Urbonas·6

Service Mesh Implementation: Istio vs Linkerd

We ran Istio for a year, then switched to Linkerd. Both can do the job. The decision came down to operational fit, not features.

Kiril Urbonas·11

Architecture Review: Python Worker Queue Scaling Patterns

We started with a single Celery worker handling everything. Eight months and three architecture changes later, here's what scaled and what we learned about queue design.

Kiril Urbonas·4

CI/CD Pipeline Optimization: Speeding Up Your Builds

We cut our average CI build time from 28 minutes to 6 minutes. The changes that mattered, ranked by impact.

Kiril Urbonas·8

Container Security Scanning: Protecting Your Docker Images

We scan every container image in CI and at runtime. Trivy + Cosign + admission controllers. The setup that earns its place and what we wish we'd known.

Kiril Urbonas·11

GitOps with ArgoCD: Automating Kubernetes Deployments

We migrated 40+ services to GitOps with Argo CD. Two years in, here's what works and what required workarounds.

Kiril Urbonas·3