Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

OIDC Federation for GitHub Actions to AWS: Killing Long-Lived Keys

We rotated a leaked AWS access key that a workflow had committed to logs. Switching GitHub Actions to OIDC federation meant no static AWS keys exist to leak in the first place.

Kiril Urbonas·1

Read article

••5 days ago

Guardrails for Production LLMs: Input and Output Filtering That Holds

A user got our support bot to recite its system prompt and then draft a refund it wasn't authorized to give. Two layers of guardrails, one on input, one on output, closed both holes.

Kiril Urbonas·1

Read article

••6 days ago

Zero-Trust Service-to-Service Auth with SPIFFE and SPIRE

Static service tokens leaked into logs and never rotated. SPIFFE identities plus SPIRE-issued SVIDs gave us short-lived certs and killed the shared-secret sprawl.

Kiril Urbonas·1

Read article

••6 days ago

Zero-Downtime Postgres Migrations: Expand-Contract in Practice

A single ALTER TABLE took a lock and stalled every write for 40 seconds during peak traffic. Expand-contract is how we stopped shipping outages.

Kiril Urbonas·1

Read article

••6 days ago

Reranking in RAG: When a Cross-Encoder Earns Its Latency

Our RAG answers kept citing the wrong paragraph even when the right one was retrieved. A cross-encoder reranker fixed relevance but added 180ms. Here's when that trade pays off.

Kiril Urbonas·1

Read article

••6 days ago

Postgres Read Replicas: Routing Reads Without Stale-Data Bugs

Adding a read replica cut primary load 60%, then support tickets rolled in about users not seeing their own edits. Replication lag turned into a correctness bug we had to route around.

Kiril Urbonas·1

Read article

••6 days ago

Linux TCP Tuning for High-Throughput Services

Our proxy topped out at 40k connections while the CPU sat half-idle. The bottleneck was kernel defaults tuned for 2009, not the hardware.

Kiril Urbonas·1

Read article

••6 days ago

Distroless Docker Images: Smaller, Safer Production Containers

Our node image shipped 240 CVEs, most from OS packages we never called. Moving to distroless dropped the count to single digits and cut image size by 70%.

Kiril Urbonas·1

Read article