Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.
RAG Retrieval Quality Evaluation. Practical guidance for reliable, scalable platform operations.
Prompt Versioning and Regression Testing.
LLM Gateway Design for Multi-Provider Inference.
Learn how to fine-tune LLMs such as Llama 2, Mistral, and GPT for your specific use case, covering LoRA, QLoRA, and full fine-tuning techniques.
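The teaser above mentions LoRA. As a quick, simplified illustration of the core idea (a hypothetical pure-Python sketch, not code from the linked article): instead of updating a full weight matrix W of shape d×k, LoRA trains two small matrices B (d×r) and A (r×k) and adds their scaled product to W, so only r·(d+k) parameters are trained.

```python
# Minimal LoRA sketch: the effective weight is W + (alpha / r) * (B @ A).
# B is initialized to zero, so at the start of training the adapter is a
# no-op and the model behaves exactly like the frozen base model.
import random

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    rows, inner, cols = len(X), len(Y), len(Y[0])
    return [[sum(X[i][t] * Y[t][j] for t in range(inner)) for j in range(cols)]
            for i in range(rows)]

def lora_merge(W, A, B, alpha, r):
    """Return the merged weight W + (alpha / r) * (B @ A)."""
    delta = matmul(B, A)
    scale = alpha / r
    return [[W[i][j] + scale * delta[i][j] for j in range(len(W[0]))]
            for i in range(len(W))]

d, k, r, alpha = 4, 4, 2, 4
random.seed(0)
W = [[random.gauss(0, 1) for _ in range(k)] for _ in range(d)]     # frozen base weight
B = [[0.0] * r for _ in range(d)]                                  # zero-initialized
A = [[random.gauss(0, 0.1) for _ in range(k)] for _ in range(r)]   # small random init

merged = lora_merge(W, A, B, alpha, r)
# With B all zeros, the merged weights equal W exactly.
print(merged == W)
```

QLoRA applies the same low-rank update on top of a base model whose frozen weights are stored in 4-bit precision, which is what makes fine-tuning large models feasible on a single GPU.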
A deep dive into managing stateful LLM workloads, scaling inference endpoints, and optimizing GPU utilization in a cloud-native environment.
Optimization techniques such as LoRA and 4-bit quantization that let you run state-of-the-art models locally.
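To make the 4-bit quantization mentioned above concrete, here is a toy round-trip sketch (an illustrative assumption, not the article's implementation): each float is mapped to one of 16 signed integer levels with a single scale factor. Real schemes such as NF4 used by QLoRA use smarter codebooks and per-block scales, but the core idea is the same.

```python
# Toy symmetric 4-bit quantization: map floats to integer codes in
# [-8, 7], keep one scale factor, then reconstruct approximate values.

def quantize_4bit(values):
    """Quantize floats to signed 4-bit integer codes plus a scale."""
    scale = max(abs(v) for v in values) / 7  # 7 = largest positive level
    q = [max(-8, min(7, round(v / scale))) for v in values]
    return q, scale

def dequantize_4bit(q, scale):
    """Reconstruct approximate floats from the 4-bit codes."""
    return [code * scale for code in q]

weights = [0.12, -0.98, 0.43, 0.07, -0.31, 0.88]
q, scale = quantize_4bit(weights)
restored = dequantize_4bit(q, scale)
max_err = max(abs(w, ) if False else abs(w - r) for w, r in zip(weights, restored))
print(q)        # each code fits in 4 bits: 16 weights pack into 8 bytes
print(max_err)  # rounding error is bounded by roughly scale / 2
```

Each code needs only 4 bits instead of 16 or 32, cutting weight memory by 4-8x at the cost of a small, bounded reconstruction error.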