Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

RAG Retrieval Quality Evaluation: The Checks We Added After Bad Answers Reached Production

A search-friendly guide to RAG retrieval quality evaluation, based on the moment one production assistant started citing stale documents and the team had to prove what 'good retrieval' meant.

Kiril Urbonas·7

Read article

••3 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·7

Read article

••4 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·10

Read article

••4 months ago

AI Best Practices in 2026: Shipping Reliable Systems, Not Demo Magic

A practical production playbook for AI systems: evaluation gates, guardrails, observability, cost control, and reliable release management.

Kiril Urbonas·31

Read article

••4 months ago

AI Best Practices for Engineering Teams: From Prompt Experiments to Platform Discipline

A practical field manual for engineering teams who want AI features that survive real users, incidents, and budgets — not just demo day.

Kiril Urbonas·30

Read article

••4 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·3

Read article

••4 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·5

Read article

••5 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·3

Read article

••5 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·3

Read article

••5 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·4

Read article

••6 months ago

Prompt Engineering for DevOps: Consistency and Safety

Use prompts to get reliable, safe outputs from LLMs for runbooks, code, and ops tasks.

Kiril Urbonas·9

Read article

••6 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

Kiril Urbonas·5

Read article

Page 4 of 10 · 111 posts