_d
devops/ness
Blog
Reading ListAbout
Subscribe

Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

Tag: #llmClear filters
Prompt Engineering Patterns That Actually Work in Production
••last week

Prompt Engineering Patterns That Actually Work in Production

Battle-tested prompt patterns from running LLM features in production: structured output, chain-of-thought, and graceful failure handling.

KU
Kiril urbonas
Read article
Model Fallback Policies for Customer-Facing AI: The Routing Rules That Kept SLA Intact
••last week

Model Fallback Policies for Customer-Facing AI: The Routing Rules That Kept SLA Intact

A real-world model fallback guide for customer-facing AI systems, covering how one team preserved response quality and support SLAs during a partial provider degradation.

KU
Kiril urbonas
Read article
Embedding Model Upgrades Without Search Chaos: A Safer RAG Rollout Pattern
••2 weeks ago

Embedding Model Upgrades Without Search Chaos: A Safer RAG Rollout Pattern

A practical embedding model upgrade guide for RAG systems, built from a real support-search migration that initially reduced answer quality instead of improving it.

KU
Kiril urbonas
Read article
Prompt Versioning and Regression Testing: How Teams Avoid Silent AI Regressions
••2 weeks ago

Prompt Versioning and Regression Testing: How Teams Avoid Silent AI Regressions

A real-world guide to prompt versioning and regression testing for production AI features, focused on preventing the subtle changes that hurt quality long before anyone notices.

KU
Kiril urbonas
Read article
RAG Retrieval Quality Evaluation: The Checks We Added After Bad Answers Reached Production
••3 weeks ago

RAG Retrieval Quality Evaluation: The Checks We Added After Bad Answers Reached Production

A search-friendly guide to RAG retrieval quality evaluation, based on the moment one production assistant started citing stale documents and the team had to prove what 'good retrieval' meant.

KU
Kiril urbonas
Read article
Real-World RAG Incidents: Lessons from a Production Rollout
••0 months ago

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

KU
Kiril urbonas
Read article
Real-World RAG Incidents: Lessons from a Production Rollout
••last month

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

KU
Kiril urbonas
Read article
AI Best Practices in 2026: Shipping Reliable Systems, Not Demo Magic
••last month

AI Best Practices in 2026: Shipping Reliable Systems, Not Demo Magic

A practical production playbook for AI systems: evaluation gates, guardrails, observability, cost control, and reliable release management.

KU
Kiril Urbonas
Read article
AI Best Practices for Engineering Teams: From Prompt Experiments to Platform Discipline
••last month

AI Best Practices for Engineering Teams: From Prompt Experiments to Platform Discipline

A practical field manual for engineering teams who want AI features that survive real users, incidents, and budgets — not just demo day.

KU
Kiril Urbonas
Read article
Operational Checklist: AI Inference Cost Optimization
••last month

Operational Checklist: AI Inference Cost Optimization

AI Inference Cost Optimization. Practical guidance for reliable, scalable platform operations.

KU
Kiril Urbonas
Read article
Real-World RAG Incidents: Lessons from a Production Rollout
••last month

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

KU
Kiril urbonas
Read article
Real-World RAG Incidents: Lessons from a Production Rollout
••last month

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

KU
Kiril urbonas
Read article
Page 1 of 9 · 102 posts
12...9
Next