devops/ness
Featured Article

Real-World RAG Incidents: Lessons from a Production Rollout

A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.

AI, LLM, GPT, Python
Kiril Urbonas, DevOps Engineer and AI Enthusiast | Mar 10, 2026

Topics

Monitoring (280), Terraform (207), AWS (166), Kubernetes (124), Python (111), Security (107), CI/CD (103), LLM (97), Ansible (95), Linux (95)

Latest Articles

Production Playbook: Linux Performance Baseline Methodology
August 15, 2024

Linux Performance Baseline Methodology. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Production Playbook: Cloud Disaster Recovery Runbook Design
August 10, 2024

Cloud Disaster Recovery Runbook Design. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Page 38 of 44 · 518 posts

Production Playbook: AWS Cost Control with Tagging and Budgets
August 6, 2024

AWS Cost Control with Tagging and Budgets. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Production Playbook: Ansible Role Design for Large Teams
August 3, 2024

Ansible Role Design for Large Teams. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Production Playbook: Terraform State Isolation by Environment
July 30, 2024

Terraform State Isolation by Environment. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Production Playbook: GitHub Actions Pipeline Reliability
July 26, 2024

GitHub Actions Pipeline Reliability. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Production Playbook: Docker Image Hardening for Production
July 22, 2024

Docker Image Hardening for Production. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Production Playbook: Kubernetes Cluster Upgrade Strategy
July 18, 2024

Kubernetes Cluster Upgrade Strategy. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: AI Inference Cost Optimization
July 15, 2024

AI Inference Cost Optimization. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: SLO-Based Monitoring for APIs
July 11, 2024

SLO-Based Monitoring for APIs. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Secure Container Supply Chain Controls
July 7, 2024

Secure Container Supply Chain Controls. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Infrastructure Documentation as Code
July 2, 2024

Infrastructure Documentation as Code. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read

© 2026 DevOpsNess. By Kiril Urbonas.
