Skip to main content

Featured Article

Feature Flags for Safe Deploys: Decoupling Release From Deploy

We used to ship code and turn it on in the same breath, so every deploy was a bet. Feature flags split those two events apart and made rollbacks a config toggle.

DevOps Feature Flags Deployment CI/CD

KU

Kiril UrbonasDevOps Engineer

|Jul 7, 2026

Feature Flags for Safe Deploys: Decoupling Release From Deploy

Topics

Terraform188 Monitoring179 AWS153 Kubernetes110 LLM102 CI/CD93 Linux83 Python81 GPT73 Security69

Latest Articles

Field Notes: Prompt Versioning and Regression Testing

••December 6, 2024

Field Notes: Prompt Versioning and Regression Testing

We changed a system prompt for what we thought was a tone improvement and broke a customer-critical extraction overnight. The version control and regression tests we built next.

Kiril Urbonas·8 min read·6

Production Playbook: Cloud Disaster Recovery Runbook Design

••August 10, 2024

Production Playbook: Cloud Disaster Recovery Runbook Design

A DR runbook nobody reads is worse than no runbook. The shape that finally got ours executed correctly under pressure.

Kiril Urbonas·7 min read·5

Page 42 of 44 · 517 posts

1...41 42 43 44

DevOpsNess

Practical AI, DevOps, Cloud, and Linux guidance for engineering teams

Weekly deep dives, implementation patterns, and reliability-focused playbooks.

Join Newsletter Browse Posts

A practical blog covering AI, cloud, DevOps, and modern technology for engineering teams.

Explore

Latest Articles
Archive
Reading List

Resources

About
FAQ
RSS Feed
Newsletter

Deep Dive: SLO-Based Monitoring for APIs

••July 11, 2024

Deep Dive: SLO-Based Monitoring for APIs

We replaced 47 percentile threshold alerts with 3 SLO burn-rate alerts. The on-call rotation gets paged less and catches more.

Kiril Urbonas·7 min read·5

Deep Dive: Secure Container Supply Chain Controls

••July 7, 2024

Deep Dive: Secure Container Supply Chain Controls

We mapped every byte that ends up in our production containers. The map showed three places trust was implicit. Each became a control.

Kiril Urbonas·8 min read·3

Deep Dive: Multi-Cluster Traffic Routing Strategies

••June 13, 2024

Deep Dive: Multi-Cluster Traffic Routing Strategies

We expanded from one Kubernetes cluster to four across two regions. The traffic-routing layer was the hardest piece. Here's what we tried, what worked, and what we'd do again.

Kiril Urbonas·8 min read·11

Deep Dive: Model Serving Observability Stack

••June 2, 2024

Deep Dive: Model Serving Observability Stack

We had Datadog for app metrics, Loki for logs, and zero useful insight into what our LLM service was actually doing. Here's the observability stack we built specifically for model serving.

Kiril Urbonas·8 min read·11

Practical Guide: Incident Response for Platform Teams

••March 20, 2024

Practical Guide: Incident Response for Platform Teams

Platform teams own the systems that EVERY service depends on. Our incident response playbook for when the foundation cracks.

Kiril Urbonas·9 min read·2

Practical Guide: Infrastructure Drift Detection Workflow

••March 11, 2024

Practical Guide: Infrastructure Drift Detection Workflow

We had three months of slow drift between our Terraform code and AWS reality. Here's the daily-cron + Slack workflow that closed the gap.

Kiril Urbonas·8 min read·4

Fine-tuning Large Language Models: A Practical Guide

••February 12, 2024

Fine-tuning Large Language Models: A Practical Guide

Learn how to fine-tune LLMs like Llama 2, Mistral, and GPT models for your specific use case. Includes LoRA, QLoRA, and full fine-tuning techniques.

Kiril Urbonas·4 min read·13

Infrastructure as Code: Terraform vs Pulumi vs Ansible

••February 10, 2024

Infrastructure as Code: Terraform vs Pulumi vs Ansible

Compare Terraform, Pulumi, and Ansible for Infrastructure as Code. Learn when to use each tool and how they complement each other in modern DevOps workflows.

Kiril Urbonas·4 min read·6

Linux System Monitoring with Prometheus and Grafana

••February 7, 2024

Linux System Monitoring with Prometheus and Grafana

Set up comprehensive Linux system monitoring using Prometheus and Grafana. Monitor CPU, memory, disk, network, and application metrics with beautiful dashboards.

Kiril Urbonas·4 min read·9

AWS Cost Optimization: 10 Strategies to Reduce Your Cloud Bill

••February 5, 2024

AWS Cost Optimization: 10 Strategies to Reduce Your Cloud Bill

Discover proven strategies to reduce AWS costs by up to 50%. Learn about Reserved Instances, Spot Instances, right-sizing, and automated cost management.

Kiril Urbonas·4 min read·5

Legal

Privacy
Terms

© 2026 DevOpsNess. By Kiril Urbonas.

RSS Privacy Terms

Reading List About