devops/ness
Featured Article

Operational Checklist: AI Inference Cost Optimization

AI Inference Cost Optimization. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas, AI & ML Engineer | Feb 20, 2026

Topics

Monitoring (183), Security (102), AWS (71), Kubernetes (69), Terraform (62), Python (60), Linux (50), CI/CD (49), Ansible (47), LLM (45)

Latest Articles

Deep Dive: Incident Response for Platform Teams
June 25, 2024

Incident Response for Platform Teams. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Blue-Green Deployment Guardrails
June 21, 2024

Blue-Green Deployment Guardrails. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 3 min read
Deep Dive: Infrastructure Drift Detection Workflow
June 17, 2024

Infrastructure Drift Detection Workflow. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Multi-Cluster Traffic Routing Strategies
June 13, 2024

Multi-Cluster Traffic Routing Strategies. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Kubernetes Secrets and External Vault Integration
June 9, 2024

Kubernetes Secrets and External Vault Integration. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Python Worker Queue Scaling Patterns
June 6, 2024

Python Worker Queue Scaling Patterns. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Model Serving Observability Stack
June 2, 2024

Model Serving Observability Stack. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: RAG Retrieval Quality Evaluation
May 28, 2024

RAG Retrieval Quality Evaluation. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Prompt Versioning and Regression Testing
May 24, 2024

Prompt Versioning and Regression Testing. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: LLM Gateway Design for Multi-Provider Inference
May 20, 2024

LLM Gateway Design for Multi-Provider Inference. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Kernel and Package Patch Management
May 17, 2024

Kernel and Package Patch Management. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read
Deep Dive: Systemd Service Reliability Patterns
May 13, 2024

Systemd Service Reliability Patterns. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas · 4 min read