Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

AI Observability and Monitoring: Tracking Model Performance in Production

Learn how to monitor AI models in production. Track performance, detect drift, and ensure model reliability with comprehensive observability strategies.

Kiril Urbonas

Read article

••5 months ago

Multi-Agent AI Systems: Building Collaborative AI Applications

Learn how to build multi-agent AI systems where multiple AI agents collaborate to solve complex tasks. Architecture patterns and implementation guide.

Kiril Urbonas

Read article

••5 months ago

Prompt Engineering Best Practices: Maximizing LLM Performance

Master prompt engineering techniques to get better results from LLMs. Learn about few-shot learning, chain-of-thought, and advanced prompting strategies.

Kiril Urbonas

Read article

••6 months ago

Model Quantization Techniques: Reducing LLM Size and Cost

Learn how to reduce LLM model size and inference costs using quantization techniques like Q4, Q8, and GPTQ. Practical guide with benchmarks.

Kiril Urbonas

Read article

••6 months ago

Vector Databases for AI: Comparing Pinecone, Weaviate, and ChromaDB

Compare the top vector databases for AI applications. Learn when to use Pinecone, Weaviate, or ChromaDB based on your requirements.

Kiril Urbonas

Read article

••6 months ago

Building RAG Applications: A Complete Guide to Retrieval Augmented Generation

Learn how to build production-ready RAG applications using vector databases, embedding models, and LLMs. Complete guide with code examples and best practices.

Kiril Urbonas

Read article

••6 months ago

Best Practices: AI Inference Cost Optimization

AI Inference Cost Optimization. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas

Read article

••8 months ago

Best Practices: RAG Retrieval Quality Evaluation

RAG Retrieval Quality Evaluation. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas

Read article

••8 months ago

Best Practices: Prompt Versioning and Regression Testing

Prompt Versioning and Regression Testing. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas

Read article

••8 months ago

Best Practices: LLM Gateway Design for Multi-Provider Inference

LLM Gateway Design for Multi-Provider Inference. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas

Read article

••9 months ago

Troubleshooting: AI Inference Cost Optimization

AI Inference Cost Optimization. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas

Read article

••11 months ago

Troubleshooting: RAG Retrieval Quality Evaluation

RAG Retrieval Quality Evaluation. Practical guidance for reliable, scalable platform operations.

Kiril Urbonas

Read article