Machine learning, LLM operations, and practical AI engineering.
A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.
AI Inference Cost Optimization. Practical guidance for reliable, scalable platform operations.