Fine-Tuning vs RAG vs Long-Context: A Decision Framework With Numbers
We've shipped all three patterns to production. They're not interchangeable. Here's the framework we now use to decide which approach fits a given task.
Latest Articles
Fine-tuning Llama 3 on Consumer Hardware
I fine-tuned Llama 3 8B on a single 4090 over a weekend for a side project. Here's what worked, what cost more than expected, and what I'd do differently.
AWS Cost Optimization Strategies
A different angle on AWS cost work: the operational discipline that prevents costs from creeping back up after the initial cleanup.