Blog & Insights

I'm currently documenting my engineering journey—focusing on LLM optimization, production-grade RAG pipelines, and MLOps best practices.

In Progress

Architectural Deep Dive

Optimizing RAG Pipelines for Production

A comprehensive guide on balancing latency, cost, and retrieval accuracy in large-scale RAG systems.

Coming Soon

Writing

Machine Learning

Why bigger isn't always better. How I fine-tuned 7B models to outperform GPT-4 on specific domain tasks.

Coming Soon

Drafting

Infrastructure

Closing the gap between research and production with automated pipelines, Docker, and AWS.

Coming Soon

Planning

Observability

Implementing robust monitoring and grounding checks for healthcare and legal AI assistants.

Coming Soon

I'm always open to discussing technical challenges and sharing my findings. If there's something specific you'd like to see covered, let's connect.