Writing

Thoughts & Deep Dives

Writing on AI engineering, distributed systems, cloud infrastructure, and the craft of building software that actually lasts.

4 articles

11 min read
AI/ML

5 Hard Lessons from Shipping RAG to Production

What nobody tells you about retrieval-augmented generation until you're debugging at 2am — chunking strategies, embedding drift, and latency traps that will bite you.

RAG, LangChain, LLMs, Production, Python
Feb 14, 2025
9 min read
Engineering

Go Concurrency Patterns I Actually Use in Production

Beyond the basics — worker pools, fan-out/fan-in, semaphore-bounded goroutines, and the context cancellation patterns that keep distributed systems from turning into memory leak generators.

Go, Concurrency, Backend, Performance
Jan 20, 2025
8 min read
Cloud

Cutting Our Kubernetes Bill by 60% Without Touching the App

A practical walkthrough of the resource requests, node sizing, cluster autoscaler tuning, and spot instance strategies that slashed our monthly EKS spend from $14k to $5.6k.

Kubernetes, AWS, EKS, Cost, DevOps
Dec 5, 2024
7 min read
Career

From Engineer to Consultant: What the Transition Actually Looks Like

The unglamorous reality of going independent — how I priced my first engagements, what I got wrong, and the systems I built to replace the stability of a full-time role.

Career, Consulting, Business, Independent
Oct 18, 2024