
AI Engineer specializing in LLM evaluation, benchmarking, and production-grade AI systems. Builder of CostGuard and RealDataAgentBench.
Feed: https://venkatamanideep.com/feed.xml (generated by Jekyll, updated 2026-06-07T19:18:00+00:00)

How to Actually Use Claude: 18 Steps That Unlock 100% of Its Potential
2025-05-19 | https://venkatamanideep.com/blog/2025/how-to-use-claude

The Most Expensive Mistake in LLM Engineering (And How to Fix It With Data)
2025-05-14 | https://venkatamanideep.com/blog/2025/expensive-mistake-llm-engineering

KV Caching in LLMs
2025-05-10 | https://venkatamanideep.com/blog/2025/kv-caching-llms

You’re Doing RAG Wrong
2025-05-09 | https://venkatamanideep.com/blog/2025/rag-wrong

My LLM App Started Silently Getting Worse. I Almost Didn’t Notice. Here’s What I Built to Catch It.
2025-05-04 | https://venkatamanideep.com/blog/2025/llm-silent-drift

Every LLM Has a Superpower and a Blind Spot. I Built a Benchmark Around That Observation
2025-04-24 | https://venkatamanideep.com/blog/2025/llm-superpower-blind-spot

I Prompted 5 Frontier LLMs to ‘Report Uncertainty’ — Here’s What Happened to Their Statistical Validity Scores
2025-04-18 | https://venkatamanideep.com/blog/2025/llm-uncertainty-statistical-validity

I Ran 163 Benchmarks Across 10 LLMs So You Don’t Have To. Here’s What I Found
2025-04-15 | https://venkatamanideep.com/blog/2025/163-benchmarks-10-llms

I Built a Benchmark That Proves Most LLM Agents Are Statistically Blind — And Why That Costs Companies Real Money
2025-04-11 | https://venkatamanideep.com/blog/2025/llm-agents-statistically-blind

Everyone Is Calling It Prompt Engineering. They’re Already Behind.
2025-04-10 | https://venkatamanideep.com/blog/2025/beyond-prompt-engineering