Venkata Manideep Patibandla
AI Engineer · Agent Systems · ML Engineering · Production AI
New Haven, CT
I build and ship production AI systems — agents, automated workflows, and reliability layers that solve real engineering problems and hold up under production load.
I built RelayOps, a production-shaped telecom support agent with scoped MCP-style tools, hybrid RAG with citations, guardrails that block invented offers/PII, adversarial agent evals, a Qwen LoRA intent classifier, and a live Railway demo.
I built Tether, a durable execution layer for long-running LLM agents. Wrap your existing OpenAI or Anthropic client and get automatic checkpoint/resume, cross-provider failover, and resilient multi-step agent workflows — without rewriting your agent logic.
I built CostGuard, a production-grade open-source LLM reliability proxy that intercepts calls in real time, auto-rejects low-quality responses via a fallback chain, and tracks exact per-call cost across 12 models and 5 providers — with circuit breakers, Prometheus metrics, and a 6-type alerting engine over Slack and webhooks. Cuts LLM spend 10–20× vs GPT-4o defaults.
To power CostGuard’s model-selection engine, I built RealDataAgentBench (RDAB) — an automated benchmarking system across 12 frontier LLMs and 1,180+ runs on real-world tasks that produces cost-performance trade-off data for production decisions. Key output: GPT-4.1 delivers 97% of GPT-5’s score at 1/15th the cost.
Currently an AI/ML Engineer at Infosoft Solutions, building production ML pipelines, deploying LangGraph and CrewAI agent workflows, and monitoring model quality at scale. IBM Certified in Agentic AI.
news
| Jun 7, 2026 | Released RelayOps v1 — a production-shaped telecom support agent with scoped tools, guardrails, RAG citations, adversarial evals, a Qwen LoRA intent classifier, and a live Railway demo. |
|---|---|
| Aug 1, 2025 | Started a new role as Data & ML Engineer at Infosoft Solutions, building ML pipelines, deploying classifiers at scale, and monitoring production model quality. |
| May 15, 2025 | Graduated with an M.S. in Computer Science from Sacred Heart University (GPA: 3.8/4.0). Inducted into the Upsilon Pi Epsilon (UPE) Honor Society. |
| Apr 11, 2025 | Released RealDataAgentBench — an open-source benchmark across 12 frontier LLMs and 1,180+ runs surfacing the correctness vs. statistical-validity gap in frontier models. |
latest posts
| May 19, 2025 | How to Actually Use Claude: 18 Steps That Unlock 100% of Its Potential |
|---|---|
| May 14, 2025 | The Most Expensive Mistake in LLM Engineering (And How to Fix It With Data) |
| May 10, 2025 | KV Caching in LLMs |