projects

Open-source AI systems, benchmarks, and ML engineering projects.

work

project thumbnail

CostGuard

Production-grade LLM reliability proxy with cost tracking, circuit breakers, and observability

project thumbnail

RelayOps

Production-shaped telecom support agent with scoped tools, guardrails, RAG citations, evals, live demo, and fine-tuned intent routing

project thumbnail

Tether

Durable execution for long-running LLM agents — automatic checkpoint, resume, and cross-provider failover

project thumbnail

RealDataAgentBench

Automated model-selection engine for production LLM deployments — 12 frontier models, 1,412+ runs, real-world tasks

project thumbnail

LoRA Fine-tuning of DeepSeek-R1

Mathematical reasoning on a 1.5B model — 98.8% parameter reduction, 2× throughput with Unsloth

project thumbnail

AI-Assisted Medical Image Diagnosis

LLM-powered X-ray interpretation tool using Groq Cloud API and vision models