Two of the highest-leverage things you can build with LLMs today: retrieval that grounds answers in your own data, and agents that actually complete tasks. I build both — with the evaluation and guardrails that make them trustworthy.

Retrieval-Augmented Generation (RAG)

  • Ground model output in your documents, policies, catalog, or knowledge base so answers cite real sources instead of hallucinating.
  • Full pipeline: ingestion, chunking, embeddings, vector store (ChromaDB / pgvector), retrieval, and re-ranking.
  • An evals harness (ragas or a custom suite) so retrieval and answer quality are measured against a test set — the piece most builds skip.

AI Agents

  • Tool-using agents that plan and execute real workflows — research-and-draft, ops automation, data tasks.
  • Guardrails and human-in-the-loop checkpoints for anything that writes to a system of record.
  • Built on hands-on experience with LangChain and the agent ecosystem (including a merged fix to LangChain’s agent tool-input parser).

MCP (Model Context Protocol)

  • I ship MCP servers that expose your tools and data sources to AI assistants through a clean, current interface — so your systems plug into the AI tools your team already uses.
  • A differentiating, up-to-date capability: standardized tool access instead of one-off integrations.