The Lab is where I explore systems before they become full case studies. These entries are faster to scan than the main Projects page and more focused on technical patterns, failure modes, and workflow design. The emphasis is not polish for its own sake, but learning through disciplined experimentation.

experiment 2025

RAG Failure Maps

A retrieval-first exploration of where RAG systems break, from weak chunk boundaries and noisy evidence to grounding failures and misleading answer confidence.

  • RAG
  • Retrieval
  • Grounding
  • Failure Analysis
Read experiment
experiment 2025

Grounded Extraction Agents

Experiments in multi-step extraction workflows that combine tool-calling, structured outputs, and evidence checks to make extraction more reliable than one-shot prompting.

  • Agentic Workflows
  • Extraction
  • Tool Calling
  • Structured Outputs
Read experiment
experiment 2025

DSPy Retrieval Loops

Structured DSPy-style experiments for improving retrieval and answer quality through optimization, evaluation-aware prompting, and tighter system feedback loops.

  • DSPy
  • Retrieval
  • Optimization
  • Evaluation
Read experiment
tool 2025

Local AI CLI

A command-line workflow for fast AI-assisted research and document tasks, designed to make local experimentation, iteration, and tool chaining more usable in serious technical work.

  • CLI
  • Tooling
  • Automation
  • AI Workflows
Read experiment