Harness Engineering for AI Agents

Build Production-Grade Agentic Systems with Open-Source Models

Intermediate · 12h estimated · 5 modules

AI Agents, LLM Engineering, Python, Ollama, Fine-tuning

Modules

The Harness Thesis

The central claim that the harness matters more than the model, plus the pipeline architecture, experimental methodology, and model-role separation that put it on rigorous footing.

Context Engineering & Memory

Planning, dual-search, saturation gating, persistent memory, and large-document context extraction — the research half of the pipeline that determines what the model sees.

Verification & Failure Modes

The evaluate → revise → verify loop, the decimalized 5-dimension rubric, harness-side verification, failure taxonomy from 1,500 logged runs, and the 3-persona evaluation panel.
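As a rough illustration of the evaluate → revise → verify loop named in this module, here is a minimal sketch. It is not the course's implementation: the function name, the `LoopResult` type, the single numeric score, the `threshold` of 8.0, and the `max_rounds` cap are all assumptions made for the example, and the three callables stand in for whatever model or harness checks the course actually uses.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class LoopResult:
    text: str        # final draft
    score: float     # last evaluation score
    verified: bool   # True only if the score met the threshold AND verify passed
    rounds: int      # number of revisions performed


def evaluate_revise_verify(
    draft: str,
    evaluate: Callable[[str], float],      # hypothetical scorer (e.g. a rubric total)
    revise: Callable[[str, float], str],   # hypothetical reviser, given draft and score
    verify: Callable[[str], bool],         # hypothetical harness-side check
    threshold: float = 8.0,                # assumed pass mark, not from the course
    max_rounds: int = 3,                   # assumed revision budget
) -> LoopResult:
    """Score the draft, revise while it fails, and stop once it is verified."""
    text, rounds = draft, 0
    while True:
        score = evaluate(text)
        # verify runs only when the score already clears the threshold
        verified = score >= threshold and verify(text)
        if verified or rounds == max_rounds:
            return LoopResult(text, score, verified, rounds)
        text = revise(text, score)
        rounds += 1


if __name__ == "__main__":
    # Toy stand-ins: score is the word count, revision appends a word.
    result = evaluate_revise_verify(
        "one two",
        evaluate=lambda t: float(len(t.split())),
        revise=lambda t, s: t + " more",
        verify=lambda t: "more" in t,
        threshold=4.0,
    )
    print(result)
```

Bounding the loop with a revision budget means a draft that never passes still comes back, flagged as unverified, rather than looping forever.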

Production Systems

The skills hook system, multi-subtask orchestration, security layer (AST scanning, path sandbox, injection detection), cost and inference management, and structured observability.
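To make two items from the security layer concrete, the sketch below shows one possible shape for AST scanning and a path sandbox, using only Python's standard `ast` and `pathlib` modules. The denylists, function names, and the example sandbox root are hypothetical choices for illustration; the course's actual rules may differ.

```python
import ast
from pathlib import Path

# Hypothetical denylists, chosen only for this example.
BLOCKED_MODULES = {"os", "subprocess", "socket"}
BLOCKED_CALLS = {"eval", "exec", "__import__"}


def scan_source(source: str) -> list[str]:
    """Return a list of findings for blocked imports and calls in the source."""
    findings = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            for alias in node.names:
                if alias.name.split(".")[0] in BLOCKED_MODULES:
                    findings.append(f"import {alias.name}")
        elif isinstance(node, ast.ImportFrom):
            if node.module and node.module.split(".")[0] in BLOCKED_MODULES:
                findings.append(f"from {node.module} import ...")
        elif isinstance(node, ast.Call):
            # Only catches direct calls by name, e.g. eval(...); aliased or
            # attribute-based calls would need more analysis.
            if isinstance(node.func, ast.Name) and node.func.id in BLOCKED_CALLS:
                findings.append(f"call {node.func.id}()")
    return findings


def is_inside_sandbox(candidate: str, root: str) -> bool:
    """True if candidate, resolved relative to root, stays within root."""
    root_path = Path(root).resolve()
    # resolve() collapses ".." segments and follows symlinks where they exist
    target = (root_path / candidate).resolve()
    return target == root_path or root_path in target.parents


if __name__ == "__main__":
    print(scan_source("import subprocess\nresult = eval('1 + 1')"))
    print(is_inside_sandbox("notes/a.txt", "/tmp/sandbox"))
    print(is_inside_sandbox("../etc/passwd", "/tmp/sandbox"))
```

A static scan like this is a first filter rather than a guarantee, which is presumably why the module pairs it with a sandbox and injection detection.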

Self-Improvement

Autonomous synthesis optimization (autoresearch), the full data pipeline from runs.jsonl to SFT and DPO datasets, QLoRA fine-tuning, and the literature review pipeline.