awesome-harness-engineering
Curated list for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.
Tag
2 entries tagged with #evals.
Eval harnesses, simulation frameworks, and observability platforms for measuring whether your agent is actually getting better.
Curated list for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.
Code-first toolkit for building, evaluating, and deploying agents on Google's stack. Tool wiring, traces, and eval harness in one package.