Agent Eval Harness - AI Skill | Natoma

plaited

Agent Eval Harness

Type

Instruction

Platforms

Claude CodeClaude.aiAPI

Best for

developerqa

Resources

Source Repository Author

Agent Eval Harness

This skill helps you evaluate CLI agent trajectories by capturing full runs and providing structured JSONL for downstream scoring.

Try it out

Help me use the Agent Eval Harness skill effectively.

How it works

This skill helps you evaluate CLI agent trajectories by capturing full runs and providing structured JSONL for downstream scoring.

Tags

aiclitestingautomationanalyticstypescript