awesome-ai-agent-evaluation
A curated list of benchmarks, eval harnesses, papers, datasets, and production checks for AI agents.
Topics: agent-benchmark, agent-evaluation, ai-agents, ai-evaluation, awesome, benchmarks, coding-agents, evals, llm-agents, llm-evaluation
1 star
1 fork
141 projects
Last updated: 30 May 2026