"ai-benchmark" Awesome Lists
awesome-ai-agent-testing
🤖 A curated list of resources for testing AI agents - frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems
agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering
32 stars
8 forks
168 projects
Last updated: 20 Apr 2026
awesome-ai-benchmarks-evaluation
A curated list of evaluation tools, benchmark datasets, leaderboards, frameworks, and resources for assessing model performance.
ai ai-benchmark ai-benchmarks ai-evaluation awesome awesome-list awesome-lists
5 stars
0 forks
52 projects
Last updated: 23 Mar 2026