"ai-evaluation" Awesome Lists
awesome-ai-eval
☑️ A curated list of tools, methods & platforms for evaluating AI reliability in real applications
ai-evaluation ai-evaluation-framework ai-evaluation-metrics ai-evaluation-tools awesome awesome-list awesome-lists chatgpt claude evaluation
81 stars
16 forks
186 projects
Last updated: 01 Jun 2026
Awesome-AI-Evaluation-Guide
A comprehensive, implementation-focused guide to evaluating Large Language Models, RAG systems, and Agentic AI in production environments.
agentic-ai ai-evaluation ai-evaluation-framework ai-evaluation-metrics ai-evaluation-tools awesome awesome-lists claude evaluation-framework evaluation-metrics
14 stars
4 forks
98 projects
Last updated: 30 May 2026
awesome-ai-benchmarks-evaluation
A curated list of evaluation tools, benchmark datasets, leaderboards, frameworks, and resources for assessing model performance.
ai ai-benchmark ai-benchmarks ai-evaluation awesome awesome-list awesome-lists
8 stars
4 forks
52 projects
Last updated: 08 Jun 2026
awesome-ai-agent-evaluation
A curated list of benchmarks, eval harnesses, papers, datasets, and production checks for AI agents.
agent-benchmark agent-evaluation ai-agents ai-evaluation awesome benchmarks coding-agents evals llm-agents llm-evaluation
1 star
1 fork
141 projects
Last updated: 30 May 2026