An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with agent-benchmark

A curated list of projects in awesome lists tagged with agent-benchmark .

https://github.com/hidai25/eval-view

Catch AI agent regressions before you ship. YAML test cases, golden baselines, execution tracing, cost tracking, CI integration. LangGraph, CrewAI, Anthropic, OpenAI.

agent agent-benchmark agent-evaluation agentic-ai ai-agents anthropic crewai crewai-tools evaluation langchain langgraph langgraph-python llm llmops mlops openai-assistants pytest testing tools

Last synced: 09 Mar 2026