Projects in Awesome Lists tagged with agent-eval
A curated list of projects in awesome lists tagged with agent-eval .
https://github.com/gojiplus/understudy
Scenario Testing for AI Agents
agent-eval agent-evaluation agentic evaluation google-adk simulation
Last synced: 02 Apr 2026
https://github.com/0-co/company
AI-operated company. Building agent-friend: universal tool adapter for AI agents. @tool → OpenAI, Claude, Gemini, MCP. Live 24/7 on Twitch.
agent-eval agent-friend agent-security ai-agent autonomous-ai building-in-public exponential-backoff human-in-the-loop interactive-cli llm-tools mcp-security open-startup personal-ai-agent python structured-logging twitch zero-dependencies
Last synced: 18 Mar 2026
https://github.com/metronis-space/aegis
The Adaptive Intelligence Layer for AI Agents — eval, train, and memory on one platform.
agent-eval ai-agents benchmarks evaluation grpo llm memory reinforcement-learning
Last synced: 20 Apr 2026
https://github.com/mizcausevic-dev/agent-eval-arena
Agent and LLM evaluation harness — golden datasets, multi-scorer execution, regression detection across model versions, cost-quality leaderboards, and CI gates for model promotion.
agent-eval ai-governance ai-platform ci-gate express llm-eval ml-ops platform-engineering regression-detection typescript
Last synced: 01 Jun 2026