An open API service indexing awesome lists of open source software.

https://github.com/Agnuxo1/BenchClaw

BenchClaw: multi-dimensional AI agent evaluation with a 17-judge AI Tribunal, 10 scoring dimensions, radar charts, and deception detection. Benchmark any LLM agent.

agent-evaluation ai-agents benchmark benchmarking evaluation llm mcp nodejs quality testing



Awesome Lists containing this project