An open API service indexing awesome lists of open source software.

https://github.com/fraware/rust-evals

Evaluate existing candidate patches; built to make benchmark claims auditable, reproducible, and explicitly evaluator-conditioned

Topics: artifact-evaluation, benchmarking, coding-agents, evaluation, formal-methods, llm, python, reproducibility, rust, swe-bench


Awesome Lists containing this project