An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with evaluation-methodology

A curated list of projects in awesome lists tagged with evaluation-methodology.

https://github.com/fraware/cta-benchmark

CTA-Bench: research benchmark and toolkit for studying how well systems turn problem descriptions and reference code into Lean 4 proof obligations, and how faithful those obligations are to the intended algorithm.

ai-evaluation autoformalization benchmark evaluation-methodology formal-verification lean program-verification semantic-faithfulness theorem-proving

Last synced: 09 Jun 2026