Projects in Awesome Lists tagged with evaluation-methodology
A curated list of projects in awesome lists tagged with evaluation-methodology .
https://github.com/fraware/cta-benchmark
CTA-Bench: research benchmark and toolkit for studying how well systems turn problem descriptions and reference code into Lean 4 proof obligations, and how faithful those obligations are to the intended algorithm.
ai-evaluation autoformalization benchmark evaluation-methodology formal-verification lean program-verification semantic-faithfulness theorem-proving
Last synced: 09 Jun 2026