An open API service indexing awesome lists of open source software.

https://github.com/abhisang3/xverify

xVerify: Efficient Answer Verifier for Large Language Model Evaluations

Topics: benchmark, chatgpt, deepseek-math, evaluation, judge-model, llm, math-verify, open-compass, open-r1, reasoning-models, reliability, reliability-tools, xverify

Last synced: 2 months ago

Awesome Lists containing this project