https://github.com/abhisang3/xverify
xVerify: Efficient Answer Verifier for Large Language Model Evaluations
https://github.com/abhisang3/xverify
benchmark chatgpt deepseek-math evaluation judge-model llm math-verify open-compass open-r1 reasoning-models reliability reliability-tools xverify
Last synced: 2 months ago
JSON representation
xVerify: Efficient Answer Verifier for Large Language Model Evaluations
- Host: GitHub
- URL: https://github.com/abhisang3/xverify
- Owner: Abhisang3
- License: other
- Created: 2025-03-30T08:01:26.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2025-03-30T09:10:34.000Z (2 months ago)
- Last Synced: 2025-03-30T09:26:23.547Z (2 months ago)
- Topics: benchmark, chatgpt, deepseek-math, evaluation, judge-model, llm, math-verify, open-compass, open-r1, reasoning-models, reliability, reliability-tools, xverify
- Language: Python
- Size: 806 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0