Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/IAAR-Shanghai/UHGEval
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
- Host: GitHub
- URL: https://github.com/IAAR-Shanghai/UHGEval
- Owner: IAAR-Shanghai
- License: apache-2.0
- Created: 2023-11-06T11:46:22.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-08T13:08:38.000Z (2 months ago)
- Last Synced: 2024-11-09T00:52:59.573Z (about 1 month ago)
- Topics: benchmark, ceval, chatgpt, dataset, evaluation, gpt-3, gpt-4, hallucination, hallucination-detection, hallucination-evaluation, hallucinations, huggingface, huggingface-transformers, large-language-models, llm, openai, openai-api, qwen
- Language: Python
- Homepage: https://aclanthology.org/2024.acl-long.288/
- Size: 65.1 MB
- Stars: 182
- Watchers: 12
- Forks: 17
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Citation: CITATION.cff
Awesome Lists containing this project
- StarryDivineSky - IAAR-Shanghai/UHGEval