https://gair-nlp.github.io/benbench/
Benchmarking Benchmark Leakage in Large Language Models
benchmarks dataset large-language-models leakage-detection
Last synced: 7 months ago
- Host: GitHub
- URL: https://gair-nlp.github.io/benbench/
- Owner: GAIR-NLP
- Created: 2023-11-26T08:25:11.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-05-20T01:59:32.000Z (over 1 year ago)
- Last Synced: 2025-03-21T20:03:55.337Z (7 months ago)
- Topics: benchmarks, dataset, large-language-models, leakage-detection
- Language: JavaScript
- Homepage: https://gair-nlp.github.io/benbench/
- Size: 51.7 MB
- Stars: 52
- Watchers: 1
- Forks: 3
- Open Issues: 4
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-foundation-model-leaderboards - BenBench