https://github.com/allenai/WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
https://github.com/allenai/WildBench
Last synced: 3 months ago
JSON representation
Benchmarking LLMs with Challenging Tasks from Real Users
- Host: GitHub
- URL: https://github.com/allenai/WildBench
- Owner: allenai
- License: apache-2.0
- Created: 2024-03-06T23:54:27.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-04-01T01:08:46.000Z (about 1 year ago)
- Last Synced: 2024-04-14T07:50:04.060Z (about 1 year ago)
- Language: Python
- Homepage: https://huggingface.co/spaces/allenai/WildBench
- Size: 7.9 MB
- Stars: 71
- Watchers: 4
- Forks: 6
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- StarryDivineSky - allenai/WildBench
- awesome-golang-ai - WildBench