https://github.com/xingyaoww/mint-bench

Official repository for the ICLR 2024 paper "MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback" by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng, and Heng Ji.

Host: GitHub
URL: https://github.com/xingyaoww/mint-bench
Owner: xingyaoww
License: apache-2.0
Created: 2023-09-18T05:34:02.000Z (almost 3 years ago)
Default Branch: main
Last Pushed: 2024-06-04T04:01:10.000Z (about 2 years ago)
Last Synced: 2025-04-10T06:06:28.574Z (about 1 year ago)
Language: Python
Homepage:
Size: 71 MB
Stars: 122
Watchers: 4
Forks: 7
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: docs/CONTRIBUTING.md
- License: LICENSE

Awesome Lists containing this project

StarryDivineSky - xingyaoww/mint-bench
awesome-agent-rl-environments - MINT-Bench - Multi-turn interaction with **tools + natural-language feedback**. 586 representative instances repurposed from 8 datasets across reasoning, code, and decision-making. 📄 [Paper](https://arxiv.org/abs/2309.10691) (Tool-Use & Multi-Tool Environments)
