https://github.com/xingyaoww/mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
https://github.com/xingyaoww/mint-bench
Last synced: about 1 year ago
JSON representation
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
- Host: GitHub
- URL: https://github.com/xingyaoww/mint-bench
- Owner: xingyaoww
- License: apache-2.0
- Created: 2023-09-18T05:34:02.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2024-06-04T04:01:10.000Z (about 2 years ago)
- Last Synced: 2025-04-10T06:06:28.574Z (about 1 year ago)
- Language: Python
- Homepage:
- Size: 71 MB
- Stars: 122
- Watchers: 4
- Forks: 7
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Contributing: docs/CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- StarryDivineSky - xingyaoww/mint-bench
- awesome-agent-rl-environments - MINT-Bench - turn interaction with **tools + natural-language feedback**. 586 representative instances repurposed from 8 datasets across reasoning, code, decision-making. 📄 [Paper](https://arxiv.org/abs/2309.10691) (Tool-Use & Multi-Tool Environments)