An open API service indexing awesome lists of open source software.

https://github.com/sierra-research/tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
https://github.com/sierra-research/tau2-bench

ai benchmark conversational-agents language-model-agent llm

Last synced: 6 months ago
JSON representation

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Awesome Lists containing this project