An open API service indexing awesome lists of open source software.

https://github.com/dkealvaro/agent2bench

Agent2Bench is a benchmark that tests LLMs abilities in Daily life computer tasks like booking flights, downloading programs or exiting vim.
https://github.com/dkealvaro/agent2bench

benchmark computer-vision llms

Last synced: 3 months ago
JSON representation

Agent2Bench is a benchmark that tests LLMs abilities in Daily life computer tasks like booking flights, downloading programs or exiting vim.

Awesome Lists containing this project