Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://github.com/THUDM/AgentBench
chatgpt gpt-4 llm llm-agent
Last synced: about 1 month ago
JSON representation
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
- Host: GitHub
- URL: https://github.com/THUDM/AgentBench
- Owner: THUDM
- License: apache-2.0
- Created: 2023-07-28T04:32:06.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-08-21T10:16:45.000Z (4 months ago)
- Last Synced: 2024-10-29T15:45:33.757Z (about 1 month ago)
- Topics: chatgpt, gpt-4, llm, llm-agent
- Language: Python
- Homepage: https://llmbench.ai
- Size: 22.9 MB
- Stars: 2,189
- Watchers: 28
- Forks: 152
- Open Issues: 47
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-colab-project - AgentBench
- awesome-llm-eval - AgentBench
- awesome-autonomous-gpt - 2023/08/07 - A Comprehensive Benchmark to Evaluate LLMs as Agents. [[paper]](https://arxiv.org/abs/2308.03688) (Projects / Benchmarks)
- StarryDivineSky - THUDM/AgentBench
- awesome-chatgpt - THUDM/AgentBench - A Comprehensive Benchmark to Evaluate LLMs as Agents (Other / Other sdk/libraries)
- Awesome-LLMSecOps - AgentBench