Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
- Host: GitHub
- URL: https://github.com/THUDM/AgentBench
- Owner: THUDM
- License: apache-2.0
- Created: 2023-07-28T04:32:06.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-06-19T13:48:34.000Z (3 months ago)
- Last Synced: 2024-06-25T20:04:04.428Z (3 months ago)
- Topics: chatgpt, gpt-4, llm, llm-agent
- Language: Python
- Homepage: https://llmbench.ai
- Size: 23 MB
- Stars: 1,970
- Watchers: 29
- Forks: 132
- Open Issues: 31
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-colab-project - AgentBench
- awesome-llm-eval - AgentBench
- awesome-autonomous-gpt - 2023/08/07 - A Comprehensive Benchmark to Evaluate LLMs as Agents. [[paper]](https://arxiv.org/abs/2308.03688) (Projects / Benchmarks)
- StarryDivineSky - THUDM/AgentBench
- awesome-chatgpt - THUDM/AgentBench - A Comprehensive Benchmark to Evaluate LLMs as Agents (Other / Other sdk/libraries)
- Awesome-LLMSecOps - AgentBench