Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Norditech-AB/SOP-bench
Benchmark for evaluating LLM agents on solving real-world standard operating procedures
Last synced: 3 days ago
- Host: GitHub
- URL: https://github.com/Norditech-AB/SOP-bench
- Owner: Norditech-AB
- Created: 2024-03-24T13:50:55.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-03-24T13:53:10.000Z (10 months ago)
- Last Synced: 2024-03-24T14:46:05.900Z (10 months ago)
- Size: 0 Bytes
- Stars: 2
- Watchers: 1
- Forks: 1
- Open Issues: 0
Awesome Lists containing this project
- awesome_ai_agents - Sop-Bench - Benchmark for evaluating LLM agents on solving real-world standard operating procedures (Building / Benchmarks)