Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/Norditech-AB/SOP-bench

Benchmark for evaluating llm agents to solve real-world standard operating procedures
https://github.com/Norditech-AB/SOP-bench

Last synced: 3 days ago
JSON representation

Benchmark for evaluating llm agents to solve real-world standard operating procedures

Awesome Lists containing this project