Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/TheDuckAI/arb

Advanced Reasoning Benchmark Dataset for LLMs
https://github.com/TheDuckAI/arb

benchmark dataset llm

Last synced: about 1 month ago
JSON representation

Advanced Reasoning Benchmark Dataset for LLMs

Awesome Lists containing this project

README

        


duckai logo


Advanced Reasoning Benchmark


arXiv
Lint Status

A DuckAI project in collaboration with the Georgia Institute of Technology, ETH Zürich, Nomos AI, Stanford University Center for Legal Informatics, and the Mila - Quebec AI Institute

### Abstract

ARB is a novel benchmark dataset composed of advanced reasoning problems designed to evaluate LLMs on text comprehension and expert domain reasoning, presenting a more challenging test than prior benchmarks, featuring questions that test deeper knowledge of mathematics, physics, biology, chemistry, and law.

### API Usage

Endpoint url: https://advanced-reasoning-benchmark.netlify.app/api/


The documentation for the complete REST API of the ARB dataset is [here](https://advanced-reasoning-benchmark.netlify.app/api/).

Copyright © 2023 [DuckAI](https://github.com/TheDuckAI)