https://github.com/ArdentAI1/DE-Bench

DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer
https://github.com/ArdentAI1/DE-Bench

Last synced: 4 months ago
JSON representation

DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer

Host: GitHub
URL: https://github.com/ArdentAI1/DE-Bench
Owner: ArdentAI1
License: agpl-3.0
Created: 2024-05-27T06:01:43.000Z (12 months ago)
Default Branch: main
Last Pushed: 2024-12-15T05:34:17.000Z (5 months ago)
Last Synced: 2024-12-15T06:23:42.936Z (5 months ago)
Language: Python
Homepage: http://Ardentai.io
Size: 64.5 KB
Stars: 11
Watchers: 2
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome_ai_agents - De-Bench - DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer (Building / Testing)
awesome_ai_agents - De-Bench - DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer (Building / Testing)

README

        # DE-Bench

DE Bench: Can Agents Solve Real-World Data Engineering Problems?

This is repository of real world problems for Data Engineering Agents to solve

There is a README within each test folder to explain the problem and the tests

To Run this testing yourself:

1. Clone the repo into wherever you want. Ideally a tests folder

2. Set Environment variables

  a. Set BENCHMARK_ROOT to the full path of the folder you clone the repo into

  b. Set MODEL_PATH to the path to your model

3. Edit the Run_Model.py file to edit the wrapper and import in your model. You must make sure MODEL_PATH is the same path for your model import. Plug in your model to the wrapper function in Run_Model

4. Install requirements.txt with pip install -r requirements.txt

4. Use pytest to run. Pytest to run all or pytest -m "category" to run all tests of a specific category. Pytest supports and and or operators too. Something like pytest -m "one and two" will work.

5. A lot of the tests run on tools or frameworks. We've set up a clean .env file with all the neccesary variables needed. We've tried to optimize the setup of all the tests but it will likely charge some credits through the tools. Keep that in mind

Here's a block to copy




BENCHMARK_ROOT = ""

MODEL_PATH = ""

#Provider Stuff

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ArdentAI1/DE-Bench

Awesome Lists containing this project

README