Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ArdentAI1/DE-Bench
DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer
https://github.com/ArdentAI1/DE-Bench
Last synced: 3 days ago
JSON representation
DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer
- Host: GitHub
- URL: https://github.com/ArdentAI1/DE-Bench
- Owner: ArdentAI1
- License: agpl-3.0
- Created: 2024-05-27T06:01:43.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-12-15T05:34:17.000Z (24 days ago)
- Last Synced: 2024-12-15T06:23:42.936Z (24 days ago)
- Language: Python
- Homepage: http://Ardentai.io
- Size: 64.5 KB
- Stars: 11
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome_ai_agents - De-Bench - DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer (Building / Testing)
- awesome_ai_agents - De-Bench - DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer (Building / Testing)
README
# DE-Bench
DE Bench: Can Agents Solve Real-World Data Engineering Problems?This is repository of real world problems for Data Engineering Agents to solve
There is a README within each test folder to explain the problem and the tests
To Run this testing yourself:
1. Clone the repo into wherever you want. Ideally a tests folder
2. Set Environment variables
a. Set BENCHMARK_ROOT to the full path of the folder you clone the repo into
b. Set MODEL_PATH to the path to your model3. Edit the Run_Model.py file to edit the wrapper and import in your model. You must make sure MODEL_PATH is the same path for your model import. Plug in your model to the wrapper function in Run_Model
4. Install requirements.txt with pip install -r requirements.txt
4. Use pytest to run. Pytest to run all or pytest -m "category" to run all tests of a specific category. Pytest supports and and or operators too. Something like pytest -m "one and two" will work.
5. A lot of the tests run on tools or frameworks. We've set up a clean .env file with all the neccesary variables needed. We've tried to optimize the setup of all the tests but it will likely charge some credits through the tools. Keep that in mind
Here's a block to copy
BENCHMARK_ROOT = ""
MODEL_PATH = ""#Provider Stuff