https://github.com/mckraqs/dataride
Lightning-fast data platform setup toolkit for small projects and PoCs
https://github.com/mckraqs/dataride
data data-engineering python terraform
Last synced: 4 months ago
JSON representation
Lightning-fast data platform setup toolkit for small projects and PoCs
- Host: GitHub
- URL: https://github.com/mckraqs/dataride
- Owner: mckraqs
- License: mit
- Created: 2022-10-18T10:23:42.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-11-23T14:55:23.000Z (about 3 years ago)
- Last Synced: 2025-03-25T01:47:10.782Z (11 months ago)
- Topics: data, data-engineering, python, terraform
- Language: Python
- Homepage:
- Size: 370 KB
- Stars: 6
- Watchers: 1
- Forks: 0
- Open Issues: 8
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# dataride: lightning-fast data platform setup toolkit
---
## Introduction
**dataride** is a Python package that enables creating data platform infrastructure within seconds for small/medium projects as well as PoCs (Proof of Concept). It aims to generate ready-to-deploy code for various frameworks, including tools like Terraform and Apache Airflow. It makes use of YAML configuration files to read data platform features that the user is willing to set up.
## Requirements
The underlying logic makes heavy use of Terraform and Jinja templating. Therefore, to fully exploit package features, it's recommended to install Terraform beforehand (possibly one of the latest stable versions). Instructions on how to do this can be found on the [official Terraform tutorial website](https://learn.hashicorp.com/tutorials/terraform/install-cli).
## Example
Below you can find and example of running the `dataride` CLI, using config examples that were prepared inside the `config_examples/` directory. It takes **20 seconds** to go from ready config file to infrastructure setup generation.
```
dataride create -c config_examples/scenario_aws_s3_and_data_catalog.yaml -d results/infra_s3_and_glue
```

## Documentation
For further description of the package's features, please refer to [docs](https://github.com/mckraqs/dataride/tree/main/docs) directory. All the necessary information is stored there.
## Collaboration
If you see any room for improvement, feel free to submit a PR! Let's develop dataride to suit as many data teams as possible.