An open API service indexing awesome lists of open source software.

https://github.com/punitkumar4871/data_engineering_project

we are building a project which will fetch data from a dynamic pipeline from Github repositary to azure data lake storage Gen 2. after that we will preprocess and clean data in databricks followed by analyzing our datset using big datatools ...... After all preprocessing and analyzing we will train a prediction model on crop yeild.
https://github.com/punitkumar4871/data_engineering_project

Last synced: 4 months ago
JSON representation

we are building a project which will fetch data from a dynamic pipeline from Github repositary to azure data lake storage Gen 2. after that we will preprocess and clean data in databricks followed by analyzing our datset using big datatools ...... After all preprocessing and analyzing we will train a prediction model on crop yeild.

Awesome Lists containing this project

README

          

# data_engineering_project
we are building a project which will fetch data from a dynamic pipeline from Github repositary to azure data lake storage Gen 2. after that we will preprocess and clean data in databricks followed by analyzing our datset using big datatools ...... After all preprocessing and analyzing we will train a prediction model on crop yeild..