Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/gajender0707/data_science_project

This is the End to End Data science project
https://github.com/gajender0707/data_science_project

Last synced: 7 days ago
JSON representation

This is the End to End Data science project

Awesome Lists containing this project

README

        

# data_science_project
This is the End to End Data science project

## Here is the Workflow of the Data-Ingestion-part.....
1. constants/paths.py: Define static paths.
2. entity/data_ingestion_entity.py: Define schemas for structured data.
3. utils/file_operations.py: Utility functions for file and directory handling.
4. config/configuration.py: Manage configurations and initialize entities.
5. components/data_ingestion.py: Implement the data ingestion functionality.
6. main.py: Orchestrate the execution of all components.

## Imporant Note:
Rest of the component like: data_validation, data_transformation, model_trainer and model_evaluation also will follow the same structure and sequence which is followed by the data_ingestion_component.

### Workflow-ML (need to follow the sequence...)
1. Data Ingestion
2. Data Validation
3. Data Transformation
4. Model Trainer
5. Model Evaluation

## Workflows -- Need to do after Every Componenet (Accordingly)
1. update the config.yaml
2. update the schema.yaml
3. update the params.yaml
4. update the entity
5. update the configuration master in src/cofig
6. update the component
7. update the pipeline
8. update the main.py