Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/gajender0707/data_science_project
This is the End to End Data science project
- Host: GitHub
- URL: https://github.com/gajender0707/data_science_project
- Owner: Gajender0707
- License: apache-2.0
- Created: 2024-11-28T07:15:07.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2024-12-21T09:24:32.000Z (about 2 months ago)
- Last Synced: 2024-12-21T10:22:24.637Z (about 2 months ago)
- Language: Python
- Size: 165 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# data_science_project
This is an end-to-end data science project.

## Workflow of the Data Ingestion Part
1. constants/paths.py: Define static paths.
2. entity/data_ingestion_entity.py: Define schemas for structured data.
3. utils/file_operations.py: Utility functions for file and directory handling.
4. config/configuration.py: Manage configurations and initialize entities.
5. components/data_ingestion.py: Implement the data ingestion functionality.
6. main.py: Orchestrate the execution of all components.

## Important Note:
The remaining components (data_validation, data_transformation, model_trainer and model_evaluation) follow the same structure and sequence as the data_ingestion component.

### Workflow-ML (follow this sequence)
1. Data Ingestion
2. Data Validation
3. Data Transformation
4. Model Trainer
5. Model Evaluation

## Workflow: Steps to Repeat After Every Component
1. update the config.yaml
2. update the schema.yaml
3. update the params.yaml
4. update the entity
5. update the configuration master in src/config
6. update the component
7. update the pipeline
8. update the main.py
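
As a rough illustration of how the pieces listed above could fit together for data ingestion, here is a minimal sketch. It is not the repository's actual code: the names `DataIngestionConfig`, `ConfigurationManager`, `DataIngestion`, `create_directories`, `run_data_ingestion` and every config key are assumptions made for the example, and a plain dictionary stands in for the contents of config.yaml so that the sketch needs no YAML library. The "ingestion" step here is just a local file copy.

```python
from dataclasses import dataclass
from pathlib import Path
import shutil
import tempfile

# constants: static paths (hypothetical value, not taken from the repository)
ARTIFACTS_DIR_NAME = "artifacts"


# entity: a typed, immutable schema for the ingestion settings
@dataclass(frozen=True)
class DataIngestionConfig:
    root_dir: Path
    source_file: Path
    local_data_file: Path


# utils: file and directory helpers
def create_directories(paths):
    """Create each directory, including parents, if it does not exist yet."""
    for path in paths:
        Path(path).mkdir(parents=True, exist_ok=True)


# configuration: turn raw config values into entities
class ConfigurationManager:
    def __init__(self, config: dict):
        # Assumption: the real project would load this from config.yaml.
        self.config = config

    def get_data_ingestion_config(self) -> DataIngestionConfig:
        section = self.config["data_ingestion"]
        root_dir = Path(section["root_dir"])
        create_directories([root_dir])
        return DataIngestionConfig(
            root_dir=root_dir,
            source_file=Path(section["source_file"]),
            local_data_file=root_dir / section["local_file_name"],
        )


# component: the ingestion logic itself (here, a simple local copy)
class DataIngestion:
    def __init__(self, config: DataIngestionConfig):
        self.config = config

    def ingest(self) -> Path:
        shutil.copy(self.config.source_file, self.config.local_data_file)
        return self.config.local_data_file


# main: orchestrate configuration and component
def run_data_ingestion(config: dict) -> Path:
    manager = ConfigurationManager(config)
    ingestion = DataIngestion(manager.get_data_ingestion_config())
    return ingestion.ingest()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        source = Path(tmp) / "raw.csv"
        source.write_text("id,value\n1,10\n")
        demo_config = {
            "data_ingestion": {
                "root_dir": str(Path(tmp) / ARTIFACTS_DIR_NAME / "data_ingestion"),
                "source_file": str(source),
                "local_file_name": "data.csv",
            }
        }
        print(run_data_ingestion(demo_config))
```

Under the same assumptions, a later stage such as data validation would add its own entity, a matching `get_..._config` method on the configuration class, and its own component, then be called from main.py after ingestion.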