https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny
๐Datamart Analysis with Machine Learning
https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny
data-analysis data-science dataviz machine-learning ml r retail-analysis rstudio shiny
Last synced: about 1 year ago
JSON representation
๐Datamart Analysis with Machine Learning
- Host: GitHub
- URL: https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny
- Owner: smdlabtech
- Created: 2022-04-07T21:16:18.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2025-03-28T17:03:41.000Z (over 1 year ago)
- Last Synced: 2025-03-28T18:22:52.389Z (over 1 year ago)
- Topics: data-analysis, data-science, dataviz, machine-learning, ml, r, retail-analysis, rstudio, shiny
- Language: R
- Homepage:
- Size: 9.78 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# ๐ Datamart Analysis with Machine Learning (ML)
[](https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny)
[](https://shiny.rstudio.com/)
[](https://scikit-learn.org/)
## ๐ Links
- ๐ **Application** : [Visual Analytics for ML](https://smd-lab-tech.shinyapps.io/Shiny_Dataviz/)
- ๐ **Report** : [Case Study Report](./_docs/rprt_ana_donnee_avancees_22-1.pdf)
## ๐ Summary
Development of a predictive model for the **"display"** variable using Machine Learning techniques by transforming all continuous variables into categorical for modeling.
### 1๏ธโฃ Data Presentation
๐ **Descriptive analysis** of qualitative and quantitative variables, and their transformation for analysis.
### 2๏ธโฃ Multiple Component Analysis (MCA)
๐ Use of **MCA** to reduce data dimensionality, identify principal components, and interpret results.
### 3๏ธโฃ Modeling
- **Decision Tree**: Classification with specific parameters and a **confusion matrix** to assess performance.
- **Random Forest**: Application of **random forest**, parameter tuning, and classification results.
- **Logistic Regression**: Prediction using logistic regression, including **error rates** and accuracy metrics.
### 4๏ธโฃ Model Comparison
๐ Comparative analysis of three machine learning models: **Decision Tree, Random Forest, and Logistic Regression**.
### 5๏ธโฃ Model Performance (Best Model Analysis)
๐ Evaluation of model performance based on **precision** and **sensitivity**.
๐ **Let's make data-driven decisions!**
---
> [@smdlabtech](https://github.com/smdlabtech)