# 📊 Datamart Analysis with Machine Learning (ML)
[![GitHub](https://img.shields.io/badge/GitHub-repository-blue?style=flat&logo=github&logoColor=white)](https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny)
[![Shiny](https://img.shields.io/badge/Built_with-R_Shiny-blue?style=flat&logo=r&logoColor=white)](https://shiny.rstudio.com/)
[![Machine Learning](https://img.shields.io/badge/Machine_Learning-Powered-green?style=flat&logo=python&logoColor=white)](https://scikit-learn.org/)

## 🔗 Links
- 📈 **Application**: [Visual Analytics for ML](https://smd-lab-tech.shinyapps.io/Shiny_Dataviz/)
- 📕 **Report**: [Case Study Report](./_docs/rprt_ana_donnee_avancees_22-1.pdf)

## 📌 Summary
This project builds a predictive model for the **"display"** variable using Machine Learning techniques. All continuous variables are first transformed into categorical variables, and the models are then fitted on the categorical data.
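
As an illustration of the continuous-to-categorical step, here is a minimal Python sketch that bins a numeric variable into quartile-based categories. It is not the repository's R code: the `discretize` helper, the `prices` values, and the choice of quartile bins are assumptions made for the example, and the report may use different cut points.

```python
from statistics import quantiles

def discretize(values, n_bins=4):
    """Map each numeric value to a quantile-based category label such as 'Q1'."""
    # Cut points that split the data into n_bins groups of roughly equal size.
    cuts = quantiles(values, n=n_bins, method="inclusive")
    labels = []
    for v in values:
        # The number of cut points strictly below v gives the bin index.
        bin_index = sum(v > c for c in cuts)
        labels.append(f"Q{bin_index + 1}")
    return labels

# Hypothetical continuous variable, invented for illustration only.
prices = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
price_cat = discretize(prices, n_bins=4)
print(price_cat)
```

Quantile bins keep the categories balanced in size. Fixed-width or domain-driven thresholds are equally valid choices, depending on the variable.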

### 1️⃣ Data Presentation
📌 **Descriptive analysis** of qualitative and quantitative variables, and their transformation for analysis.

### 2️⃣ Multiple Correspondence Analysis (MCA)
📉 Use of **MCA** to reduce the dimensionality of the categorical data, identify the principal dimensions, and interpret the results.
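
MCA is correspondence analysis applied to an indicator table with one 0/1 column per category of each variable. The sketch below covers only that first step, building the indicator table, in plain Python. The `indicator_table` helper, the `price` variable, and the category values are hypothetical. The decomposition itself is left to a dedicated library.

```python
def indicator_table(rows):
    """One-hot encode categorical rows into the 0/1 table that MCA decomposes."""
    variables = sorted(rows[0])
    # One column per (variable, category) pair observed in the data.
    columns = sorted({(var, row[var]) for row in rows for var in variables})
    table = [
        [1 if row[var] == cat else 0 for var, cat in columns]
        for row in rows
    ]
    return columns, table

# Hypothetical records; only the "display" variable name comes from the project.
records = [
    {"price": "low", "display": "yes"},
    {"price": "high", "display": "no"},
    {"price": "low", "display": "no"},
]
columns, table = indicator_table(records)
for row in table:
    print(row)
```

Each row sums to the number of variables, because every observation falls in exactly one category per variable.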

### 3️⃣ Modeling
- **Decision Tree**: Classification with specific parameters and a **confusion matrix** to assess performance.
- **Random Forest**: Application of **random forest**, parameter tuning, and classification results.
- **Logistic Regression**: Prediction using logistic regression, including **error rates** and accuracy metrics.
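
Whichever classifier is used, the confusion matrix and error rate are computed the same way from the actual and predicted labels. The sketch below shows that computation in plain Python. The labels are invented for illustration and do not come from the project's data.

```python
from collections import Counter

def confusion_matrix(actual, predicted):
    """Count how often each (actual, predicted) label pair occurs."""
    return Counter(zip(actual, predicted))

def error_rate(actual, predicted):
    """Share of observations whose predicted label differs from the actual one."""
    wrong = sum(a != p for a, p in zip(actual, predicted))
    return wrong / len(actual)

# Hypothetical labels for the "display" target, invented for illustration.
actual = ["yes", "no", "no", "yes", "no", "no"]
predicted = ["yes", "no", "yes", "no", "no", "no"]

matrix = confusion_matrix(actual, predicted)
rate = error_rate(actual, predicted)
print(matrix)
print(f"error rate: {rate:.3f}")
```

Running the same two functions on each model's predictions gives directly comparable error rates.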

### 4️⃣ Model Comparison
📊 Comparative analysis of three machine learning models: **Decision Tree, Random Forest, and Logistic Regression**.

### 5️⃣ Model Performance (Best Model Analysis)
Evaluation of model performance based on **precision** and **sensitivity**.
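
For reference, precision is TP / (TP + FP) and sensitivity (recall) is TP / (TP + FN), where TP, FP, and FN count true positives, false positives, and false negatives. The Python sketch below applies those definitions. The labels and the choice of "yes" as the positive class are assumptions made for the example.

```python
def precision_and_sensitivity(actual, predicted, positive="yes"):
    """Return (precision, sensitivity) for the chosen positive class."""
    pairs = list(zip(actual, predicted))
    tp = sum(a == positive and p == positive for a, p in pairs)
    fp = sum(a != positive and p == positive for a, p in pairs)
    fn = sum(a == positive and p != positive for a, p in pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    return precision, sensitivity

# Hypothetical labels, invented for illustration.
actual = ["yes", "yes", "yes", "yes", "no", "no", "no", "no"]
predicted = ["yes", "yes", "no", "no", "yes", "no", "no", "no"]

precision, sensitivity = precision_and_sensitivity(actual, predicted)
print(f"precision: {precision:.3f}, sensitivity: {sensitivity:.3f}")
```

Here the model finds half of the true "yes" cases (sensitivity 0.5), and two out of three of its "yes" predictions are correct (precision about 0.667).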

🚀 **Let's make data-driven decisions!**

---
> [@smdlabtech](https://github.com/smdlabtech)