Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny

🌎Datamart Analysis with Machine Learning
https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny

data-analysis data-science dataviz machine-learning ml r rstudio shiny

Last synced: 8 days ago
JSON representation

🌎Datamart Analysis with Machine Learning

Awesome Lists containing this project

README

        

# πŸ“Š Datamart Analysis with Machine Learning (ML)
[![GitHub](https://img.shields.io/badge/GitHub-repository-blue?style=flat&logo=github&logoColor=white)](https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny)
[![Shiny](https://img.shields.io/badge/Built_with-R_Shiny-blue?style=flat&logo=r&logoColor=white)](https://shiny.rstudio.com/)
[![Machine Learning](https://img.shields.io/badge/Machine_Learning-Powered-green?style=flat&logo=python&logoColor=white)](https://scikit-learn.org/)











---

## πŸ”— Links
- πŸ“ˆ **Application** : [Visual Analytics for ML](https://smd-lab-tech.shinyapps.io/Shiny_Dataviz/)
- πŸ“• **Report** : [Case Study Report](./_docs/rprt_ana_donnee_avancees_22-1.pdf)

---

## πŸ“Œ Summary
Development of a predictive model for the **"display"** variable using Machine Learning techniques by transforming all continuous variables into categorical for modeling.

### 1️⃣ Data Presentation
πŸ“Œ **Descriptive analysis** of qualitative and quantitative variables, and their transformation for analysis.

### 2️⃣ Multiple Component Analysis (MCA)
πŸ“‰ Use of **MCA** to reduce data dimensionality, identify principal components, and interpret results.

### 3️⃣ Modeling
- **Decision Tree**: Classification with specific parameters and a **confusion matrix** to assess performance.
- **Random Forest**: Application of **random forest**, parameter tuning, and classification results.
- **Logistic Regression**: Prediction using logistic regression, including **error rates** and accuracy metrics.

### 4️⃣ Model Comparison
πŸ“Š Comparative analysis of three machine learning models: **Decision Tree, Random Forest, and Logistic Regression**.

### 5️⃣ Model Performance (Best Model Analysis)
πŸ“ Evaluation of model performance based on **precision** and **sensitivity**.

---

πŸš€ **Let's make data-driven decisions!**

---
> [@smdlabtech](https://github.com/smdlabtech)