https://github.com/gallillio/data_science-data_visualizer_tool
## About Supervised ML Helper is a Python application that streamlines exploratory data analysis (EDA) and preprocessing for supervised machine learning. Featuring a user-friendly Tkinter interface, it enables users to load CSV files, visualize data, and perform essential transformations, making data preparation accessible for all skill levels.
https://github.com/gallillio/data_science-data_visualizer_tool
data-analysis data-science data-visualization matplotlib numpy pandas seaborn sklearn
Last synced: 7 months ago
JSON representation
## About Supervised ML Helper is a Python application that streamlines exploratory data analysis (EDA) and preprocessing for supervised machine learning. Featuring a user-friendly Tkinter interface, it enables users to load CSV files, visualize data, and perform essential transformations, making data preparation accessible for all skill levels.
- Host: GitHub
- URL: https://github.com/gallillio/data_science-data_visualizer_tool
- Owner: Gallillio
- Created: 2023-09-12T11:53:10.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-03-21T19:58:47.000Z (10 months ago)
- Last Synced: 2025-04-14T10:21:45.829Z (10 months ago)
- Topics: data-analysis, data-science, data-visualization, matplotlib, numpy, pandas, seaborn, sklearn
- Language: Jupyter Notebook
- Homepage:
- Size: 12.7 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Visualizer Tool
## Overview
Data Visualizer Tool is a Python application designed to assist users in performing exploratory data analysis (EDA). The application provides a user-friendly interface built with Tkinter, allowing users to easily load datasets, visualize data, and apply various transformations.
This project was created during my early years at university, and while it serves its purpose, Since Tkinter is considered an outdated tool for building GUIs, I plan to redo it using more modern frameworks like Flask, Django, or FastAPI as well as include Machine Learning capabilities. (If I have the time that is).
## Features
- **Load CSV Files**: Users can select and load CSV files into the application for analysis.
- **Exploratory Data Analysis (EDA)**: The application provides detailed EDA capabilities, including:
- Summary statistics of the dataset.
- Visualization of missing values.
- Detailed analysis of individual columns, including histograms and common values.

- **Data Transformation**: Users can perform various data transformations, including:
- Handling missing values (mean, median, or removal).
- Encoding categorical columns using label encoding.
- Renaming and removing columns.
- Removing duplicates from the dataset.
- **Data Visualization**: The application includes several visualization options:
- Histograms for numerical data.
- Stacked bar charts for categorical data.
- Scatter plots to visualize interactions between two columns.

- **Correlation Analysis**: Users can visualize correlations between different columns in the dataset using heatmaps.
- **Interaction Analysis**: Users can explore interactions between different features in the dataset through scatter plots.
- **User-Friendly Interface**: The application is designed with a simple and intuitive interface, making it accessible for users with varying levels of expertise in data science.
## Getting Started
### Prerequisites
- Python 3.x
- Required libraries:
- NumPy
- Pandas
- Matplotlib
- Seaborn
- Tkinter
- scikit-learn
- Pillow
### Installation
1. Clone the repository:
```bash
git clone https://github.com/yourusername/Supervised_ML_Helper.git
```
2. Navigate to the project directory:
```bash
cd Supervised_ML_Helper
```
3. Install the required libraries:
```bash
pip install -r requirements.txt
```
### Usage
1. You can run the application using the provided Python file for the GUI:
```bash
python Supervised_ML_Classifier_python.py
```
2. Alternatively, there is a Jupyter Notebook available for additional analysis and exploration of the dataset.
3. Follow the on-screen instructions to load your dataset and explore the various features of the application.
## Screenshots
Here are some screenshots of the application in action:
- **Histogram EDA**:

- **Detailed EDA**:

- **Correlation Heatmap**:
