https://github.com/jenishajustin/elevate_eda
Elevate EDA is an interactive data exploration tool designed to simplify Exploratory Data Analysis (EDA). Powered by Python and Streamlit, it offers visualizations, summary statistics, and automated reports to help users gain insights from their datasets efficiently.
plotly-express sklearn streamlit
- Host: GitHub
- URL: https://github.com/jenishajustin/elevate_eda
- Owner: Jenishajustin
- License: mit
- Created: 2024-10-20T12:13:49.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-21T06:23:23.000Z (over 1 year ago)
- Last Synced: 2024-10-21T09:09:49.985Z (over 1 year ago)
- Topics: plotly-express, sklearn, streamlit
- Language: Python
- Homepage:
- Size: 2.26 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Elevate EDA
Elevate EDA is a comprehensive data exploration tool designed to streamline and enhance the process of performing **Exploratory Data Analysis (EDA)**. It provides users with an intuitive interface, powerful visualizations, and insights to better understand and make sense of their datasets.
## Features
- **Data Visualization:** Create a wide range of graphs and charts, including histograms, scatter plots, heatmaps, and more.
- **Summary Statistics:** Easily generate summary statistics for your datasets, including mean, median, variance, and standard deviation.
- **Correlation Analysis:** Visualize the relationships between different features with correlation matrices.
- **Interactive UI:** Built with Streamlit, offering an easy-to-use, interactive interface.
- **Automated Reports:** Export reports in a customizable format for presentations and decision-making.
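As a rough illustration of what the summary statistics and correlation features compute, here is a minimal pandas sketch. It is not taken from the app's source; the DataFrame and its column names are hypothetical sample data.

```python
import pandas as pd

# Hypothetical sample data; in the app this would come from an uploaded CSV.
df = pd.DataFrame({
    "height_cm": [150.0, 160.0, 170.0, 180.0, 190.0],
    "weight_kg": [50.0, 58.0, 66.0, 74.0, 82.0],
    "city": ["A", "B", "A", "C", "B"],
})

# Keep only numeric columns; statistics are undefined for text columns.
numeric = df.select_dtypes(include="number")

# Mean, median, variance, and standard deviation per numeric column.
# pandas uses the sample variance/std (ddof=1) by default.
summary = numeric.agg(["mean", "median", "var", "std"])

# Pairwise Pearson correlation between numeric columns.
corr = numeric.corr()

print(summary)
print(corr)
```

The `corr` frame is the kind of matrix a heatmap would be drawn from; the plotting step is left out here to keep the sketch short.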
## Technologies Used
- **Python**: The core programming language used for data processing.
- **Streamlit**: Framework for building the user interface.
- **Pandas**: Data manipulation and analysis library.
- **Matplotlib** & **Seaborn**: Visualization libraries for creating plots.
- **NumPy**: For numerical computing.
- **Scikit-learn**: For data preprocessing and basic machine learning tasks.
## Installation
To get started with Elevate EDA, follow these steps:
1. **Clone the repository:**
```bash
git clone https://github.com/Jenishajustin/Elevate_EDA.git
```
2. **Navigate to the project directory:**
```bash
cd Elevate_EDA
```
3. **Create a virtual environment (optional but recommended):**
```bash
python -m venv venv
source venv/bin/activate    # Linux/macOS
# On Windows, run this instead of the line above:
# .\venv\Scripts\activate
```
4. **Install the required dependencies:**
```bash
pip install -r requirements.txt
```
5. **Run the Streamlit application:**
```bash
streamlit run app.py
```
6. **Access the web app:**
Open a browser and go to `http://localhost:8501/`.
## Wanna Access Elevate EDA?
Have fun with my EDA tool 👉 - https://elevate-eda-360.streamlit.app/
## Usage
1. Upload your dataset (CSV format).
2. Explore summary statistics, data visualizations, and correlations.
3. Generate insights through visual tools provided in the app.
4. Optionally, download an automated EDA report summarizing the key findings.
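The workflow above can be sketched outside the app with pandas. This is only an approximation under stated assumptions: `build_report` is a hypothetical helper, not a function from the repository, and the in-memory CSV stands in for an uploaded file (any path or file-like object that `pd.read_csv` accepts should work).

```python
import io

import pandas as pd


def build_report(csv_file) -> str:
    """Return a plain-text EDA summary for a CSV path or file-like object."""
    df = pd.read_csv(csv_file)
    numeric = df.select_dtypes(include="number")
    parts = [
        f"Rows: {len(df)}, Columns: {df.shape[1]}",
        "Missing values per column:",
        df.isna().sum().to_string(),
        "Summary statistics:",
        numeric.describe().to_string(),
    ]
    # A correlation matrix only makes sense with two or more numeric columns.
    if numeric.shape[1] >= 2:
        parts += ["Correlation matrix:", numeric.corr().to_string()]
    return "\n".join(parts)


# Hypothetical in-memory CSV standing in for an uploaded file.
sample = io.StringIO(
    "age,income,segment\n25,40000,a\n32,52000,b\n47,,a\n51,88000,c\n"
)
report = build_report(sample)
print(report)
```

The returned string could be written to a file or offered as a download; how the app formats its own report is not documented here.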
## Example
Here's an example of how to use Elevate EDA on a sample dataset:
1. Upload your data using the file uploader.
2. View visualizations like correlation heatmaps, box plots, and pair plots.
3. Analyze outliers and generate descriptive statistics.
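For the outlier step, one common approach is the 1.5 × IQR rule that box-plot whiskers are usually based on. The sketch below assumes that rule; the app may use a different method, and `iqr_outliers` and the `price` series are hypothetical.

```python
import pandas as pd


def iqr_outliers(values: pd.Series, k: float = 1.5) -> pd.Series:
    """Return the entries lying more than k * IQR outside the quartiles."""
    q1 = values.quantile(0.25)
    q3 = values.quantile(0.75)
    iqr = q3 - q1
    mask = (values < q1 - k * iqr) | (values > q3 + k * iqr)
    return values[mask]


# Hypothetical sample column with one obviously extreme value.
prices = pd.Series([10, 12, 11, 13, 12, 11, 95], name="price")
outliers = iqr_outliers(prices)

print(prices.describe())  # descriptive statistics
print(outliers)           # rows flagged by the IQR rule
```

Raising `k` (for example to 3.0) flags only more extreme values.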
## Screenshots
- Main Dashboard
- Visualizations: Bar Chart, Scatter Plot, Density Contour
- Correlation Matrix
- K-Means Clustering
- Cluster Summary
- Data Profile
- Download Filtered Data
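The K-Means Clustering and Cluster Summary views named above could be approximated with scikit-learn and pandas as follows. This is a sketch under assumptions, not the app's implementation: the data, the column names, and the choice of two clusters are all hypothetical.

```python
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Hypothetical customer data with two visibly separate groups.
df = pd.DataFrame({
    "annual_spend": [200, 220, 210, 900, 950, 920],
    "visits": [2, 3, 2, 20, 22, 21],
})

# Scale features so neither column dominates the distance computation.
scaled = StandardScaler().fit_transform(df)

# n_clusters=2 is picked by hand for this toy data.
model = KMeans(n_clusters=2, n_init=10, random_state=0)
df["cluster"] = model.fit_predict(scaled)

# Cluster summary: size and per-feature mean of each cluster.
cluster_summary = df.groupby("cluster").agg(
    size=("visits", "size"),
    mean_spend=("annual_spend", "mean"),
    mean_visits=("visits", "mean"),
)
print(cluster_summary)
```

Cluster labels are arbitrary integers, so which group is labeled 0 or 1 can differ between runs with other seeds.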

## Future Enhancements
- **Feature Selection and Engineering Tools:** Automated feature selection for more advanced EDA.
- **Data Preprocessing:** Clean and preprocess your data, including handling missing values and outliers, fixing formatting, and scaling features.
- **Machine Learning Integration:** Add basic machine learning model training to the tool.
- **Custom Reports:** Further customization options for automated reports.
- **More Visualization Options:** Advanced and custom charting options.
## Contribution
We welcome contributions! To contribute:
1. Fork the repository.
2. Create a new branch for your feature/bugfix.
3. Submit a pull request.
## License
This project is licensed under the MIT License - see the [LICENSE](https://github.com/Jenishajustin/Elevate_EDA/blob/main/LICENSE) file for details.
## Contact
If you have any questions or suggestions, feel free to reach out at:
- **Email**: jjenisha950@gmail.com
- **GitHub**: [Jenishajustin](https://github.com/Jenishajustin)