Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/saksham-jain177/automated-data-analysis-and-visualization
About Automated Data Analysis and Visualization is a Streamlit web application designed for quick and insightful data analysis. Users can easily upload CSV files, perform automated preprocessing, and generate interactive visualizations such as histograms, scatter plots, and heatmaps.
https://github.com/saksham-jain177/automated-data-analysis-and-visualization
automated-reporting data-analysis data-preprocessing data-science data-visualization datasets exploratory-data-analysis interactive-visualizations machine-learning python streamlit
Last synced: 17 days ago
JSON representation
About Automated Data Analysis and Visualization is a Streamlit web application designed for quick and insightful data analysis. Users can easily upload CSV files, perform automated preprocessing, and generate interactive visualizations such as histograms, scatter plots, and heatmaps.
- Host: GitHub
- URL: https://github.com/saksham-jain177/automated-data-analysis-and-visualization
- Owner: saksham-jain177
- License: mit
- Created: 2024-09-04T13:10:03.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2024-09-04T13:39:54.000Z (5 months ago)
- Last Synced: 2024-11-14T09:20:42.084Z (3 months ago)
- Topics: automated-reporting, data-analysis, data-preprocessing, data-science, data-visualization, datasets, exploratory-data-analysis, interactive-visualizations, machine-learning, python, streamlit
- Language: Python
- Homepage: https://automated-data-analysis-visualization.streamlit.app/
- Size: 6.84 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Automated Data Analysis and Visualization
## Overview
This Streamlit application provides an automated solution for data analysis and visualization. It allows users to upload CSV files, perform basic data preprocessing, generate visualizations, and even train simple machine learning models.
## Features
- Data upload via CSV file
- Basic data information display
- Handling of missing values
- Data normalization and standardization
- Feature engineering (polynomial features)
- Data visualization:
- Histograms
- Correlation heatmaps
- Scatter plots
- Feature importance calculation
- Machine Learning model training and evaluation
- Automated report generation## Installation
1. Clone this repository:
```
git clone https://github.com/saksham-jain177/Automated-Data-Analysis-and-Visualization.git
```
2. Navigate to the project directory:
```
cd Automated-Data-Analysis-and-Visualization
```
3. Install the required packages:
```
pip install -r requirements.txt
```## Usage
Run the Streamlit app:
``` streamlit run auto.py ```Then, follow these steps:
1. Upload your CSV file
2. Choose options for handling missing values and data normalization
3. Explore the generated visualizations
4. Optionally, train and evaluate machine learning models
5. Download the automated report## Dependencies
- streamlit
- pandas
- matplotlib
- seaborn
- plotly
- scikit-learnFor a complete list of dependencies, see `requirements.txt`.
## Contributing
Contributions to this project are welcome! Feel free to submit pull requests or open issues to suggest improvements or report bugs.
## License
This project is open source and available under the [MIT License](LICENSE).
## Acknowledgements
This project was created as a tool for automated data analysis and visualization. It's designed to be a starting point for data exploration and can be extended with additional features and capabilities.
---
Any changes and improvements are welcome! If you have ideas to enhance this tool or find any issues, please don't hesitate to contribute or reach out.