https://github.com/cuadernin/resumendataframe
Interfaz gráfica que te permite realizar un breve análisis de datos.
https://github.com/cuadernin/resumendataframe
gui pandas pyqt5 pyqt5-desktop-application python3 qt5 summary-statistics
Last synced: 2 months ago
JSON representation
Interfaz gráfica que te permite realizar un breve análisis de datos.
- Host: GitHub
- URL: https://github.com/cuadernin/resumendataframe
- Owner: Cuadernin
- Created: 2021-05-08T21:19:16.000Z (about 4 years ago)
- Default Branch: master
- Last Pushed: 2021-06-22T02:58:34.000Z (almost 4 years ago)
- Last Synced: 2024-05-17T16:59:20.012Z (about 1 year ago)
- Topics: gui, pandas, pyqt5, pyqt5-desktop-application, python3, qt5, summary-statistics
- Language: Python
- Homepage:
- Size: 161 KB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# GUI for Data Analysis
![]()
GUI designed in Python with PyQt5 that brings together different tools used in data analysis.
It only allows .csv files but you can modify the code to read other files.## Start 🚀
Main program is called
```
Metodos.py
```
## Use 📦
The GUI is divided into two parts.
* The first consists of a compilation of the graphs most used in exploratory analysis. In two of them, you must select a main variable.
* The second part is about calculations and some data anaysis. You can find the data distribution by two ways and using [sweetviz](https://pypi.org/project/sweetviz/) you can see a dashboard in html format with all the summary about the dataframe. This package is widely used in data analysis.## Methods 📌
In this case, three methods of machine learning are used:
* Random Forest -----> For more info about this method check [Random Forest](https://builtin.com/data-science/random-forest-algorithm)
* Ridge y Lasso -----> For more info about this method check *[Ridge](https://www.analyticsvidhya.com/blog/2016/01/ridge-lasso-regression-python-complete-tutorial/)
* kNN -----> For more info about this method check *[kNN](https://realpython.com/knn-python/)In * you can find, in addition to the explanation of the method, a Python code as an example.
## Notes 📖
Finally, there is a group box called options where you can define the percentage of data to train in the model as well as define the name of the excel or txt according to the chosen output.If you choose a machine learning model, the output will be an excel file with three sheets and an extra one if you want it (Descriptive statistics). With linear regression a txt file is displayed because [statsmodel](https://www.statsmodels.org/stable/index.html) is used.
The GUI is in **Spanish** and some codes have comments for your better understanding.
**If you choose more than one action, the application will probably freeze for a few seconds until all operations are finished. This is normal because after a large number of actions the QT application tends to freeze and needs to be optimized.**