Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/data-describe/data-describe
data⎰describe: Pythonic EDA Accelerator for Data Science
https://github.com/data-describe/data-describe
analysis data-science eda exploratory-data-analysis pypi
Last synced: 3 months ago
JSON representation
data⎰describe: Pythonic EDA Accelerator for Data Science
- Host: GitHub
- URL: https://github.com/data-describe/data-describe
- Owner: data-describe
- License: other
- Created: 2020-05-04T17:58:14.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2023-02-22T05:20:46.000Z (almost 2 years ago)
- Last Synced: 2024-10-02T22:45:17.654Z (5 months ago)
- Topics: analysis, data-science, eda, exploratory-data-analysis, pypi
- Language: Python
- Homepage:
- Size: 126 MB
- Stars: 295
- Watchers: 13
- Forks: 18
- Open Issues: 77
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
- awesome-python-machine-learning-resources - GitHub - 28% open · ⏱️ 19.11.2021): (数据可视化)
README
[data:image/s3,"s3://crabby-images/916e7/916e7d0a47a73f30e2c2f7541d27e55bafa6f3e2" alt="PyPI status"](https://pypi.python.org/pypi/data-describe/)
[data:image/s3,"s3://crabby-images/0f8d8/0f8d80becee55d79bfd6f23de3c18f5c88e0864f" alt="PyPI license"](https://pypi.python.org/pypi/data-describe/)
[data:image/s3,"s3://crabby-images/fe16e/fe16e6b3dd6dd6a71a5e1b97754aa08ac683934d" alt="Downloads"](https://pepy.tech/project/data-describe/month)[data:image/s3,"s3://crabby-images/b976e/b976ea3d44f447397ae6bfc8c594a4447de124e4" alt="PyPI version shields.io"](https://pypi.python.org/pypi/data-describe/)
[data:image/s3,"s3://crabby-images/76cdb/76cdb297bc7831aab0152bf76098894ee0725dce" alt="PyPI pyversions"](https://pypi.python.org/pypi/data-describe/)
[data:image/s3,"s3://crabby-images/c9b96/c9b964a17be9cab53faa13657e6697d484e6f72f" alt="codecov"](undefined)
# data ⎰ describe[data-describe](https://data-describe.ai/) is a Python toolkit for Exploratory Data Analysis (EDA). It aims to accelerate data exploration and analysis by providing automated and polished analysis widgets.
For more examples of data-describe in action, see the [Quick Start Tutorial](https://data-describe.ai/docs/master/_notebooks/quick_start.html).
## Main Features
data-describe implements the following basic features:
| Feature | Description |
| ----------- | ----------- |
| Data Summary | Curated data summary |
| Data Heatmap | Data variation and missingness heatmap |
| Correlation Matrix | Correlation heatmaps with categorical support |
| Distribution Plots | Generate histograms, violin plots, bar charts |
| Scatterplots | Generate scatterplots and evaluate with scatterplot diagnostics |
| Cluster Analysis | Automated clustering and plotting |
| Feature Ranking | Evaluate feature importance using tree models |## Extended Features
data-describe is always looking to elevate the standard for Exploratory Data Analysis. Here are just a few that are implemented:
* Dimensionality Reduction Methods
* Sensitive Data (PII) Redaction
* Text Pre-processing / Topic Modeling
* Big Data Support## Installation
data-describe can be installed using pip:
```
pip install data-describe
```## Getting Started
```python
import data_describe as dd
help(dd)
```See the [User Guide](https://data-describe.ai/docs/master/_notebooks/user_guide.html) for more information.
## Project Status
data-describe is currently in **beta** status.
## Contributing
data-describe welcomes [contributions from the community](./CONTRIBUTING.md).