{"id":13608111,"url":"https://github.com/data-describe/data-describe","last_synced_at":"2025-04-12T14:32:12.468Z","repository":{"id":38208328,"uuid":"261257841","full_name":"data-describe/data-describe","owner":"data-describe","description":"data⎰describe: Pythonic EDA Accelerator for Data Science ","archived":false,"fork":false,"pushed_at":"2023-02-22T05:20:46.000Z","size":131774,"stargazers_count":299,"open_issues_count":77,"forks_count":19,"subscribers_count":12,"default_branch":"master","last_synced_at":"2025-04-10T17:18:36.439Z","etag":null,"topics":["analysis","data-science","eda","exploratory-data-analysis","pypi"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/data-describe.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2020-05-04T17:58:14.000Z","updated_at":"2025-02-03T10:26:25.000Z","dependencies_parsed_at":"2023-02-16T16:16:21.244Z","dependency_job_id":"4d82ed92-e956-4a3e-8b48-7b8ce68d75f0","html_url":"https://github.com/data-describe/data-describe","commit_stats":{"total_commits":641,"total_committers":14,"mean_commits":"45.785714285714285","dds":0.594383775351014,"last_synced_commit":"eafc810ac20f6c0fb426732f29452756df96b278"},"previous_names":[],"tags_count":5,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/data-describe%2Fdata-describe","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/data-describe%2Fdata-describe/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/data-describe%
2Fdata-describe/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/data-describe%2Fdata-describe/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/data-describe","download_url":"https://codeload.github.com/data-describe/data-describe/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248581273,"owners_count":21128139,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["analysis","data-science","eda","exploratory-data-analysis","pypi"],"created_at":"2024-08-01T19:01:24.345Z","updated_at":"2025-04-12T14:32:07.459Z","avatar_url":"https://github.com/data-describe.png","language":"Python","funding_links":[],"categories":["Python","数据可视化"],"sub_categories":[],"readme":"[![PyPI status](https://img.shields.io/pypi/status/data-describe.svg)](https://pypi.python.org/pypi/data-describe/)\n[![PyPI license](https://img.shields.io/pypi/l/data-describe.svg)](https://pypi.python.org/pypi/data-describe/)\n[![Downloads](https://pepy.tech/badge/data-describe/month)](https://pepy.tech/project/data-describe/month)\n\n[![PyPI version shields.io](https://img.shields.io/pypi/v/data-describe.svg)](https://pypi.python.org/pypi/data-describe/)\n[![PyPI pyversions](https://img.shields.io/pypi/pyversions/data-describe.svg)](https://pypi.python.org/pypi/data-describe/)\n[![codecov](https://codecov.io/gh/data-describe/data-describe/branch/master/graph/badge.svg?token=CY0M5NAMXH)](undefined)\n# data ⎰ describe\n\n[data-describe](https://data-describe.ai/) is a Python toolkit for 
Exploratory Data Analysis (EDA). It aims to accelerate data exploration and analysis by providing automated and polished analysis widgets.\n\n\u003cimg src=\"/docs/imgs/demo.gif\" height=\"382\" width=\"610\" /\u003e\n\nFor more examples of data-describe in action, see the [Quick Start Tutorial](https://data-describe.ai/docs/master/_notebooks/quick_start.html).\n\n## Main Features\n\ndata-describe implements the following basic features:\n\n| Feature      | Description |\n| ----------- | ----------- |\n| Data Summary      | Curated data summary       |\n| Data Heatmap   | Data variation and missingness heatmap        |\n| Correlation Matrix   | Correlation heatmaps with categorical support        |\n| Distribution Plots   | Generate histograms, violin plots, bar charts        |\n| Scatterplots   | Generate scatterplots and evaluate with scatterplot diagnostics        |\n| Cluster Analysis   | Automated clustering and plotting        |\n| Feature Ranking   | Evaluate feature importance using tree models        |\n\n## Extended Features\n\ndata-describe is always looking to elevate the standard for Exploratory Data Analysis. 
Here are a few of the extended features that are already implemented:\n\n* Dimensionality Reduction Methods\n* Sensitive Data (PII) Redaction\n* Text Pre-processing / Topic Modeling\n* Big Data Support\n\n\n## Installation\n\ndata-describe can be installed using pip:\n\n```\npip install data-describe\n```\n\n## Getting Started\n\n```python\nimport data_describe as dd\nhelp(dd)\n```\n\nSee the [User Guide](https://data-describe.ai/docs/master/_notebooks/user_guide.html) for more information.\n\n## Project Status\n\ndata-describe is currently in **beta** status.\n\n## Contributing\n\ndata-describe welcomes [contributions from the community](./CONTRIBUTING.md).\n\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdata-describe%2Fdata-describe","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdata-describe%2Fdata-describe","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdata-describe%2Fdata-describe/lists"}