{"id":23116085,"url":"https://github.com/danielpuentee/outdpik","last_synced_at":"2025-08-16T21:32:23.059Z","repository":{"id":61769901,"uuid":"554235431","full_name":"DanielPuentee/outdpik","owner":"DanielPuentee","description":"The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.","archived":false,"fork":false,"pushed_at":"2022-10-27T21:05:04.000Z","size":665,"stargazers_count":3,"open_issues_count":0,"forks_count":3,"subscribers_count":2,"default_branch":"main","last_synced_at":"2024-12-15T12:45:16.463Z","etag":null,"topics":["data-analysis","matplotlib","numpy","python"],"latest_commit_sha":null,"homepage":"https://pypi.org/project/outdpik/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/DanielPuentee.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"license.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-10-19T13:26:55.000Z","updated_at":"2023-01-05T15:34:16.000Z","dependencies_parsed_at":"2022-10-20T22:15:17.298Z","dependency_job_id":null,"html_url":"https://github.com/DanielPuentee/outdpik","commit_stats":null,"previous_names":[],"tags_count":6,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DanielPuentee%2Foutdpik","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DanielPuentee%2Foutdpik/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DanielPuentee%2Foutdpik/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DanielPuentee%2Foutdpik/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/DanielPuentee","download_url":"https://codeload.github.com/DanielPuentee/outdpik/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":230061413,"owners_count":18166703,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","matplotlib","numpy","python"],"created_at":"2024-12-17T04:10:59.101Z","updated_at":"2024-12-17T04:10:59.995Z","avatar_url":"https://github.com/DanielPuentee.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n\n\u003ch1 align=\"center\"\u003e\n\u003cimg src=\"https://github.com/DanielPuentee/outdpik/blob/main/branding/logo/primary/outdpik.png?raw=true\" width=\"300\"\u003e\n\u003c/h1\u003e\u003cbr\u003e\n\n# outdpik: Fundamental toolkit for outlier analysis and visualization\n\n[![PyPI Latest Release](https://img.shields.io/pypi/v/outdpik.svg)](https://pypi.org/project/outdpik/)\n[![PyPI License](https://img.shields.io/pypi/l/outdpik.svg)](license.txt)\n[![Package Status](https://img.shields.io/pypi/status/pandas.svg)](https://pypi.org/project/outdpik/)\n[![Documentation Status](https://readthedocs.org/projects/outdpik/badge/?version=latest)](https://outdpik.readthedocs.io/en/latest/)\n[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)\n\n## What is it?\nOutdpik is an open source Python package that provides different methods for outlier detection. \nIt aims to be the fundamental high-level package for this purpose. \nAdditionally, it offers visualization methods for the outlier analysis.\n\n## Main Features\nHere are just a few of the things that outdpik does well:\n\n- It supports numpy arrays and pandas dataframes\n- Multiple outlier detection techniques that can be combined\n- Powerful visualizations\n- Flexible at including one or more columns for the analysis\n\n## Where to get it\nThe source code is currently hosted on GitHub at:\nhttps://github.com/DanielPuentee/outdpik\n\nInstaller for the latest released version is available at the [Python\nPackage Index (PyPI)](https://pypi.org/project/outdpik)\n\n```sh\n# PyPI\npip install outdpik\n```\n\n## How to use outdpik\nExamples of configuring and running outpdik:\n\n```python\nimport outpdik as outdp\noutdp = outdp()\n```\n\nWe proceed to detect outliers returning a dictionary of numeric features and the outliers instances:\n\n```python\noutliers_dict = outdp.outliers(df = df, cols = \"all\")\n```\nPlotting advantages:\n\n```python\noutdp.plot_outliers(df = df, col = \"x\")\n```\n\u003ch1 align=\"center\"\u003e\n\u003cimg src=\"https://github.com/DanielPuentee/outdpik/blob/main/branding/logo/primary/graph.png?raw=true\" width=450 alt=\"Strip plot outliers detection\"\u003e\n\u003c/h1\u003e\u003cbr\u003e\n\n## Dependencies\n- [pandas - Provides fast, flexible, and expressive data structures designed to make working with \"relational\" or \"labeled\" data both easy and intuitive](https://pandas.pydata.org/)\n- [NumPy - Adds support for large, multi-dimensional arrays, matrices and high-level mathematical functions to operate on these arrays](https://www.numpy.org)\n- [SciPy - Includes modules for statistics, optimization, integration, linear algebra, Fourier transforms, signal and image processing, ODE solvers, and more](https://scipy.org/)\n- [matplotlib - Comprehensive library for creating static, animated, and interactive visualizations in Python](https://matplotlib.org/)\n- [seaborn - Provides a high-level interface for drawing attractive statistical graphics](https://seaborn.pydata.org/)\n\n## License\nThis project is licensed under the terms of the [GNU](https://github.com/DanielPuentee/outdpik/blob/main/license.txt) - see the LICENSE file for details.\n\n## Documentation\nThe official documentation is hosted on: https://outdpik.readthedocs.io/en/latest/\n\n## Development\nWant to contribute? Great!\nOpen a discussion in Github in this repo and we will answer as soon as possible.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdanielpuentee%2Foutdpik","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdanielpuentee%2Foutdpik","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdanielpuentee%2Foutdpik/lists"}