https://github.com/skrub-data/skrub
Machine learning with dataframes
https://github.com/skrub-data/skrub
data data-analysis data-cleaning data-preparation data-preprocessing data-science data-wrangling dataframe dataframes dirty-data machine-learning
Last synced: 18 days ago
JSON representation
Machine learning with dataframes
- Host: GitHub
- URL: https://github.com/skrub-data/skrub
- Owner: skrub-data
- License: bsd-3-clause
- Created: 2018-03-12T23:32:22.000Z (almost 8 years ago)
- Default Branch: main
- Last Pushed: 2025-05-06T15:31:20.000Z (9 months ago)
- Last Synced: 2025-05-06T16:50:08.415Z (9 months ago)
- Topics: data, data-analysis, data-cleaning, data-preparation, data-preprocessing, data-science, data-wrangling, dataframe, dataframes, dirty-data, machine-learning
- Language: Python
- Homepage: https://skrub-data.org/
- Size: 12.4 MB
- Stars: 1,380
- Watchers: 20
- Forks: 128
- Open Issues: 100
-
Metadata Files:
- Readme: README.rst
- Changelog: CHANGES.rst
- Contributing: CONTRIBUTING.rst
- License: LICENSE.txt
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
- fucking-awesome-machine-learning - skrub - Skrub is a Python library that eases preprocessing and feature engineering for machine learning on dataframes. (Python / General-Purpose Machine Learning)
- awesome-machine-learning - skrub - Skrub is a Python library that eases preprocessing and feature engineering for machine learning on dataframes. (Python / General-Purpose Machine Learning)
README
skrub
=====
.. image:: https://skrub-data.github.io/stable/_static/skrub.svg
:align: center
:width: 50 %
:alt: skrub logo
|py_ver| |pypi_var| |pypi_dl| |codecov| |circleci| |black|
.. |py_ver| image:: https://img.shields.io/pypi/pyversions/skrub
.. |pypi_var| image:: https://img.shields.io/pypi/v/skrub?color=informational
.. |pypi_dl| image:: https://img.shields.io/pypi/dm/skrub
.. |codecov| image:: https://img.shields.io/codecov/c/github/skrub-data/skrub/main
.. |circleci| image:: https://img.shields.io/circleci/build/github/skrub-data/skrub/main?label=CircleCI
.. |black| image:: https://img.shields.io/badge/code%20style-black-000000.svg
**skrub** (formerly *dirty_cat*) is a Python
library that facilitates doing machine learning with dataframes.
If you like the package, spread the word and ⭐ this repository!
You can also join the `discord server `_.
Website: https://skrub-data.org/
See our `examples `_, or check out
the `learning materials `_.
Installation
------------
skrub can easily be installed via ``pip`` or ``conda``. For more installation information, see
the `installation instructions `_.
Contributing
------------
The best way to support the development of skrub is to spread the word!
Also, if you already are a skrub user, we would love to hear about your use cases and challenges in the `Discussions `_ section.
To report a bug or suggest enhancements, please
`open an issue `_.
If you want to contribute directly to the library, then check the
`how to contribute `_ page on
the website for more information.