https://github.com/dirty-cat/dirty_cat
Machine learning on dirty tabular data (legacy clone of skrub)
https://github.com/dirty-cat/dirty_cat
Last synced: 6 months ago
JSON representation
Machine learning on dirty tabular data (legacy clone of skrub)
- Host: GitHub
- URL: https://github.com/dirty-cat/dirty_cat
- Owner: dirty-cat
- License: bsd-3-clause
- Fork: true (skrub-data/skrub)
- Created: 2023-05-16T07:12:13.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-05-30T08:55:26.000Z (over 2 years ago)
- Last Synced: 2024-04-22T12:32:47.735Z (over 1 year ago)
- Language: Python
- Homepage: https://dirty-cat.github.io/
- Size: 2.26 MB
- Stars: 9
- Watchers: 0
- Forks: 1
- Open Issues: 1
Awesome Lists containing this project
- data-matching-software - dirty-cat
- awesome-python-data-science - dirty_cat - Machine learning on dirty tabular data (especially: string-based variables for classifcation and regression). <img height="20" src="img/sklearn_big.png" alt="sklearn"> (Feature Engineering / General)
- awesome-machine-learning - dirty_cat - facilitates machine-learning on dirty, non-curated categories. It provides transformers and encoders robust to morphological variants, such as typos. (Python / General-Purpose Machine Learning)
- awesome-machine-learning - dirty_cat - facilitates machine-learning on dirty, non-curated categories. It provides transformers and encoders robust to morphological variants, such as typos. (Python / General-Purpose Machine Learning)
- awesome-machine-learning - dirty_cat - facilitates machine-learning on dirty, non-curated categories. It provides transformers and encoders robust to morphological variants, such as typos. (Python / General-Purpose Machine Learning)