https://github.com/dataiku/dss-plugin-nlp-preparation
Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼
https://github.com/dataiku/dss-plugin-nlp-preparation
auto-correct dataiku dss-plugin language-detection language-identification natural-language-processing nlp spell-checker text-cleaning
Last synced: 7 days ago
JSON representation
Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼
- Host: GitHub
- URL: https://github.com/dataiku/dss-plugin-nlp-preparation
- Owner: dataiku
- License: apache-2.0
- Created: 2020-02-14T12:32:28.000Z (about 6 years ago)
- Default Branch: main
- Last Pushed: 2026-01-28T13:49:32.000Z (2 months ago)
- Last Synced: 2026-01-29T00:02:05.322Z (2 months ago)
- Topics: auto-correct, dataiku, dss-plugin, language-detection, language-identification, natural-language-processing, nlp, spell-checker, text-cleaning
- Language: Python
- Homepage: https://www.dataiku.com/product/plugins/nlp-preparation/
- Size: 18 MB
- Stars: 22
- Watchers: 20
- Forks: 6
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# Text Preparation Plugin
 

This Dataiku DSS plugin provides recipes to detect languages, correct misspellings and clean text data.
Documentation: https://www.dataiku.com/product/plugins/nlp-preparation/
## Release notes
See the [changelog](CHANGELOG.md) for a history of notable changes to this plugin.
## License
This plugin is distributed under the [Apache License version 2.0](LICENSE).