https://github.com/jonathanraiman/ciseau
:rocket: Tokenize and clean strings in Python
https://github.com/jonathanraiman/ciseau
natural-language-processing python text text-analysis tokenizer xml
Last synced: 12 months ago
JSON representation
:rocket: Tokenize and clean strings in Python
- Host: GitHub
- URL: https://github.com/jonathanraiman/ciseau
- Owner: JonathanRaiman
- License: mit
- Created: 2014-09-29T01:17:11.000Z (over 11 years ago)
- Default Branch: master
- Last Pushed: 2018-01-11T07:27:46.000Z (over 8 years ago)
- Last Synced: 2024-11-12T20:49:28.828Z (over 1 year ago)
- Topics: natural-language-processing, python, text, text-analysis, tokenizer, xml
- Language: Python
- Homepage:
- Size: 39.1 KB
- Stars: 13
- Watchers: 3
- Forks: 4
- Open Issues: 1