Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/anze3db/corpus_cleaner


https://github.com/anze3db/corpus_cleaner

Last synced: 7 days ago
JSON representation

Awesome Lists containing this project

README

        

# Corpus Cleaner

Prepare text for tagging with TreeTagger.

```
# 1. set the path to tree-tagger and slovenian-utf8.par in __main__.py or copy them to the root directory
# 2. Install Poetry dependencies
# 3. Run `poetry run python cleaner`
```