An open API service indexing awesome lists of open source software.

https://github.com/jamnicki/split-corpus

Split-corpus is a package that divides text corpora into meaningful parts, each as close to a specified size as possible.
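
To make the idea in the description concrete, here is a minimal sketch of size-targeted splitting. It is not the split-corpus API: the function name `split_text`, the `target_size` parameter, the use of blank-line-separated paragraphs as the "meaningful" unit, and measuring size in characters are all assumptions made for illustration.

```python
def split_text(text: str, target_size: int) -> list[str]:
    """Greedily pack paragraphs into parts of at most target_size characters.

    Hypothetical helper, not part of split-corpus. A paragraph is never cut,
    so a single paragraph longer than target_size becomes its own oversized part.
    """
    # Treat blank-line-separated paragraphs as the indivisible units (an assumption).
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]

    parts: list[str] = []
    current: list[str] = []
    current_len = 0

    for para in paragraphs:
        # Joining with "\n\n" costs two extra characters once the part is non-empty.
        added = len(para) + (2 if current else 0)
        if current and current_len + added > target_size:
            # Adding this paragraph would overshoot: close the part and start a new one.
            parts.append("\n\n".join(current))
            current = [para]
            current_len = len(para)
        else:
            current.append(para)
            current_len += added

    if current:
        parts.append("\n\n".join(current))
    return parts
```

Calling `split_text(corpus, 10_000)` on a string `corpus` would return a list of parts, each ending on a paragraph boundary. A greedy pass like this only keeps parts at or under the target; getting each part as close to the target as possible, as the description promises, may call for a different strategy.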

corpora corpus-processing large-files natural-language-processing nlp processing

Last synced: 4 months ago
