Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with corpus-processing
A curated list of projects in awesome lists tagged with corpus-processing.
https://github.com/hankcs/treebankpreprocessing
Python scripts for preprocessing the Penn Treebank and Chinese Treebank
corpus-processing natural-language-processing
Last synced: 27 Oct 2024
https://github.com/jaytimm/corpuslingr
A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction.
corpus-processing corpus-search corpus-tools
Last synced: 22 Nov 2024
https://github.com/cscfi/kielipankki-utilities
Scripts for data conversion
corpus-processing corpus-tools korp vrt
Last synced: 10 Nov 2024
https://github.com/clariah/wp6-missieven
General Missives in Text-Fabric
corpus-data corpus-linguistics corpus-processing corpus-tools dutch history nlp
Last synced: 12 Dec 2024
https://github.com/frankier/stiff
Sense Tagged Instances For Finnish
corpus-processing linguistic-corpora nlp word-sense-disambiguation wsd
Last synced: 08 Nov 2024
https://github.com/jamnicki/split-corpus
The split-corpus package divides text corpora into meaningful parts that are as close to a specified size as possible.
corpora corpus-processing large-files natural-language-processing nlp processing
Last synced: 21 Dec 2024
https://github.com/rodrigofrancisco/pln
Natural language processing tasks
Last synced: 30 Nov 2024
https://github.com/ketanmehra003/parallel-corpus-management-tool
This project is designed to help manage and analyze large corpora of text data. It provides tools for importing, processing, and querying text data efficiently.
corpus corpus-data corpus-processing corpus-tools django language-translator-api machine-learning python3
Last synced: 11 Dec 2024
https://github.com/uudigitalhumanitieslab/ianalyzer-readers
Pre-processing functionality used in I-analyzer
Last synced: 30 Nov 2024
https://github.com/mosesab/language-text-extraction-
Takes input text and extracts the sentences written in a given language, using that language's lexicon.
corpus corpus-processing corpus-search english language-processing language-resources languages natural-language-processing nlp python-programming python-standard-library python3 text-processing
Last synced: 12 Nov 2024