An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with corpus-tools

A curated list of projects in awesome lists tagged with corpus-tools .

https://github.com/koskenni/beta

An open source reimplementation of Benny Brodda's BETA in Python

benny-brodda beta corpus-tools hyphenation linguistics open-source string-manipulation string-rewriting

Last synced: 01 May 2025

https://github.com/lennes/spect

SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/

analysis annotation conversational-speech corpus-linguistics corpus-tools praat spect speech speech-analysis speech-corpus spoken-language transcript transcription

Last synced: 03 Apr 2025

https://github.com/languagemachines/piccl

A set of workflows for corpus building through OCR, post-correction and normalisation

computational-linguistics corpus-linguistics corpus-tools folia nlp ocr workflow

Last synced: 04 Dec 2024

https://github.com/LanguageMachines/PICCL

A set of workflows for corpus building through OCR, post-correction and normalisation

computational-linguistics corpus-linguistics corpus-tools folia nlp ocr workflow

Last synced: 02 Apr 2025

https://github.com/jaytimm/corpuslingr

A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction.

corpus-processing corpus-search corpus-tools

Last synced: 22 Nov 2024

https://github.com/liao961120/concordancer

Searching in-memory corpus with Corpus Query Language (CQL)

concordancer corpus-query-language corpus-tools python3

Last synced: 14 Apr 2025

https://github.com/cscfi/kielipankki-utilities

Scripts for data conversion

corpus-processing corpus-tools korp vrt

Last synced: 24 Apr 2025

https://github.com/rishit7/corpusqnatool

CorpusQnATool that uses a Corpus and chatGPT to for answers to input queries.

corpus-tools kmp-algorithm llm

Last synced: 28 Feb 2025

https://github.com/aitor-alvarez/emorabic

Tools for creating speech corpora by extracting audio from YouTube videos

audio corpus-tools speech speech-corpora speech-processing

Last synced: 20 Mar 2025

https://github.com/egorsmkv/asr-corpus-by-microphone

This is a simple solution for people who want to create own corpus for Automatic Speech Recognition with just a microphone

asr automatic-speech-recognition corpus corpus-tools

Last synced: 28 Mar 2025

https://github.com/unhammer/gt-corpustools

branches of https://victorio.uit.no/langtech/trunk/tools/CorpusTools used by Giellatekno.UiT.no for corpus gathering.

corpus-tools giellatekno

Last synced: 18 Feb 2025

https://github.com/ketanmehra003/parallel-corpus-management-tool

This project is designed to help manage and analyze large corpora of text data. It provides tools for importing, processing, and querying text data efficiently.

corpus corpus-data corpus-processing corpus-tools django language-translator-api machine-learning python3

Last synced: 30 Mar 2025