An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with linguistic-analysis

A curated list of projects in awesome lists tagged with linguistic-analysis .

https://github.com/dmitryryumin/interspeech-2023-24-papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission

Last synced: 20 Feb 2025

https://github.com/jtanwk/nytcrossword

An exploration of New York Times crossword answers from 1994-2017, i.e. the Will Shortz era.

crosswords dataviz linguistic-analysis nytimes nytimes-crossword rvest webscraping

Last synced: 20 Nov 2024

https://github.com/thu-keg/chatlog

⏳ ChatLog: Recording and Analysing ChatGPT Across Time

chatgpt detection evaluation feature-extraction knowledge linguistic-analysis time-series-analysis

Last synced: 21 Apr 2025

https://github.com/lsys/lexicalrichness

:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).

data-mining data-science information-retrieval lexical-analysis lexical-analyzer linguistic-analysis natural-language natural-language-processing nlp python

Last synced: 09 Apr 2025

https://github.com/sillsdev/FieldWorks

FieldWorks is a suite of software tools for language and cultural data, with support for complex scripts.

discourse-analysis lexicography linguistic-analysis

Last synced: 15 Nov 2024

https://github.com/arjo129/langcluster

A visuallization for cognates in various languages and how they spread

artificial-intelligence azure-functions clustering d3-visualization linguistic-analysis linguistics

Last synced: 23 Apr 2025

https://github.com/deeptiman/php-dom-parser-translation-tool

A Simple DOM Parser and Translation Tool using PHP, HTML, and MySQL. The translation model is supported for English to Odia language. There is a built in dictionary to support the translation.

apache corpus corpus-tool dom-parser linguist linguistic-analysis linguistic-corpora moses-machine-translation mysql odia-language parallel-corpus parser-generator parser-library php phpmyadmin statistical-machine-translation tomcat-server translation-service translation-tool

Last synced: 01 Jan 2025

https://github.com/alschmut/code2semantics

Parse software-code for semantic identifier names

antlr4 identifier-splitting linguistic-analysis parser python semantic-parser word2vec

Last synced: 18 Feb 2025

https://github.com/nikisetti01/hadoop-mapreduce-letterfrequency-analysis

Simple example of Hadoop Application count letter, with an intersting Romance Language Analysis

hadoop-mapreduce java linguistic-analysis python3

Last synced: 04 Mar 2025

https://github.com/phughesmcr/liwcjs-dictionary

Parse and manipulate multiple LIWC dictionary files.

linguistic-analysis linguistics liwc liwc-dictionaries word-count wordcount

Last synced: 30 Mar 2025

https://github.com/rec0de/tinytawc

A simple script for text analysis using LIWC-compatible dictionaries

linguistic-analysis linguistics liwc

Last synced: 11 Apr 2025

https://github.com/alichtman/text-language-identifier

Accurately identify written English, French or Italian text with up to 99% accuracy.

bigram-model language-identification language-model linguistic-analysis n-grams text-classification-python text-processing

Last synced: 07 Apr 2025

https://github.com/alaaalzahrani/jiwar

Jiwar: A calculator for orthographic, phonological and phonographic neighborhood measures. Supports 40+ languages.

linguistic-analysis linguistics linguistics-field

Last synced: 30 Mar 2025

https://github.com/acdh-oeaw/mara_nlp_suite

Unified NLP research platform for the project MARA: MEDIA REPORTING ON ALGORITHMS, ROBOTICS AND ARTIFICIAL INTELLIGENCE

data-science linguistic-analysis newspaper newspaper-texts nlp nlp-models nlp-training

Last synced: 16 Mar 2025

https://github.com/kivanc57/rquests

The RQuest project uses R to analyze textual data, focusing on tasks like calculating word lengths, comparing languages, and extracting linguistic features with udpipe. It includes statistical methods, visualizations, and stochastic simulations, showcasing diverse approaches to text modeling.

data-science entropy language-comparison linguistic-analysis r statistical-testing stochastic-simulation t-test

Last synced: 10 Mar 2025

https://github.com/languagemachines/foliatest

Test suite for libfolia

cpp folia linguistic-analysis

Last synced: 31 Jan 2025

https://github.com/macbre/faroese-corpus

Some Faroese language statistics taken from fo.wikipedia.org content dump

corpus-linguistics faroe faroese faroese-language linguistic-analysis linguistics python3-script wikipedia-corpus wikipedia-dump

Last synced: 22 Feb 2025

https://github.com/crodriguez1a/kaggle-la-jobs

Helping the City of Los Angeles to structure and analyze its job descriptions

kaggle linguistic-analysis ml nlu python spacy

Last synced: 03 Apr 2025

https://github.com/ashithapallath/name-nationality-classifier-using-deeplearning

This project implements a deep learning-based classifier to identify whether a name is Indian or Non-Indian. By leveraging advanced neural networks to analyze name patterns, the classifier offers accurate predictions, with applications in demographic studies, personalized services, and more.

deep-learning linguistic-analysis

Last synced: 22 Mar 2025