Projects in Awesome Lists tagged with linguistic-analysis
A curated list of projects in awesome lists tagged with linguistic-analysis .
https://github.com/dmitryryumin/interspeech-2023-24-papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission
Last synced: 20 Feb 2025
https://github.com/brucewlee/lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
discourse feature-extraction flesch-kincaid lexical-analysis linguistic-analysis natural-language-processing nlp readability-metrics readability-scores semantic-analysis spacy syntactic-analysis text-classification text-simplification
Last synced: 12 Apr 2025
https://github.com/jtanwk/nytcrossword
An exploration of New York Times crossword answers from 1994-2017, i.e. the Will Shortz era.
crosswords dataviz linguistic-analysis nytimes nytimes-crossword rvest webscraping
Last synced: 20 Nov 2024
https://github.com/thu-keg/chatlog
⏳ ChatLog: Recording and Analysing ChatGPT Across Time
chatgpt detection evaluation feature-extraction knowledge linguistic-analysis time-series-analysis
Last synced: 21 Apr 2025
https://github.com/lsys/lexicalrichness
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
data-mining data-science information-retrieval lexical-analysis lexical-analyzer linguistic-analysis natural-language natural-language-processing nlp python
Last synced: 09 Apr 2025
https://github.com/sillsdev/FieldWorks
FieldWorks is a suite of software tools for language and cultural data, with support for complex scripts.
discourse-analysis lexicography linguistic-analysis
Last synced: 15 Nov 2024
https://github.com/arjo129/langcluster
A visuallization for cognates in various languages and how they spread
artificial-intelligence azure-functions clustering d3-visualization linguistic-analysis linguistics
Last synced: 23 Apr 2025
https://github.com/deeptiman/php-dom-parser-translation-tool
A Simple DOM Parser and Translation Tool using PHP, HTML, and MySQL. The translation model is supported for English to Odia language. There is a built in dictionary to support the translation.
apache corpus corpus-tool dom-parser linguist linguistic-analysis linguistic-corpora moses-machine-translation mysql odia-language parallel-corpus parser-generator parser-library php phpmyadmin statistical-machine-translation tomcat-server translation-service translation-tool
Last synced: 01 Jan 2025
https://github.com/sadielbartholomew/cf-standard-names-linguistics
Lexical & semantic analysis of the CF Conventions Standard Names
cf-conventions grammar lexical-analysis linguistic-analysis metadata parsing semantic-analysis
Last synced: 11 Apr 2025
https://github.com/dylan-profiler/tangled-up-in-unicode
Access to the Unicode Character Database (UCD)
data-analysis data-quality exploration linguistic-analysis linguistics python unicode
Last synced: 15 Apr 2025
https://github.com/alschmut/code2semantics
Parse software-code for semantic identifier names
antlr4 identifier-splitting linguistic-analysis parser python semantic-parser word2vec
Last synced: 18 Feb 2025
https://github.com/nikisetti01/hadoop-mapreduce-letterfrequency-analysis
Simple example of Hadoop Application count letter, with an intersting Romance Language Analysis
hadoop-mapreduce java linguistic-analysis python3
Last synced: 04 Mar 2025
https://github.com/phughesmcr/liwcjs-dictionary
Parse and manipulate multiple LIWC dictionary files.
linguistic-analysis linguistics liwc liwc-dictionaries word-count wordcount
Last synced: 30 Mar 2025
https://github.com/rec0de/tinytawc
A simple script for text analysis using LIWC-compatible dictionaries
linguistic-analysis linguistics liwc
Last synced: 11 Apr 2025
https://github.com/alichtman/text-language-identifier
Accurately identify written English, French or Italian text with up to 99% accuracy.
bigram-model language-identification language-model linguistic-analysis n-grams text-classification-python text-processing
Last synced: 07 Apr 2025
https://github.com/alaaalzahrani/jiwar
Jiwar: A calculator for orthographic, phonological and phonographic neighborhood measures. Supports 40+ languages.
linguistic-analysis linguistics linguistics-field
Last synced: 30 Mar 2025
https://github.com/acdh-oeaw/mara_nlp_suite
Unified NLP research platform for the project MARA: MEDIA REPORTING ON ALGORITHMS, ROBOTICS AND ARTIFICIAL INTELLIGENCE
data-science linguistic-analysis newspaper newspaper-texts nlp nlp-models nlp-training
Last synced: 16 Mar 2025
https://github.com/kingsdigitallab/dral-django
Distant Reading across Languages
digital-humanities distant-reading linguistic-analysis translation visualization web-api
Last synced: 23 Mar 2025
https://github.com/kivanc57/rquests
The RQuest project uses R to analyze textual data, focusing on tasks like calculating word lengths, comparing languages, and extracting linguistic features with udpipe. It includes statistical methods, visualizations, and stochastic simulations, showcasing diverse approaches to text modeling.
data-science entropy language-comparison linguistic-analysis r statistical-testing stochastic-simulation t-test
Last synced: 10 Mar 2025
https://github.com/macbre/faroese-corpus
Some Faroese language statistics taken from fo.wikipedia.org content dump
corpus-linguistics faroe faroese faroese-language linguistic-analysis linguistics python3-script wikipedia-corpus wikipedia-dump
Last synced: 22 Feb 2025
https://github.com/crodriguez1a/kaggle-la-jobs
Helping the City of Los Angeles to structure and analyze its job descriptions
kaggle linguistic-analysis ml nlu python spacy
Last synced: 03 Apr 2025
https://github.com/ashithapallath/name-nationality-classifier-using-deeplearning
This project implements a deep learning-based classifier to identify whether a name is Indian or Non-Indian. By leveraging advanced neural networks to analyze name patterns, the classifier offers accurate predictions, with applications in demographic studies, personalized services, and more.
deep-learning linguistic-analysis
Last synced: 22 Mar 2025
https://github.com/lsys/lexicaldiversity-example
Hosting MyBinder example for the LexicalRichness package.
binder binder-jupyter-notebook binder-ready binderhub data-mining data-science information-retrieval lexical-analysis lexical-analyzer linguistic-analysis natural-language natural-language-processing nlp python
Last synced: 31 Mar 2025