An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with language-processing

A curated list of projects in awesome lists tagged with language-processing.

https://github.com/pemistahl/lingua-rs

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

language-classification language-detection language-identification language-processing language-recognition natural-language-processing nlp nlp-machine-learning rust rust-crate rust-library

Last synced: 13 May 2025

https://github.com/MarginaliaSearch/MarginaliaSearch

Internet search engine for text-oriented websites. Indexing the small, old and weird web.

alt-search indexer internet-search language-processing no-ai-used no-cloud search-engine small-web web-crawler

Last synced: 05 Apr 2025

https://github.com/knadh/dictpress

A stand-alone web server application for building and publishing full fledged dictionary websites and APIs for any language.

academic academic-website dictionary dictionary-application language language-processing publishing thesaurus wordlist

Last synced: 16 Dec 2025

https://github.com/timkam/schreib-gut

German extension for write-good

cli language-processing linting node

Last synced: 11 Aug 2025

https://github.com/mapado/pynlg

``pynlg`` is a pure Python re-implementation of [SimpleNLG-EnFr](https://github.com/rali-udem/SimpleNLG-EnFr), a Java library enabling bilingual [text surface realisation](https://en.wikipedia.org/wiki/Realization_%28linguistics%29), based on [SimpleNLG](https://github.com/simplenlg/simplenlg).

language-processing python

Last synced: 30 Apr 2025

https://github.com/pigoz/lat

A set of tools to automate language acquisition through immersion. Includes sentence analysis (from books, subtitles) and Anki cards creation.

anki japanese kindle-highlights language-learning language-processing mpv sub2srs

Last synced: 13 Apr 2025

https://github.com/verifid/ner-d

Python module for Named Entity Recognition (NER) using natural language processing.

language-processing named-entity-recognition natural-language-processing ner recognition

Last synced: 13 Jul 2025

https://github.com/shamspias/google-meet-translator-extension

Google Meet Transcript Translator is a Chrome extension that translates live transcriptions during a Google Meet call into your chosen language. Enhance your global communication.

artificial-intelligence browser-extension chrome-extension communication google-meet-extension javascript language-processing translation

Last synced: 14 Oct 2025

https://github.com/melchisedech333/antlr4-experiments

:wrench: My studies on context-free grammar, using ANTLR4 (C++) to generate the parser files. Some basics are developed, such as token processing, recursion, variable definition, array processing, Abstract Syntax Tree (AST) manipulation, UNICODE support, and error handling.

antlr-language-development antlr4 antlr4-grammar grammar-checker grammar-parser grammar-parser-generator grammar-rules grammars grammars-utility language-development language-processing semantic-analysis semantics syntax syntax-analysis syntax-analyzer syntax-tree tokenization tokenizer tokenizer-parser

Last synced: 11 Apr 2025

https://github.com/buaadreamer/slpkiller

Learning Speech and Natural Language Processing

ai language-processing nlp speech-processing

Last synced: 30 Aug 2025

https://github.com/chandler767/read-the-room

This demo processes conversations in real-time with the Amazon Comprehend natural language processing (NLP) service to gain insights about what was said.

amazon-comprehend amazon-web-services dashboard demo demo-app golang google-chrome key-phrase-extraction language-processing natural-language-processing sentiment-analysis speech-recognition speech-to-text visualization webapp webspeech-api

Last synced: 10 Apr 2025

https://github.com/antonbaumann/german-go-stemmer

An efficient implementation of the German Porter stemming algorithm in Golang.

language-processing nlp porter-stemmer snowball stemming stemming-algorithm

Last synced: 23 Apr 2025

https://github.com/john-hawkins/texturizer

A library and command line application for adding different kinds of features derived from columns of raw text.

feature-engineering feature-extraction language-processing natural-language-processing text-mining text-processing

Last synced: 12 Jul 2025

https://github.com/vineyardbovines/comparative-superlative

Returns the comparative or superlative of an adjective.

language-processing

Last synced: 07 May 2025

https://github.com/gflohr/lingua-poly

Lingua-Poly is a system for disassembling natural languages.

finnish finnish-language-analysis language language-processing linguistics perl

Last synced: 11 Apr 2025

https://github.com/junaidqadirb/unibal2sayad

An experimental UniBal Script to Sayad script conversion tool

converter language-processing languages transliteration

Last synced: 24 Feb 2025

https://github.com/haileybot/language-detector

Basic library to roughly determine the language of input text

detection language language-detection language-processing node nodejs npm npm-package

Last synced: 11 Apr 2025

https://github.com/cgoliver/arxivdt

Full implementation of a decision tree classifier that sorts arXiv abstracts by subject.

decision-trees language-processing machine-learning python

Last synced: 15 Mar 2025

https://github.com/7fisdjf/speechgpt-2.0-preview

SpeechGPT-2.0-preview is an advanced natural language processing model specifically designed for generating coherent and realistic speech patterns. This cutting-edge tool leverages state-of-the-art deep learning techniques to enable more accurate and human-like speech synthesis.

advanced ai artificial generator gpt2 intelligence language language-processing model natural preview processing speechgpt technology text-generation

Last synced: 15 Apr 2025

https://github.com/cizodevahm/recommendation-system-on-imdb

This repository contains a Jupyter notebook that demonstrates the creation of a content-based movie recommendation system using Natural Language Processing (NLP) in Python.

imdb language-processing nlp recommendation-system

Last synced: 27 Sep 2025

https://github.com/zejiran/languages-and-machines

Collection of projects made for a language and computation theory course at Universidad de los Andes

abstract-machines automata finite-state-transducers language-processing petri-nets turing-machine uniandes

Last synced: 10 Jun 2025

https://github.com/jgontrum/cky-parser-optimization

Assignments and materials for the syntactic parsing class at Uppsala University.

course language-processing nlp parsing syntactic-parsing university uppsala-university

Last synced: 22 Mar 2025

https://github.com/sylhare/simple-lda

:bookmark: simple lda - latent dirichlet allocation

language-processing latent-dirichlet-allocation lda python

Last synced: 28 Oct 2025

https://github.com/andrianllmm/aklanon-stemmer

A Python library for Aklanon word stemming.

aklanon language-processing nlp stemmer

Last synced: 26 Oct 2025

https://github.com/aggstam/flex-bison-jvm-language

Simple Flex and Bison programs to validate the syntax of a provided SimpleLanguage file, perform semantic analysis, and compile to JVM assembly (Jasmin) for execution.

compiler flex-bison jvm-bytecode language-processing

Last synced: 02 Mar 2025

https://github.com/andrianllmm/tagalog-stemmer

A Python library for Tagalog word stemming.

language-processing nlp stemmer tagalog

Last synced: 22 Feb 2025

https://github.com/rmncldyo/groq-ai-toolkit

A versatile CLI and Python wrapper for Groq AI's breakthrough LPU Inference Engine. Streamline the creation of chatbots and generate dynamic text with speeds of up to 300 tokens/sec.

artificial-intelligence groq groq-ai groq-ai-api groq-api groqai groqaiapi groqapi language-processing language-processing-unit languageprocessingunit large-laguage-model large-language-models llama2-70b llm llms lpu lpu-inference-engine mixtral-8x7b

Last synced: 23 Feb 2025

https://github.com/owaismohsin001/ameer-virtual-processor

This is a virtual machine named Ameer Virtual Processor, which languages can be compiled to and run on. It runs well with either Nuitka or PyPy.

language-processing programming-language virtual-machine virtual-processor

Last synced: 28 Mar 2025

https://github.com/markomanninen/grcriddles

Study and examination of alphabetical and isopsephical riddles of the Ancient Greeks

greek jupyter-notebooks language-processing python semiotic text-analytics

Last synced: 11 Oct 2025

https://github.com/walshyb/stack-compilers

The stages for a compiler I am building for Anthony Dos Reis's Assembler for SUNY New Paltz's Language Processing class.

assembly compiler java language-processing

Last synced: 26 Oct 2025

https://github.com/dotwee/structured-stern-neon-articles

This repository contains approximately 16k user-written texts, articles, and poems pulled from archives of the Stern NEON website. Stern NEON was a community platform where users could write and publish their own articles. Many of the articles are personal stories, poems, or opinion pieces.

art dataset gedichte german german-language language-data language-processing poetry text-classification texts

Last synced: 05 Mar 2025

https://github.com/hese49/liike

This program solves motion problems stated as Finnish word problems.

equation-solver language-processing motion sympy tkinter-gui

Last synced: 29 Mar 2025

https://github.com/adversing/basic.c

A comprehensive (and small) BASIC language interpreter implementation in C

basic basic-interpreter c interpreter language-processing

Last synced: 26 Jun 2025

https://github.com/0xeab/salty-utility

Utility for processing boarding school menus

data-processing dishes emoji formatter language-processing parser sql tagging

Last synced: 12 Oct 2025

https://github.com/harshpatel44/nlp-simple-library

This repository contains an NLP library created for simple NLP tasks.

language-processing library nlp nltk nltk-library python

Last synced: 18 Mar 2025

https://github.com/hyper-node/language_detector

Python program for detecting language of texts

language-detection language-processing

Last synced: 24 Jun 2025

https://github.com/aggstam/flex-robot-language

Simple Flex program producing the corresponding C code to validate the syntax of provided RobotLanguage (.rl) code file.

c flex language-processing

Last synced: 02 Mar 2025

https://github.com/anshkaran7/grammer-ai

Grammar AI is your intelligent writing assistant that provides real-time grammar corrections, style suggestions, and writing improvements using advanced language models. Built with Next.js, TypeScript, and Tailwind CSS, it helps you communicate more effectively.

ai grammer-checker language-processing nextjs openai react tailwindcss typescript vercel writing-tool

Last synced: 19 Jun 2025

https://github.com/kamiazya/nlp100

100 exercises related to language processing, from 言語処理100本ノック 2015 (NLP 100 Exercise 2015).

language-processing python study training

Last synced: 08 Apr 2025

https://github.com/mowies/newspaper-finder

A model that determines which newspaper published a given article

classification language-processing python

Last synced: 27 Mar 2025

https://github.com/abhisingam/debate-motion-classifier

A program to classify and categorise debate motions based on type and topic. Python | NLP | Text Analysis | NLTK

classification language-processing nlp nltk python3 text-analysis

Last synced: 09 Jul 2025

https://github.com/athkarandikar/two-pass-assembler

Implementation of a two-pass assembler in Java. Pass 1 deals with the generation of literal, symbol, and pool tables. Pass 2 deals with intermediate code generation.

assembler java language-processing pass-2-assembler

Last synced: 22 Feb 2025

https://github.com/tadiusfrank2001/phonology

Analyze the phonological structures, processes, and rules that govern sound patterns, including features like assimilation, dissimilation, and vowel harmony in linguistic systems

language-processing optimality-theory phonological-features phonological-mapping phonological-rules phonology

Last synced: 22 Feb 2025

https://github.com/aggstam/flex-bison-json

Simple Flex/Bison programs to validate the syntax of JSON files and perform semantic analysis on them.

flex-bison json language-processing

Last synced: 02 Mar 2025

https://github.com/thertzlor/entropy-wordsmith

Generate passphrases/natural language passwords of maximum complexity and grandiloquence. Completely excessive.

language-processing passphrase passphrase-generator password-generator security wordnet

Last synced: 16 Mar 2025

https://github.com/aggstam/flex-c-declarations

Simple Flex program producing the corresponding C code to display all the declarations of another C code file.

c flex language-processing

Last synced: 02 Mar 2025

https://github.com/mtlh/censored-strings

My solutions in various languages to the Edabit challenge at https://edabit.com/challenge/Wv9ZeXyC32EMfRWGB

censored-words coding-challenge language-processing

Last synced: 22 Feb 2025

https://github.com/arthurcfranklin/azure-language-studio

Documentation and reflections on using Azure Speech Studio and Language Studio, produced during the DIO bootcamp. Includes insights, lessons learned, challenges, and future possibilities for AI solutions in voice and natural language.

artificial-intelligence azure azure-cognitive-services language-processing speech-recognition speech-to-text text-to-speech voice-recognition

Last synced: 15 Jun 2025

https://github.com/unit-mesh/treesitter-artifacts

The TreeSitter binary for AutoDev

ast language-processing

Last synced: 01 Aug 2025

https://github.com/mikeleo03/customer-review-ml-case

🏆 Top 10 @ TensorFlow Group ML Olympiad 2024 - Developing a model to classify each customer opinion into Likert-scale levels, using data gathered directly from Google Reviews

language-processing logistic-regression lstm ml-olympiad random-forest sastrawi svc-svm xgboost

Last synced: 19 Aug 2025

https://github.com/ubugeeei/jimall

Toy minimal Lisp language processing system in Rust

language-processing lisp rust toy-project

Last synced: 30 Mar 2025