Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

https://github.com/TianyuZhuuu/CHIP2018

Rank 6 solution for the CHIP2018 question matching competition

nlp pytorch sentence-similarity

Last synced: 06 Nov 2024

https://github.com/chinnichaitanya/spellwise

🚀 Extremely fast fuzzy matcher & spelling checker in Python!

caverphone editex levenshtein natural-language-processing nlp spellcheck spelling-correction trie typox

Last synced: 27 Oct 2024

https://github.com/kalebu/desktop-chatbot-app

A python knowledge-based chatbot application built with Tkinter

chatbot chatbot-application data-science nlp nlp-projects python-tanzania python3 tanzania

Last synced: 09 Nov 2024

https://github.com/jishnujayakumar/mlrc2020-embedkgqa

This is the code for the MLRC2020 challenge w.r.t. the ACL 2020 paper Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings

embeddings knowledge-graph machine-learning multi-hop-reasoning neural-embeddings nlp

Last synced: 30 Nov 2024

https://github.com/tomaarsen/ttstextnormalization

Convert English text from written expressions into spoken forms

competition nlp normalization spoken-forms text-normalization tts

Last synced: 08 Nov 2024

https://github.com/stanford-oval/ovalchat

OVALChat is a customizable Web app aimed at conducting user studies with chatbots

chatbots crowdsourcing nextjs nlp react tailwindcss

Last synced: 06 Nov 2024

https://github.com/adamspannbauer/lexrankr

Extractive Text Summarization with lexRankr (an R package implementing the LexRank algorithm)

lexrank lexrank-algorithm nlp r r-package rstat

Last synced: 27 Oct 2024

https://github.com/quentinlintz/synthetic-data-generator

🦄 Use GPT to generate and label data

gpt-35-turbo nlp synthetic-data

Last synced: 21 Dec 2024

https://github.com/pyurbans/urbans

A tool for translating text from source grammar to target grammar (context-free) with corresponding dictionary.

artificial-intelligence data-science machine-translation nlp python

Last synced: 10 Nov 2024

https://github.com/code-kern-ai/embedders

With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.

classification machine-learning named-entity-recognition natural-language-processing ner nlp python representation-learning similarity-search

Last synced: 10 Nov 2024

https://github.com/priyamakeshwari/teachgpt

An AI-powered teacher that can help you learn your topics faster before an exam

ai hacktoberfest hacktoberfest2023 llm machine-learning nlp python

Last synced: 19 Jan 2025

https://github.com/rchtgpt/gitg0

a magnificent tool to auto-suggest everything you need before pushing a git commit • built @mlh-fellowship

cli-tool dev-tool javascript nlp npm

Last synced: 15 Feb 2025

https://github.com/jpmanson/llm_templates

Library for creating instruction/chat prompts for text-generation LLMs. It supports local and Hugging Face models.

chatbot cohere gemma huggingface jinja2 library llama2 llama3 llm mistral nlp nlp-library phi3 template

Last synced: 24 Jan 2025

https://github.com/rileynwong/rpi-poetry-generator

Poetry theremin: use Raspberry Pi with hardware sensors to generate poetry using NLP techniques, based on physical light and distance conditions

distance-sensor generative-art generative-poetry generative-text hardware interactive interactive-art interactive-text-generation light-sensor natural-language-processing nlp nltk poetry python raspberry-pi rpi sensors sentiment-analysis

Last synced: 14 Oct 2024

https://github.com/explosion/spacy-benchmarks

💫 Runtime performance comparison of spaCy against other NLP libraries

benchmarking benchmarks natural-language-processing nlp spacy

Last synced: 17 Jan 2025

https://github.com/li-plus/rouge-metric

A Python wrapper of the official ROUGE-1.5.5.pl script and a re-implementation of full ROUGE metrics.

machine-learning nlp pypi python rouge rouge-metric summarization

Last synced: 06 Nov 2024

https://github.com/fer-aguirre/pmdm

Political Misogynistic Discourse Monitor team from the 2021 JournalismAI Collab Challenges

nlp social-network-analysis text-classification

Last synced: 05 Nov 2024

https://github.com/dpressel/arcs-py

Arc-Eager and Arc-Hybrid Greedy Dependency Parser with Dynamic Oracle in Python (with no Dependencies!)

nlp nlp-dependency-parsing

Last synced: 28 Oct 2024

https://github.com/yhy1117/x-mixup

Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".

cross-lingual-transfer manifold-mixup nlp

Last synced: 21 Dec 2024

https://github.com/banyh/PyStanfordNLP

A Python Wrapper of Stanford Chinese Segmenter

nlp postagging python-wrapper stanford stanford-chinese-segmenter

Last synced: 14 Nov 2024

https://github.com/joshdevins/demo-es-lang-ident

Demo: Elasticsearch Language Identification

demo elasticsearch language-identification nlp search

Last synced: 23 Oct 2024

https://github.com/canclid/sentences

Cantonese dialogue corpus

natural-language-processing nlp

Last synced: 12 Feb 2025

https://github.com/juliastrings/tinysegmenter.jl

Julia version of TinySegmenter, compact Japanese tokenizer

julia nlp tinysegmenter

Last synced: 23 Jan 2025

https://github.com/primaprashant/ai-customer-support

📚 Curated collection of blogs and papers on how different companies are using machine learning in production for better customer support.

ai applied-data-science applied-machine-learning applied-ml artificial-intelligence customer-service customer-support data-science deep-learning machine-learning natural-language-processing nlp paper production tech-blog

Last synced: 19 Feb 2025

https://github.com/richardlitt/thesis

My thesis on "Open Source Code and Low Resource Languages" for an MSc in Language Science and Technology at Saarland University

dissertation endangered-languages low-resource-languages lrl nlp nlproc saarland saarland-university thesis

Last synced: 04 Feb 2025

https://github.com/mush42/libtashkeel

Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models

arabic diacritics nlp tashkeel

Last synced: 04 Jan 2025

https://github.com/xv44586/knowledge-distillation-nlp

some demos of Knowledge Distillation in NLP

bert keras knowledge-distillation nlp

Last synced: 17 Nov 2024

https://github.com/pnnl/cactus

LLM Agent that leverages cheminformatics tools to provide informed responses.

cheminformatics chemistry foundation-models llm llm-agent nlp science

Last synced: 25 Nov 2024

https://github.com/percevalw/metanno

Annotator building tool for Jupyter

annotator customizable jupyter modular nlp

Last synced: 08 Nov 2024

https://github.com/gmontamat/poor-mans-transformers

Implement Transformers (and Deep Learning) from scratch in NumPy

deep-learning from-scratch machine-learning ml-framework neural-network nlp transformers

Last synced: 30 Oct 2024

https://github.com/ahammadmejbah/ahammadmejbah

Data Science || Machine Learning || Deep Learning || Computer Vision || NLP Enthusiast. Talks about #datascience, #deeplearning, #dataanalytics, #machinelearning, and #machinelearningalgorithms

artificial-intelligence computer-vision data-science deep-learning machine-learning nlp python

Last synced: 11 Nov 2024

https://github.com/tomhosking/hercules

Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)

nlp opinion-summarization summarization vq-vae

Last synced: 30 Dec 2024

https://github.com/alexcg1/easy_text_generator

Generate text from machine-learning models right in your browser

machine-learning nlp python streamlit

Last synced: 08 Jan 2025

https://github.com/revdotcom/words2num

Convert words to numbers

inverse-text-normalization nlp

Last synced: 11 Nov 2024

https://github.com/lkstrp/newspaper-scraper

The all-in-one Python package for seamless newspaper article indexing, scraping, and processing – supports public and premium content!

news newspaper nlp parser scraper

Last synced: 07 Nov 2024

https://github.com/winkjs/wink-porter2-stemmer

JavaScript implementation of the Porter Stemmer Algorithm V2 by Dr Martin F Porter

natural-language-processing nlp porter-stemmer-algorithm porter-stemmer-v2 stemmer

Last synced: 09 Nov 2024

https://github.com/tlkh/t2t-tuner

Convenient Text-to-Text Training for Transformers

gpt huggingface language-model nlp pytorch t5 transformers

Last synced: 07 Nov 2024

https://github.com/proycon/deepfrog

An NLP-suite powered by deep learning

deep-learning deep-neural-networks dutch folia frog nlp transformers

Last synced: 08 Nov 2024

https://github.com/hpprc/defsent

DefSent: Sentence Embeddings using Definition Sentences

bert natural-language-processing nlp transformers

Last synced: 27 Oct 2024

https://github.com/o19s/skipchunk

Extracts a latent knowledge graph from text and indexes/queries it in Elasticsearch or Solr

elasticsearch knowledge-graph nlp solr

Last synced: 20 Nov 2024

https://github.com/vishnunkumar/doc_transformers

Document processing using transformers

ai ml nlp ocr

Last synced: 16 Nov 2024

https://github.com/contextlab/abstract2paper

Auto-generate an entire paper from a prompt or abstract using NLP

auto-text gpt-neo nlp notebook-jupyter text-generation

Last synced: 06 Nov 2024

https://github.com/kklemon/flashperceiver

Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.

attention-mechanism deep-learning flash-attention nlp perceiver transformer

Last synced: 19 Nov 2024

https://github.com/dluman/rusty

Rust bindings for the spaCy library.

nlp rust

Last synced: 16 Nov 2024

https://github.com/liyucheng09/llm-compressive

Longitudinal Evaluation of LLMs via Data Compression

benchmark evaluation llm llms nlp

Last synced: 30 Oct 2024

https://github.com/thunlp/babelnet-sememe-prediction

Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"

babelnet nlp semantics sememe

Last synced: 10 Nov 2024

https://github.com/bloomberg/mixce-acl2023

Implementation of MixCE method described in ACL 2023 paper by Zhang et al.

language-model machine-learning nlp python pytorch transformer

Last synced: 09 Nov 2024

https://github.com/wassname/phoneme2grapheme

Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")

cmudict deep-learning deeplearning machine-learning nlp pronunciation spelling

Last synced: 15 Oct 2024

https://github.com/artitw/bert_qa

Accelerating the development of question-answering systems based on BERT and TF 2.0

artificial-intelligence bert machine-learning natural-language-processing natural-language-understanding nlp

Last synced: 28 Oct 2024

https://github.com/wetneb/pynif

A small Python library for NLP Interchange Format (NIF) for NER(D) systems

entity-linking gerbil named-entity-recognition nif nlp python

Last synced: 28 Oct 2024

https://github.com/azu/nlp-pattern-match

Natural Language pattern matching library for JavaScript.

english japanese javascript morphological-analysis nlcst nlp pos

Last synced: 01 Nov 2024

https://github.com/kampersanda/sif-embedding

Rust implementation of SIF and uSIF: Simple and fast sentence embedding

nlp sentence-embeddings vector-search

Last synced: 29 Nov 2024

https://github.com/bramvanroy/astred

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. It is useful, for instance, for comparing a translation with the original text, finding differences and similarities between two different translations, or seeing how a machine translation differs from a reference translation.

alignment linguistics nlp parallel-corpus parsing spacy stanza translation

Last synced: 14 Oct 2024

https://github.com/spacyturk/spacyturk

spaCyTurk - trained models & pipelines for Turkish

floret nlp nlp-library spacy turkish-nlp

Last synced: 15 Feb 2025

https://github.com/cmccomb/rust-stop-words

Common stop words in a variety of languages

languages natural-language-procressing nlp nltk rust-crate stopwords

Last synced: 16 Feb 2025

https://github.com/ayoolaolafenwa/trainnlp

Sample tutorials for training Natural Language Processing Models with Transformers

huggingface-transformers masked-language-models natural-language-processing nlp transformers

Last synced: 01 Jan 2025

https://github.com/fursovia/geometric_embedding

"Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation

embeddings nlp

Last synced: 17 Nov 2024

https://github.com/tencent-ailab/season

[EMNLP 2022] Salience Allocation as Guidance for Abstractive Summarization

nlp summarization summarization-model

Last synced: 18 Nov 2024

https://github.com/pb2204/spam-detection-model

SPAM-Detection-Model is an NLP model to detect spam messages...

ai mechine-learning nlp

Last synced: 17 Jan 2025

https://github.com/abhaskumarsinha/minimalgpt

MinimalGPT is a concise, adaptable, and streamlined code framework that encompasses the essential components necessary for the construction, training, inference, and fine-tuning of the GPT model. This framework is implemented exclusively using Keras and TensorFlow, ensuring compatibility and coherence within the broader deep learning ecosystem.

ai artificial-intelligence fine-tuning generative-model gpt gpt-2 gpt-models keras keras-tensorflow language-model llm machine-learning neural-network nlp nlp-machine-learning tensorflow tensorflow2 training transformer transformer-architecture

Last synced: 21 Nov 2024

https://github.com/princeton-nlp/lwm

We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

language nlp world-models

Last synced: 09 Jan 2025

https://github.com/r13i/twitter-sentiment-analysis

What if we could see the emotions and moods of people through the breadcrumbs they leave on Twitter?

docker emotions influxdb kafka natural-language-processing nlp python sentiment-analysis tweets twitter

Last synced: 07 Dec 2024

https://github.com/arbox/wlapi

Ruby based API for the project Wortschatz Leipzig.

computational-linguistics natural-language-processing nlp ruby rubynlp

Last synced: 15 Nov 2024

https://github.com/kotartemiy/topic-labeled-news-dataset

100k+ topic labeled news articles published from thousands of news websites

media news nlp topic topic-modeling topics

Last synced: 18 Jan 2025

https://github.com/nicolay-r/awesome-sentiment-attitude-extraction

A curated list of awesome sentiment analysis studies, in which an attitude corresponds to the textual position conveyed by a Subject towards another Object mentioned in the text, such as entities, events, etc.

aaai awesome awesome-list chatgpt deep-learning emnlp language-model low-resource-nlp machine-learning naacl natural-language-processing nips nlp relation-classification sentiment-analysis sentiment-attitude-extraction stance-detection state-of-the-art trends

Last synced: 19 Dec 2024

https://github.com/sno2/bertml

Use common pre-trained ML models in Deno!

bert deno machine-learning nlp rust

Last synced: 10 Dec 2024

https://github.com/faisal-khann/desktop-voice-assistant

JARVIS is a desktop voice assistant inspired by the fictional AI assistant of Tony Stark from the Marvel Cinematic Universe. This project aims to create a functional voice-controlled assistant that can perform various tasks such as providing information, setting reminders, controlling applications, and executing commands based on user input.

artificial-intelligence nlp pip-install python python3 qtdesigner speech-recognition

Last synced: 29 Jan 2025

https://github.com/proycon/foliapy

An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.

clariah clarin computational-linguistics folia nlp pynlpl xml

Last synced: 19 Dec 2024

https://github.com/systats/textlearnR

A simple collection of well-performing NLP models (Keras, H2O, StarSpace) tuned and benchmarked on a variety of datasets.

classification hyperparameter-optimization keras nlp r text-mining

Last synced: 22 Nov 2024

https://github.com/bminixhofer/gerpt2

German small and large versions of GPT2.

common-crawl german gpt2 language-model machine-learning nlp

Last synced: 28 Oct 2024

https://github.com/mfarragher/obsidian-nlp-analytics

Proofs of concept for workflows that augment Obsidian.md knowledge management via NLP analytics & modelling

knowledge-management nlp nlp-machine-learning obsidian-md python

Last synced: 23 Oct 2024

https://github.com/ianramzy/article-summary-deep-learning

📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!

fact-extractor flask named-entity-recognition nlp summarization web-scraping

Last synced: 19 Nov 2024

https://github.com/vaibhavs10/ml-with-text

[Tutorial] Demystifying Natural Language Processing with Python

machine-learning natural-language-processing nlp python text-classification

Last synced: 19 Dec 2024