Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources
- Last updated: 2024-11-15 00:20:20 UTC
https://github.com/shrebox/Personified-Chatbot
A personified chatbot responding to a query based on the answering pattern of Dr. APJ Abdul Kalam using Information Retrieval, Natural Language Processing, and Deep Learning techniques.
apj-abdul-kalam chatbot deep-learning information-retrieval lstm natural-language-processing nlp ranking-algorithm seq2seq-chatbot seq2seq-model summarization word2vec
Last synced: 11 Nov 2024
https://github.com/kampersanda/tongrams-rs
Rust library providing fast language model queries in compressed space
compression elias-fano language-model ngrams nlp trie
Last synced: 11 Nov 2024
https://github.com/liebeck/spacy-iwnlp
German lemmatization with IWNLP as extension for spaCy
nlp spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/quickgrid/AI-Resources
Research Paper Summaries, Setup & Performance Notes, Resource Links on AI, Deep Learning, NLP, Computer Vision for my learning.
ai ai-notes ai-research blender computer-vision deep-learning nlp paper-summaries papers research-paper research-paper-summaries
Last synced: 02 Nov 2024
https://github.com/korpling/pepper
A highly extensible platform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.
annotations converter format java linguistic-formats linguistics nlp pepper
Last synced: 15 Nov 2024
https://github.com/botpress/nlu
This repo contains all ML/NLU-related code written by Botpress in the NodeJS environment, including the Botpress Standalone NLU Server.
Last synced: 05 Nov 2024
https://github.com/quickgrid/ai-resources
Research Paper Summaries, Setup & Performance Notes, Resource Links on AI, Deep Learning, NLP, Computer Vision for my learning.
ai ai-notes ai-research blender computer-vision deep-learning nlp paper-summaries papers research-paper research-paper-summaries
Last synced: 07 Aug 2024
https://github.com/mgechev/gently-js
Module which returns the offensive words in a string. A soft reminder to be nicer to each other ❤️.
Last synced: 22 Oct 2024
https://github.com/davidsvy/Neural-Scam-Artist
Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
dataset deduplication fine-tuning fraud gpt2 huggingface lsh minhash nlp pytorch readability scam transformer web-scraping
Last synced: 05 Aug 2024
https://github.com/ikegami-yukino/rakutenma-python
Rakuten MA (Python version)
chinese japanese-language nlp part-of-speech-tagger pos-tagging python word-segmentation
Last synced: 12 Oct 2024
https://github.com/i008/nyyelp
Predicting Yelp review ratings using recurrent neural networks
deep-learning nlp python recurrent-neural-networks yelp-dataset
Last synced: 13 Nov 2024
https://github.com/code-kern-ai/sequence-learn
With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.
machine-learning named-entity-recognition natural-language-processing ner nlp python
Last synced: 10 Nov 2024
https://github.com/hscspring/ALL4AI
AI Related Tools/Projects
ai jupyter linux machine-learning nlp python ssh toolbox
Last synced: 07 Nov 2024
https://github.com/4ai/agn
Official Code for Merging Statistical Feature via Adaptive Gate for Improved Text Classification (AAAI2021)
bert deep-learning nlp text-classification
Last synced: 10 Nov 2024
https://github.com/hscspring/all4ai
AI Related Tools/Projects
ai jupyter linux machine-learning nlp python ssh toolbox
Last synced: 28 Oct 2024
https://github.com/dbklim/russian_subtitles_dataset
Preprocessing of a dataset of 347 subtitles for TV series (thanks to Taiga Corpus), for use in building a word2vec model or a JamSpell model, training a neural network or a chatbot, or any other NLP task.
bot cnn corpus dataset lstm machine-learning ml natural-language-processing nlp nlu rnn russian subtitles text text-analysis text-processing word2vec
Last synced: 11 Nov 2024
https://github.com/generall/oneshotnlp
PyTorch text matching models implementation for One-Shot Named Entity Linking
Last synced: 14 Oct 2024
https://github.com/amsqr/NaiveSumm
NaiveSumm is a naive summarization approach based on Luhn's 1958 work "The Automatic Creation of Literature Abstracts". It uses the frequencies of words in the document to score and extract the sentences that include the most frequent words.
natural-language-processing nlp python summarization
Last synced: 31 Oct 2024
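The frequency-based idea in the NaiveSumm description can be sketched roughly as follows. This is not the repository's code: the stop-word list, the crude sentence splitter, and the averaged-frequency score are all assumptions made for illustration, and Luhn's original method additionally looks at clusters of significant words.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "is", "it", "to", "on", "that"}


def split_sentences(text):
    """Split after '.', '!' or '?' followed by whitespace (a crude rule)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase word tokens with stop words removed."""
    words = re.findall(r"[a-z']+", sentence.lower())
    return [w for w in words if w not in STOP_WORDS]


def summarize(text, max_sentences=1):
    """Return the highest-scoring sentences, kept in document order."""
    sentences = split_sentences(text)
    # Document-wide word frequencies.
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        # Average frequency of the sentence's words (assumed scoring rule).
        words = tokenize(sentence)
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return [sentences[i] for i in keep]
```

Averaging by sentence length keeps long sentences from winning purely because they contain more words; a summed score would be an equally plausible variant.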
https://github.com/jawahar273/practNLPTools-lite
Practical Natural Language Processing Tools for Humans is built on top of Senna Natural Language Processing (NLP) predictions: part-of-speech (POS) tags, chunking (CHK), named entity recognition (NER), semantic role labeling (SRL) and syntactic parsing (PSG) with skip-gram, all in Python, with more features still to be added. The website given is for downloading the Senna tool.
nlp practnlptools3 senna senna-nlp
Last synced: 07 Aug 2024
https://github.com/rileynwong/poetry-generator
Generate poetry based on text corpus input
generative-art generative-poetry generative-text natural-language-processing nlp poetry poetry-generator text
Last synced: 14 Oct 2024
https://github.com/hpprc/bert-classification-tutorial-2024
[2024 edition] Text classification with BERT
Last synced: 27 Oct 2024
https://github.com/AmrHendy/programming-language-translator
An easy way to use TransCoder, released by Facebook AI Research, to convert code from one programming language to another. TransCoder applies unsupervised neural machine translation (NMT), a deep-learning approach to translating text from one natural language to another, and is trained only on monolingual source data.
machine-translation nlp programming-language transcoder transformer unsupervised-deep-learning unsupervised-translation
Last synced: 06 Aug 2024
https://github.com/proycon/gecco
Generic Environment for Context-Aware Correction of Orthography
nlp python spelling-correction
Last synced: 08 Nov 2024
https://github.com/senthilchandrasegaran/textplorer
Visual analytics application for qualitative text analysis
nlp text-visualization visual-analytics
Last synced: 27 Oct 2024
https://github.com/uetchy/homebrew-nlp
🍺 A Homebrew keg that specializes in Natural Language Processing.
homebrew natural-language-processing nlp
Last synced: 18 Oct 2024
https://github.com/KGCP/MEL-TNNT
Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)
metadata-extraction named-entity-recognition natural-language-processing nlp nlp-ner pipeline
Last synced: 03 Aug 2024
https://github.com/pfnet-research/vat_nmt
Implementation of "Effective Adversarial Regularization for Neural Machine Translation", ACL 2019
acl2019 adversarial neural-machine-translation nlp nmt vat
Last synced: 15 Nov 2024
https://github.com/adriangonz/statistical-nlp-17
Repository for group 17 on the Statistical Natural Language Processing module at UCL
Last synced: 22 Oct 2024
https://github.com/KGCP/MEL-TNNT/
Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)
metadata-extraction named-entity-recognition natural-language-processing nlp nlp-ner pipeline
Last synced: 14 Aug 2024
https://github.com/minerva-ml/steppy-toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
data-science deep-learning keras keras-models machine-learning nlp open-source pipeline pipeline-framework python python3 pytorch pytorch-models reproducibility reproducible-research steppy steppy-toolkit steps tensorflow tensorflow-models
Last synced: 26 Sep 2024
https://github.com/sap-samples/acl2022-self-contrastive-decorrelation
Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings".
ai nlp research research-paper sample self-contrastive self-contrastive-decorrelation self-supervised-learning sentence-embeddings
Last synced: 15 Nov 2024
https://github.com/stanford-oval/ovalchat
OVALChat is a customizable Web app aimed at conducting user studies with chatbots
chatbots crowdsourcing nextjs nlp react tailwindcss
Last synced: 06 Nov 2024
https://github.com/adamspannbauer/lexrankr
Extractive Text Summarization with lexRankr (an R package implementing the LexRank algorithm)
lexrank lexrank-algorithm nlp r r-package rstat
Last synced: 27 Oct 2024
https://github.com/kalebu/desktop-chatbot-app
A python knowledge-based chatbot application built with Tkinter
chatbot chatbot-application data-science nlp nlp-projects python-tanzania python3 tanzania
Last synced: 09 Nov 2024
https://github.com/TianyuZhuuu/CHIP2018
CHIP2018 question matching competition: Rank 6 solution
nlp pytorch sentence-similarity
Last synced: 06 Nov 2024
https://github.com/alan-turing-institute/prompto
An open source library for asynchronous querying of LLM endpoints
deep-learning hut23 large-language-models llm-eval llm-evaluation llms machine-learning natural-language-processing nlp python transformer transformers
Last synced: 13 Nov 2024
https://github.com/code-kern-ai/embedders
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
classification machine-learning named-entity-recognition natural-language-processing ner nlp python representation-learning similarity-search
Last synced: 10 Nov 2024
https://github.com/appcoda/naturallanguageprocessing
A Quick Demo for NLP in Swift 4
demo nlp playgrounds swift swift4
Last synced: 15 Nov 2024
https://github.com/humansignal/brand-sentiment-analysis
Scripts utilizing Heartex platform to build brand sentiment analysis from the news
lstm-sentiment-analysis natural-language-processing nlp nlp-machine-learning nlp-sentiment-classifier nlp-tutorial sentiment sentiment-analyser sentiment-analysis sentiment-classification tensorflow-text-classifiers transfer-learning
Last synced: 14 Nov 2024
https://github.com/pyurbans/urbans
A tool for translating text from source grammar to target grammar (context-free) with corresponding dictionary.
artificial-intelligence data-science machine-translation nlp python
Last synced: 10 Nov 2024
https://github.com/tomaarsen/ttstextnormalization
Convert English text from written expressions into spoken forms
competition nlp normalization spoken-forms text-normalization tts
Last synced: 08 Nov 2024
https://github.com/ekinakyurek/knetlayers.jl
Useful Layers for Knet
computer-vision deep-learning machine-learning nlp
Last synced: 15 Oct 2024
https://github.com/artitw/text2class
Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT
artificial-intelligence bert categorization classification classifier machine-learning natural-language-processing natural-language-understanding nlp tensorflow text text-classification transformers
Last synced: 28 Oct 2024
https://github.com/chinnichaitanya/spellwise
🚀 Extremely fast fuzzy matcher & spelling checker in Python!
caverphone editex levenshtein natural-language-processing nlp spellcheck spelling-correction trie typox
Last synced: 27 Oct 2024
https://github.com/breezedeus/loveshare
Assorted materials shared by breezedeus
cnocr cnstd cv deep-learning llm nlp ocr pix2text
Last synced: 15 Nov 2024
https://github.com/deepset-ai/haystack-search-pipeline-streamlit
🚀 Template Haystack Search Application with Streamlit
Last synced: 06 Nov 2024
https://github.com/doccano/spacy-partial-tagger
A simple library for training named entity recognition model from partially annotated data
named-entity-recognition natural-language-processing nlp spacy weak weak-supervision weakly-supervised-learning
Last synced: 10 Oct 2024
https://github.com/yuvalpinter/nytwit
New York Times Word Innovation Types dataset
computational-linguistics corpus dataset news nlp
Last synced: 27 Oct 2024
https://github.com/twardoch/split-markdown4gpt
A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows the models to handle the data in manageable chunks.
data-preprocessing gpt gpt-3 gpt-35-turbo gpt-35-turbo-16k gpt-4 markdown markdown-processing mistletoe natural-language-processing nlp openai openai-gpt python split-text summarization text-analysis text-processing text-summarization text-tokenization
Last synced: 27 Oct 2024
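Splitting Markdown under a token limit, as the split-markdown4gpt description outlines, might look roughly like the sketch below. It is not the tool's implementation: `count_tokens` is a hypothetical stand-in that counts whitespace-separated words rather than model tokens, and sections are detected with a simple heading regex that would also split on `# ` lines inside code fences.

```python
import re


def count_tokens(text):
    """Stand-in token counter: whitespace-separated words (an assumption)."""
    return len(text.split())


def split_sections(markdown):
    """Split before every ATX heading line (#, ##, ...), keeping the headings."""
    parts = re.split(r"(?m)^(?=#{1,6} )", markdown)
    return [p for p in parts if p.strip()]


def chunk_markdown(markdown, limit):
    """Greedily pack consecutive sections into chunks of at most `limit` tokens.

    A single section larger than the limit is emitted as its own oversized
    chunk; splitting inside a section is left out of this sketch.
    """
    chunks, current, used = [], [], 0
    for section in split_sections(markdown):
        size = count_tokens(section)
        # Start a new chunk when adding this section would exceed the limit.
        if current and used + size > limit:
            chunks.append("".join(current))
            current, used = [], 0
        current.append(section)
        used += size
    if current:
        chunks.append("".join(current))
    return chunks
```

Packing whole sections keeps each heading together with its body, at the cost of sometimes leaving chunks well under the limit.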
https://github.com/li-plus/rouge-metric
A Python wrapper of the official ROUGE-1.5.5.pl script and a re-implementation of full ROUGE metrics.
machine-learning nlp pypi python rouge rouge-metric summarization
Last synced: 06 Nov 2024
https://github.com/banyh/PyStanfordNLP
A Python Wrapper of Stanford Chinese Segmenter
nlp postagging python-wrapper stanford stanford-chinese-segmenter
Last synced: 14 Nov 2024
https://github.com/explosion/spacy-benchmarks
💫 Runtime performance comparison of spaCy against other NLP libraries
benchmarking benchmarks natural-language-processing nlp spacy
Last synced: 25 Sep 2024
https://github.com/derintelligence/en-az-parallel-corpus
English-Azerbaijani parallel language corpus
azerbaijan azerbaijani-translation corpus language linguistics nlp parallel translation
Last synced: 13 Nov 2024
https://github.com/winkjs/wink-porter2-stemmer
JavaScript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter
natural-language-processing nlp porter-stemmer-algorithm porter-stemmer-v2 stemmer
Last synced: 09 Nov 2024
https://github.com/avacaondata/nlpboost
Python library for automatic training, optimization and comparison of Transformer models on most NLP tasks.
deep-learning hyperparameter-optimization hyperparameter-tuning natural-language-generation natural-language-processing natural-language-understanding nlp pytorch text-classification text-generation
Last synced: 11 Oct 2024
https://github.com/bretttolbert/verbecc-svc
Dockerized Python microservice with REST API for verb conjugation in French, Spanish and Portuguese
conjugation conjugator french french-language french-nlp linguistics machine-learning natural-language natural-language-processing nlp portuguese-language portuguese-verbs romanian romanian-language scikit-learn spanish-language spanish-verbs verb-conjugation
Last synced: 18 Oct 2024
https://github.com/dpressel/arcs-py
Arc-Eager and Arc-Hybrid Greedy Dependency Parser with Dynamic Oracle in Python (with no Dependencies!)
Last synced: 28 Oct 2024
https://github.com/revdotcom/words2num
Convert words to numbers
inverse-text-normalization nlp
Last synced: 11 Nov 2024
https://github.com/tirendazacademy/hugging-face-tutorials
Getting started with Hugging Face
deep-learning hugging-face huggingface huggingface-datasets huggingface-library huggingface-pipeline huggingface-transformer huggingface-transformers image-classification machine-learning natural-language-processing nlp pretrained-models pytorch sentiment-analysis tensorflow text-classification transfer-learning
Last synced: 08 Nov 2024
https://github.com/eric-haibin-lin/nlp-notebooks
A collection of natural language processing notebooks.
deep-learning deep-learning-tutorial natural-language-generation natural-language-inference natural-language-processing natural-language-understanding nlp nlp-resources
Last synced: 28 Oct 2024
https://github.com/anthonysigogne/web-search-engine
API - a simple web search engine
api elasticsearch google-search indexing nlp python search-engine
Last synced: 12 Nov 2024
https://github.com/yhy1117/x-mixup
Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".
cross-lingual-transfer manifold-mixup nlp
Last synced: 28 Aug 2024
https://github.com/fer-aguirre/pmdm
Political Misogynistic Discourse Monitor team from the 2021 JournalismAI Collab Challenges
nlp social-network-analysis text-classification
Last synced: 05 Nov 2024
https://github.com/rileynwong/rpi-poetry-generator
Poetry theremin: use Raspberry Pi with hardware sensors to generate poetry using NLP techniques, based on physical light and distance conditions
distance-sensor generative-art generative-poetry generative-text hardware interactive interactive-art interactive-text-generation light-sensor natural-language-processing nlp nltk poetry python raspberry-pi rpi sensors sentiment-analysis
Last synced: 14 Oct 2024
https://github.com/salesforce/overture
Library for soft prompt tuning
deep-learning nlp prompt-tuning python pytorch soft-prompt-tuning
Last synced: 08 Nov 2024
https://github.com/percevalw/metanno
Annotator building tool for Jupyter
annotator customizable jupyter modular nlp
Last synced: 08 Nov 2024
https://github.com/ahammadmejbah/ahammadmejbah
Data Science || Machine Learning || Deep Learning || Computer Vision || NLP Enthusiast Talks about #datascience, #deeplearning, #dataanalytics, #machinelearning, and #machinelearningalgorithms
artificial-intelligence computer-vision data-science deep-learning machine-learning nlp python
Last synced: 11 Nov 2024
https://github.com/oroszgy/hungarian-text-mining-workshop
Materials for the Text Mining workshop held in the HuNLP meetup, June 2017
classification hungarian information-extraction keyword-extraction machine-learning meetup natural-language-processing nlp python scikit-learn sentiment-analysis spacy spacy-models text-mining text-mining-workshop textacy tutorial workshop
Last synced: 08 Nov 2024
https://github.com/richardlitt/thesis
My thesis on "Open Source Code and Low Resource Languages" for an MSc in Language Science and Technology at Saarland University
dissertation endangered-languages low-resource-languages lrl nlp nlproc saarland saarland-university thesis
Last synced: 21 Oct 2024
https://github.com/joshdevins/demo-es-lang-ident
Demo: Elasticsearch Language Identification
demo elasticsearch language-identification nlp search
Last synced: 23 Oct 2024
https://github.com/gmontamat/poor-mans-transformers
Implement Transformers (and Deep Learning) from scratch in NumPy
deep-learning from-scratch machine-learning ml-framework neural-network nlp transformers
Last synced: 30 Oct 2024
https://github.com/contextlab/abstract2paper
Auto-generate an entire paper from a prompt or abstract using NLP
auto-text gpt-neo nlp notebook-jupyter text-generation
Last synced: 06 Nov 2024
https://github.com/koenvervloesem/rasa-docker-arm
Rasa Docker image for ARMv7. Runs on a Raspberry Pi.
arm armhf armv7 armv7l bot bot-framework bots chatbot chatbots chatbots-framework docker docker-image machine-learning natural-language-processing nlp nlu rasa raspberry-pi raspberry-pi-4 raspberry-pi-4b
Last synced: 27 Sep 2024
https://github.com/vishnunkumar/doc_transformers
Document processing using transformers
Last synced: 16 Nov 2024
https://github.com/liyucheng09/llm-compressive
Longitudinal Evaluation of LLMs via Data Compression
benchmark evaluation llm llms nlp
Last synced: 30 Oct 2024
https://github.com/deepraj1729/tchatbot
A ChatBot framework to create customizable all purpose Chatbots using NLP, Tensorflow, Speech Recognition
artificial-intelligence chatbot-framework conda deep-learning framework git github machine-learning neural-networks nlp nltk numpy pip pypi python3 sklearn speech-recognition tensorflow virtual-environment
Last synced: 14 Oct 2024
https://github.com/artitw/bert_qa
Accelerating the development of question-answering systems based on BERT and TF 2.0
artificial-intelligence bert machine-learning natural-language-processing natural-language-understanding nlp
Last synced: 28 Oct 2024
https://github.com/study-assist/browser-extension
A tool to help you organise your bookmarks intelligently
bookmarks bookmarks-manager browser-extension data-analysis machine-learning natural-language-processing nlp
Last synced: 06 Nov 2024
https://github.com/azu/nlp-pattern-match
Natural Language pattern matching library for JavaScript.
english japanese javascript morphological-analysis nlcst nlp pos
Last synced: 01 Nov 2024
https://github.com/bramvanroy/astred
An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. This is useful, for instance, for comparing a translation with the original text, finding differences and similarities between two different translations, or seeing how a machine translation differs from a reference translation.
alignment linguistics nlp parallel-corpus parsing spacy stanza translation
Last synced: 14 Oct 2024
https://github.com/arbox/wlapi
Ruby based API for the project Wortschatz Leipzig.
computational-linguistics natural-language-processing nlp ruby rubynlp
Last synced: 15 Nov 2024
https://github.com/wetneb/pynif
A small Python library for NLP Interchange Format (NIF) for NER(D) systems
entity-linking gerbil named-entity-recognition nif nlp python
Last synced: 28 Oct 2024
https://github.com/proycon/deepfrog
An NLP-suite powered by deep learning
deep-learning deep-neural-networks dutch folia frog nlp transformers
Last synced: 08 Nov 2024
https://github.com/cmccomb/rust-stop-words
Common stop words in a variety of languages
languages natural-language-procressing nlp nltk rust-crate stopwords
Last synced: 12 Oct 2024
https://github.com/alexcg1/easy_text_generator
Generate text from machine-learning models right in your browser
machine-learning nlp python streamlit
Last synced: 27 Oct 2024
https://github.com/hpprc/defsent
DefSent: Sentence Embeddings using Definition Sentences
bert natural-language-processing nlp transformers
Last synced: 27 Oct 2024
https://github.com/wassname/phoneme2grapheme
Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")
cmudict deep-learning deeplearning machine-learning nlp pronunciation spelling
Last synced: 15 Oct 2024
https://github.com/fursovia/geometric_embedding
"Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation
Last synced: 03 Aug 2024
https://github.com/tlkh/t2t-tuner
Convenient Text-to-Text Training for Transformers
gpt huggingface language-model nlp pytorch t5 transformers
Last synced: 07 Nov 2024
https://github.com/bloomberg/mixce-acl2023
Implementation of MixCE method described in ACL 2023 paper by Zhang et al.
language-model machine-learning nlp python pytorch transformer
Last synced: 09 Nov 2024
https://github.com/thunlp/babelnet-sememe-prediction
Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"
Last synced: 10 Nov 2024
https://github.com/AnthonyMRios/adversarial-relation-classification
Unsupervised domain adaptation method for relation extraction
bioinformatics biomedical-data-science machine-learning natural-language-processing nlp nlp-machine-learning relation-extraction
Last synced: 15 Nov 2024
https://github.com/hc200ok/manual-data-masking
A lightweight javascript library for manual data masking
data-masking dataset dataset-generation ecmascript2020 javascript library manual-data-masking nlp
Last synced: 11 Nov 2024
https://github.com/mindspore-courses/deepnlp-models-mindspore
MindSpore implementations of various deep NLP models from CS224n (Stanford University)
deep-learning mindspore nlp tutorial
Last synced: 09 Nov 2024
https://github.com/anthonysigogne/keyword-mining
API - extract a list of keywords from a text.
docker keyword keyword-extraction nlp python-2 seo
Last synced: 12 Oct 2024
https://github.com/talmago/spacy_crfsuite
sequence tagging with spaCy and crfsuite
crf crf-model crfsuite entity-extraction entity-extraction-extension entity-tagging nlp sklearn-crfsuite spacy spacy-extension spacy-ner
Last synced: 12 Oct 2024
https://github.com/shukur-alom/spam_mail_detector_using_ml
This model can detect any kind of spam mail. Here I use an ML algorithm. If you use my code, please give me credit.
algorithm artificial-intelligence artificial-intelligence-algorithms artificial-intelligence-models artificial-intelligence-projects deep-learning detectany-kind mail ml natural-language-processing nlp nlp-machine-learning python python-3 python3 spam spam-mail tensorflow tensorflow2
Last synced: 12 Oct 2024
https://github.com/bububa/jiagu
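Jiagu deep learning natural language processing toolkit: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keyword extraction, text summarization, text clustering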
chinese-nlp chinese-word-segmentation classification clustering cws ner nlp pos segmentation
Last synced: 08 Nov 2024