Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in language modeling, parsing, and other natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources
- Last updated: 2024-11-20 00:15:35 UTC
https://github.com/denismosolov/alice-entities-library
A set of named entities for the Yandex.Dialogs platform. Use it when creating Alice skills.
alice-sdk alice-skills nlp yandex-dialogs
Last synced: 16 Nov 2024
https://github.com/jfilter/german-lemmatizer
✂️ Python package (using a Docker image under the hood) to lemmatize German texts.
german lemmatization lemmatizer natural-language-processing nlp python
Last synced: 11 Nov 2024
https://github.com/gesiscss/ptm
Introduction to Natural Language Processing with a special emphasis on the analysis of Job Advertisements
binder data-science information-retrieval labour-market nlp r text-mining topic-modeling
Last synced: 09 Nov 2024
https://github.com/julesbelveze/nhelper
🧪 Behavioral testing of NLP models 🧪
behavioral deep-learning machine-learning natural-language-processing nlp testing
Last synced: 08 Nov 2024
https://github.com/dthung1602/bert-relation-extraction
Extract relations from text using BERT model
bert machine-learning natural-language-processing nlp pytorch relation-extraction transformer
Last synced: 09 Nov 2024
https://github.com/cyclecycle/visualise-spacy-tree
Create dependency tree plots from SpaCy Doc objects
Last synced: 14 Oct 2024
https://github.com/undertheseanlp/speech_classification
Vietnamese Speech Classification experiments
nlp speech vietnamese vietnamese-nlp
Last synced: 11 Nov 2024
https://github.com/luluw8071/text-sentiment-analysis
Fine-Tuning Distil BERT and LSTM for Comparative Analysis
bert bert-fine-tuning lstm-neural-networks nlp pytorch sentiment-classification text-classification
Last synced: 12 Nov 2024
https://github.com/iglee/outrunjulesverne
Personalizing unique travel experiences using data science.
data-mining gis-data jules-verne natural-language-processing nlp personalizing-travels python recommender-system scraping spark travel travelling-salesman-problem tripadvisor unsupervised-learning
Last synced: 06 Nov 2024
https://github.com/dai-wenxun/pointer-generator-networks
PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
nlp pointer-generator pytorch-implementation seq2seq summarization
Last synced: 14 Oct 2024
https://github.com/vochicong/datalab-nlp
NLP extension to Google Cloud Datalab
docker google-cloud-datalab japanese nlp
Last synced: 21 Oct 2024
https://github.com/vblagoje/reno-auto
Automatically generate comprehensive Reno release notes for your pull requests
github-actions nlp release-automation
Last synced: 09 Nov 2024
https://github.com/centre-for-humanities-computing/conspiracies
A python package for discovering and examining conspiracies using NLP.
conspiracies conspiracy knowledge-graph nlp spacy
Last synced: 14 Oct 2024
https://github.com/viveksck/simplicity
Code and Data for Simple Models for Word Formation in English Slang
Last synced: 14 Oct 2024
https://github.com/yunsii/fasttext.wasm.js
WebAssembly version of fastText, a library for efficient text classification and representation learning, with support for Node and browser environments.
browser browser-extension fasttext language language-detection language-detector language-identification natural-language natural-language-processing nlp node nodejs wasm web-extension webassembly worker
Last synced: 09 Nov 2024
https://github.com/vaskonov/negochat_corpus
Negochat Corpus - a dialogue corpus in the negotiation domain
chat chat-bot chatbot corpus dataset dialogs dialogue-corpus dialogue-systems machine-learning natural-language-processing natural-language-understanding negotiation negotiations nlp
Last synced: 14 Oct 2024
https://github.com/renovamen/image-captioning
PyTorch re-implementation of some papers on image captioning
adaptive-attention attention captions cv image-captioning nlp pytorch show-and-tell show-attend-and-tell
Last synced: 10 Nov 2024
https://github.com/amey-thakur/sentiment-analyzer
A simple Python program to analyze sentiment using the TextBlob Python library.
amey ameythakur analysis natural-language-processing nlp sentiment-analysis textblob textblob-sentiment-analysis
Last synced: 09 Nov 2024
https://github.com/oneapi-src/disease-prediction
AI Starter Kit for the implementation of AI-based NLP Disease Prediction system using Intel® Extension for PyTorch* and Intel® Neural Compressor
Last synced: 05 Nov 2024
https://github.com/guenthermi/postgres-retrofit
Tools to create database-specific text value embeddings from word embedding datasets
in-database-analytics learning machine machine-learning nlp postgresql word-embeddings word2vec
Last synced: 15 Oct 2024
https://github.com/kaushalpowar/talk_to_pdf
Talk to your pdf using OpenAI
ai ghdesktop github gpt-3 learn llm nlp nlp-machine-learning opeanai
Last synced: 06 Nov 2024
https://github.com/javierarce/silabea
Node package that splits Spanish words into syllables.
language nlp spanish syllable syllable-count syllables
Last synced: 23 Oct 2024
https://github.com/rosette-api-community/rosettepedia
Augment Rosette API entity extraction results with information from Wikipedia.
entities entity-extraction language mediawiki mediawiki-api natural-language-processing nlp python wikidata wikipedia
Last synced: 09 Oct 2024
https://github.com/cyclecycle/role-pattern-nlp
Build and match patterns for semantic role labelling / information extraction with SpaCy
nlp python semantic-role-labeling spacy
Last synced: 12 Oct 2024
https://github.com/rajspeaks/machine-learning-approach-to-bengali-pos-tagging-using-bnlp
Machine Learning approach to Bengali Corpus POS (Parts of Speech) Tagging using BNLP (Bengali Natural Language Processing) Toolkit. This is the Minor Project Presentation at Heritage Institute of Technology under the mentorship of Prof. Sandipan Ganguly.
bengali-natural-language-processing bengali-nlp bnlp crf-model deep-learning machine-learning ml natural-language-generation natural-language-processing natural-language-toolkit natural-language-understanding nlp pos-tagger pos-tagging python3 rajdeep-das rajspeaks
Last synced: 23 Oct 2024
https://github.com/tawabshakeel/datacamp-projects
Data Camp Python Projects
data-science keras machine-learning matplotlib nlp numpy pandas python scipy seaborn sklearn
Last synced: 13 Oct 2024
https://github.com/renanmav/parabains-bot
Many congrats
bot deep-learning machine-learning nlp tensorflow twitter
Last synced: 18 Oct 2024
https://github.com/ymcui/mrc-model-analysis
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models (iScience)
Last synced: 28 Oct 2024
https://github.com/ekramasif/ai-lab-final-project
Improving News Classification Model Using Support Vector Machine and Naive Bayes
artificial-intelligence classification dataset naive-bayes-classifier news-classification nlp svm-classifier text-classification
Last synced: 11 Oct 2024
https://github.com/epwalsh/allennlp-manager
[WORK IN PROGRESS] Your manager for AllenNLP experiments
Last synced: 24 Oct 2024
https://github.com/jinensetpal/archimede
DANKMEMES Shared Task for EVALITA 2020
bert-embeddings ensemble-learning evalita2020 machine-learning memes nlp python pytorch tensorflow2 transfer-learning
Last synced: 23 Oct 2024
https://github.com/jpoehnelt/eleventy-plugin-related
Plugin for related posts in Eleventy.
eleventy eleventy-plugin natural nlp tf-idf
Last synced: 10 Oct 2024
https://github.com/softmarshmallow/robbin
🔠 an open dictionary platform (both students and developers are welcome!)
deno dictionary dictionary-app education flutter gre nestjs nlp prisma python robbin sat students
Last synced: 28 Oct 2024
https://github.com/michaelhly/farglot
A Transformer-based SocialNLP toolkit for Farcaster
Last synced: 18 Oct 2024
https://github.com/gsarti/svevo-letters-analysis
Topic Modeling and Sentiment Analysis on Italo Svevo Epistolary Corpus
data-science digital-humanities dssc italian italian-nlp italo-svevo machine machine-learning nlp nlproc research-project sentiment-analysis text-mining topic-modeling university-of-trieste
Last synced: 23 Oct 2024
https://github.com/argilla-io/argilla-server
A Python native FastAPI server for the Argilla backend.
api argilla fastapi llm machine-learning nlp server
Last synced: 12 Nov 2024
https://github.com/salesforce/isea
Official code repository for "iSEA: An Interactive Pipeline of Semantic Error Analysis for NLP Models"
deep-learning hci human-in-the-loop machine-learning nlp visualization
Last synced: 08 Nov 2024
https://github.com/tokenmill/dictionary-annotator
Fast and configurable UIMA dictionary annotator.
annotators csv dictionary dkpro nlp ruta
Last synced: 10 Nov 2024
https://github.com/tokenmill/snowball
Snowball version of the Porter stemmer for the Lithuanian language.
lithuanian-language nlp porter-stemmer snowball stemmer
Last synced: 10 Nov 2024
https://github.com/imwildcat/aitk
Artificial Intelligence Toolkit, a powerful tool that makes your life better.
ai baidu cloud-ai cloud-ml-engine computer-vision google machine-learning machine-translation nlp speech-recognition tencent text-to-speech
Last synced: 18 Oct 2024
https://github.com/zhudotexe/chatgpt-api-demo
Demo of how to use the ChatGPT API to create a chat application right in your terminal.
chatgpt chatgpt-api gpt-3 machine-learning natural-language-processing nlp openai openai-api
Last synced: 27 Oct 2024
https://github.com/softmarshmallow/inked-server
☁️ prisma server for inked project
elasticsearch express mongodb neo4j nlp prisma prisma-client typescript
Last synced: 11 Oct 2024
https://github.com/pppw/text-to-image
deep-learning fastai generative-adversarial-network nlp
Last synced: 24 Oct 2024
https://github.com/simplyyan/galaktaglare
A broad, easy and fast framework for machine/deep learning in Go.
algorithms analysis analytics anomaly-detection automation bash data-science decision-trees deep-learning deep-neural-networks go golang image-classification image-processing machine-learning machine-learning-algorithms nlp text-classification tts voice-recognition
Last synced: 09 Oct 2024
https://github.com/madrugado/gia-corpus
Corpus of exam tests for 9-graders in Russia for NLP/ML purposes
corpus natural-language-processing nlp russian-corpus
Last synced: 09 Nov 2024
https://github.com/proycon/nederlab-pipeline
Linguistic enrichment pipeline for historical Dutch, as used in the Nederlab project
dutch historical-dutch historical-linguistics natural-language-processing nederlab nextflow nlp workflow
Last synced: 19 Oct 2024
https://github.com/sammous/allennlp_tagger
Academic Paper/Document classifier based on SCOPUS categories
allennlp classification nlp python pytorch scopus
Last synced: 19 Oct 2024
https://github.com/dave-tucker/ocaml-nlp
Simple NLP for OCaml
natural-language-processing nlp ocaml
Last synced: 12 Oct 2024
https://github.com/yuanjie-ai/dnn
Learning ...
deep-learning nlp nlp-machine-learning
Last synced: 16 Oct 2024
https://github.com/ksdkamesh99/stackquestioner
A Django-based web app that predicts which language a given query belongs to: 'R', 'java', 'javascript', 'PHP', or 'python'. The model is trained using LSTMs, with a training accuracy of 97% and a testing accuracy of 80%. The training data consists of queries collected from a Kaggle dataset originally extracted from StackOverflow.
django embeddings keras-tensorflow lstm nlp stackoverflow
Last synced: 24 Oct 2024
https://github.com/jeff-vincent/spacy-passive-to-active-voice
A rule-based, English language, passive voice converter. Requires spaCy.
nlp python3 spacy-nlp stanford-nlp
Last synced: 19 Nov 2024
https://github.com/gentaiscool/miners
MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.
benchmark classification deep-learning deep-learning-models efficient generation language-model large-language-models llm machine-learning miner miners ml multilingual nlp retrieval semantic-retrieval sentence-transformers transformers
Last synced: 08 Nov 2024
https://github.com/nurseiit/46th-soz
✍️ Abai's 46th word (Qara Søz) by "AI". Character based RNN from Tensorflow Docs.
ai kazakh ml nextjs nlp tensorflow
Last synced: 13 Oct 2024
https://github.com/fastent/fastent
custom models for named-entity recognition
data-annotation data-generation named-entities named-entity-recognition natural-language-processing nlp spacy
Last synced: 12 Oct 2024
https://github.com/kardelruveyda/langchain-mystic
langchain-mystic is a LangChain framework based bot that currently specializes in dream interpretation and fortune-telling. While these are its primary features for now, stay tuned to see what other mystical abilities
chatbot chatbots langchain langchain-python mystic nlp
Last synced: 08 Nov 2024
https://github.com/frankier/wikiparse
Scrapes some Finnish word definitions from English Wiktionary.
computational-linguistics dictionary finnish natural-language-processing nlp wikimarkup wikitionary
Last synced: 08 Nov 2024
https://github.com/maxim5/cs224n-2019-winter
All lecture notes, slides and assignments from CS224n: Natural Language Processing with Deep Learning class by Stanford
cs224n deep-learning machine-learning nlp stanford-nlp
Last synced: 05 Nov 2024
https://github.com/yangboz/chatbotsmessager
🤖 iMessage to Emotional Artificial Intelligence Chat Bots
appstore chatbot hash nlp objective-c screenshot
Last synced: 27 Oct 2024
https://github.com/comcast/syntaviz
A visualization interface for analyzing a (very large) corpus of natural-language queries.
clustering nlp visualization voice-recognition
Last synced: 14 Nov 2024
https://github.com/sap-samples/acl2023-micse
Source code for ACL 2023 paper "miCSE: Mutual Information Contrastive Learning for Low-shot Sentence Embeddings".
contrastive embedding few-shot-learning learning low-shot-learning nlp sample self-supervised-learning sentence sssl
Last synced: 15 Nov 2024
https://github.com/stefanieschneider/unstruwwel
Detect and Parse Historic Dates in R
Last synced: 20 Nov 2024
https://github.com/jpruden92/dialogflow-nlp-to-nlpjs
Transform your Dialogflow NLP model to an NLP.js model
chat chatbot dialogflow nlp nlpjs nlu text voice
Last synced: 17 Nov 2024
https://github.com/pharo-ai/Polyglot
A library for Natural Language Processing
natural-language-processing nlp pharo
Last synced: 17 Nov 2024
https://github.com/chandru-21/end-to-end-movie-recommendation-system-with-deployment-using-docker-and-kubernetes
Content Based Recommendation system uses attributes of the content to recommend similar content. It doesn't have a cold-start problem because it works through attributes or tags of the content, such as actors, genres or directors, so that new movies can be recommended right away.
contentbasedfiltering docker endtoend endtoendpipeline kubernetes machine-learning movie-recommendation movierecommendationsystem nlp python recommendation-system recommender-system streamlit streamlit-webapp
Last synced: 15 Nov 2024
https://github.com/jgolafshan/wallstreetsocial
Analyzes Reddit posts and comments, using an NLP model to recognize stock symbols and options positions
data-analysis flexible machine-learning mentioned-stocks named-entity-recognition natural-language-processing nlp reddit social-media-analysis sql wallstreetbets
Last synced: 20 Nov 2024
https://github.com/faisalahmedbijoy/natural-language-processing-with-python
Natural Language Processing with Python
bangla cse cse-kuet kuet learnwithfaisal natural-language-processing nlp regular-expressions text-classification youtube
Last synced: 18 Nov 2024
https://github.com/ashwinpn/conditional-random-fields
An analysis and implementation of Conditional Random Fields.
conditional-random-fields crf data-science machine-learning nlp nlp-machine-learning pos-tagging probabilistic-graphical-models
Last synced: 15 Nov 2024
https://github.com/euler16/charrnn
Character Level Language Modelling using PyTorch
char-rnn deep-learning lstm nlp
Last synced: 17 Nov 2024
https://github.com/geekalexis/two-stage-sum
Two-stage text summarization with BERT and BART
nlp summarization text-summarization
Last synced: 13 Nov 2024
https://github.com/aditeyabaral/nlpc
Natural Language Toolkit built using the C Programming Language
c machine-learning nlp nlp-machine-learning nltk
Last synced: 16 Nov 2024
https://github.com/ammarlodhi255/image-captioning-system-to-assist-the-blind
An image captioning system that predicts and speaks aloud a caption for an image taken by a visually impaired person.
bidirectional-lstm cnn computer-vision computer-vision-algorithms css3 deep-learning html5 image-captioning javascript javascript-es6 lstm-neural-network ml-web nlp nlp-machine-learning resnet resnet-50 vgg16
Last synced: 17 Nov 2024
https://github.com/estamos/word2vec-thesis
🎓 Diploma Thesis | A Word2vec comparative study of CBOW and Skipgram
cbow continuous-bag-of-words gensim gensim-word2vec machine-learning nlp skipgram skipgram-algorithm word-embeddings word2vec
Last synced: 15 Nov 2024
https://github.com/hassanalgoz/text-generation
Generate and predict text, using Recurrent Neural Networks. (Keras+Tensorflow+Gensim)
gensim-word2vec gru keras-tensorflow lstm machine-learning nlp rnn text-processing word2vec
Last synced: 12 Nov 2024
https://github.com/aflah02/cleansetext
This is a simple library to help you clean your textual data
cleaning-data nlp preprocessing pypi text
Last synced: 20 Nov 2024
https://github.com/webpolis/musai
Machine learning-powered music generation. Full-featured tokenizer, customization options, and high-quality output files. Integration with music production tools.
deep-learning generative-art large-language-models llm machine-learning midi music music-generation nlp recurrent-neural-networks rnn text-generation tokenizer vae variational-autoencoder
Last synced: 15 Nov 2024
https://github.com/aflah02/nlp-albumentations-data-augmentation
This repository contains helper functions which can help you generate additional data points depending on your NLP task.
Last synced: 20 Nov 2024
https://github.com/ajdavidl/nlp-packages
List of packages developed with focus on natural language processing.
natural-language-processing nlp
Last synced: 20 Nov 2024
https://github.com/nateraw/pytorch-lightning-azureml
Narrow the gap between research and production 😎
azure azureml nlp pytorch-lightning transformers
Last synced: 17 Nov 2024
https://github.com/eliask93/transformer-models-for-domain-specific-machine-translation
Example application for the task of fine-tuning pretrained machine translation models on highly domain-specific, self-extracted translated sentences
bitext-mining machine-translation marian-nmt nllb nllb200 nlp sentence-extraction sentence-transformers t5
Last synced: 13 Nov 2024
https://github.com/imsanjoykb/automated-spam-mail-detection-and-flask-deployment
This is a simple NLP project in which the model predicts whether an incoming mail is spam or not spam (ham). Just as Gmail automatically classifies mail and stores it in the spam folder or the inbox, this project is a prototype of that behavior.
flask machine-learning naive-bayes-classifier nlp python scikit-learn
Last synced: 17 Nov 2024
https://github.com/benja1972/topicphrase
Simple project for extracting key phrases from a single document, based on Sentence Transformers
bert-embeddings clusters embeddings key-phrase-extraction nlp noun-phrases-candidates sentence-transformers topics
Last synced: 18 Nov 2024
https://github.com/hyeonsangjeon/colab-tensorflow-tpu-example
colab-tensorflow-tpu-example
bert colab colab-notebook confusion-matrix histogram kobert nlp resolver-library sagemaker sentence sentence-embeddings sentence-transformers sentiment tpu transformers
Last synced: 17 Nov 2024
https://github.com/nlpie/mtap
MTAP: A framework for distributed text analysis using gRPC and microservices-based architecture.
framework grpc java microservices mtap natural-language-processing nlp pipelines python text-analysis
Last synced: 17 Nov 2024
https://github.com/thakur-nandan/topic-modeling
This repository contains an intuitive example of topic modeling using regular LDA, and shows how GuidedLDA is better than regular LDA
gensim guidedlda latent latent-dirichlet-allocation lda nlp nlp-machine-learning nltk python seededlda text topic-modeling
Last synced: 15 Oct 2024
https://github.com/tuanacelik/what-would-mother-say
💁‍♀️ A tweet-creation agent that fetches a user's last k tweets and generates a tweet about the requested topic
agent haystack llm nlp opensource
Last synced: 28 Oct 2024
https://github.com/ills-montreal/emir
When is an Embedding Model More Promising than Another?, NeurIPS'24
embedders information-theory molecule nlp representation-learning
Last synced: 06 Nov 2024
https://github.com/iampukar/language-processing
Solutions to NLP coursework from National Research University Higher School of Economics, through Coursera
coursera hse-aml natural-language-processing nlp
Last synced: 07 Nov 2024
https://github.com/ppasupat/factored-span-parsing
Code for the EMNLP 2019 paper "Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog"
Last synced: 08 Nov 2024
https://github.com/m-elbably/symspell-ex
Distributed spelling correction & fuzzy search based on the symmetric delete spelling correction algorithm (SymSpell)
algorithm nlp spelling-correction
Last synced: 06 Nov 2024
https://github.com/rosette-api/csharp
Rosette API Client Library for C#
capi csharp entity-extraction language-identification machine-learning morphology name-translation natural-language-processing nlp nuget rosette text-analysis text-analytics text-embedding visual-studio
Last synced: 12 Nov 2024
https://github.com/finn-no/keras-conv-sentence-classifier
Keras Implementation of "Convolutional Neural Networks for Sentence Classification"
convolutional-neural-network keras nlp
Last synced: 14 Nov 2024
https://github.com/xiaohk/stat333_project_2
Madison restaurant Yelp rating prediction based on review text
convolutional-neural-networks logistic-regression machine-learning naive-bayes-classifier nlp
Last synced: 07 Nov 2024
https://github.com/avestura/persiannews
📰 My final project for NLP course
csharp fsharp guilan-university nlp persian persian-nlp
Last synced: 14 Nov 2024
https://github.com/ssciwr/ammico
AI-based Media and Misinformation Content Analysis Tool: Analyze text and images
classification computer-vision nlp text-extraction translation
Last synced: 09 Nov 2024