Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources
- Last updated: 2024-11-15 00:20:20 UTC
https://github.com/thunlp/few-nerd
Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"
deep-learning entity-typing few-shot-learning named-entity-recognition nlp
Last synced: 17 Nov 2024
https://github.com/graykode/commit-autosuggestions
A tool that uses AI to automatically recommend commit messages.
bert commit-autosuggestions natural-language nlp text-generation
Last synced: 13 Nov 2024
https://github.com/gutfeeling/beginner_nlp
A curated list of beginner resources in Natural Language Processing
natural-language-processing nlp nlp-resources
Last synced: 07 Aug 2024
https://github.com/Oneflow-Inc/libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
data-parallelism deep-learning distributed-training large-scale model-parallelism nlp oneflow pipeline-parallelism self-supervised-learning transformer vision-transformer
Last synced: 16 Nov 2024
https://github.com/towhee-io/examples
Analyze unstructured data with Towhee, with examples such as reverse image search, reverse video search, audio classification, question answering systems, and molecular search.
audio-classification cross-modal embeddings image-classification machine-learning nlp video-tagging
Last synced: 13 Nov 2024
https://github.com/neurocult/agency
🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.
agents ai artificial-general-intelligence artificial-intelligence artificial-neural-networks autonomous-agents chatgpt generative-ai go golang gpt language-models llm llmops machine-learning neural-network nlp openai rag vector-database
Last synced: 06 Nov 2024
https://github.com/qipeng/gcn-over-pruned-trees
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)
dependency-parse-trees dependency-parsing information-extraction natural-language-processing nlp relation-extraction
Last synced: 02 Nov 2024
https://github.com/omarsar/nlp_highlights
The most important NLP highlights of 2018 (PDF Report)
analytics artificial-intelligence conversational-ai deep-learning health nlp technology
Last synced: 13 Oct 2024
https://github.com/polm/fugashi
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
cython-wrapper japanese mecab nlp tokenizer
Last synced: 11 Nov 2024
https://github.com/sgrvinod/a-pytorch-tutorial-to-sequence-labeling
Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling
co-training conditional-random-fields crf entity-extraction entity-recognition language-model nlp pos-tagger pos-tagging pytorch pytorch-tutorial sequence-labeling sequence-tagger
Last synced: 14 Nov 2024
https://github.com/dair-ai/nlp_fundamentals
📘 Contains a series of hands-on notebooks for learning the fundamentals of NLP
Last synced: 17 Nov 2024
https://github.com/neuralmagic/sparsezoo
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
computer-vision deep-learning-algorithms deep-learning-models mobilenet models-optimized nlp object-detection-model pretrained-models pruning quantization resnet smaller-models sparse-quantized-models sparsification-recipe transfer-learning yolo
Last synced: 15 Nov 2024
https://github.com/kefirski/pytorch_RVAE
Recurrent Variational Autoencoder that generates sequential data implemented with pytorch
deep-learning nlp python pytorch vae
Last synced: 02 Nov 2024
https://github.com/yuhaozhang/tacred-relation
PyTorch implementation of the position-aware attention model for relation extraction
information-extraction natural-language-processing nlp relation-extraction
Last synced: 15 Nov 2024
https://github.com/shamspias/customizable-gpt-chatbot
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
artificial-intelligence autogpt chatbot conversational-ai data-preprocessing django django-rest-framework gpt-3 gpt-voice langchain langchain-python longchain machine-learning natural-language-processing nlp python voice-chat voice-recognition voice-to-text voice-transcription
Last synced: 06 Nov 2024
https://github.com/neuml/tldrstory
📊 Semantic search for headlines and story text
machine-learning nlp python search txtai
Last synced: 01 Nov 2024
https://github.com/ymcui/PERT
PERT: Pre-training BERT with Permuted Language Model
bert nlp plm pre-trained-model pytorch tensorflow transformers
Last synced: 16 Nov 2024
https://github.com/kakaobrain/word2word
Easy-to-use word-to-word translations for 3,564 language pairs.
bilingual-lexicon-extraction nlp opensubtitles translation
Last synced: 17 Nov 2024
https://github.com/nashex/gpt4-playground
Clone of OpenAI's ChatGPT and Playground environments to enable experimenting with API keys.
gpt4 gpt4-api nextjs nlp openai playground
Last synced: 09 Nov 2024
https://github.com/hit-scir/huozi
Huozi (活字): a general-purpose large language model
fine-tuning large-language-models llm nlp
Last synced: 17 Nov 2024
https://github.com/adamlui/ai-web-extensions/
🤖 AI browser extensions & userscripts to enhance your web experience
ai amazon artificialintelligence brave chat chatbot chatgpt chrome-extensions duckduckgo firefox-addons google gpt-4 greasemonkey javascript machine-learning ml nlp openai userscripts web-extensions
Last synced: 01 Nov 2024
https://github.com/dsdanielpark/amazing-bard-prompts
A curated collection of Google Bard prompts for using Bard more effectively.
amazing amazing-bard-prompts amazing-serise bard bard-prompts google google-bard google-bard-prompts large-language-models nlp prompt prompt-engineering
Last synced: 05 Nov 2024
https://github.com/planeshifter/node-word2vec
Node.js interface to the Google word2vec tool.
Last synced: 13 Nov 2024
https://github.com/dongjunlee/transformer-tensorflow
TensorFlow implementation of 'Attention Is All You Need (2017. 6)'
attention deep-learning experiments hb-experiment nlp tensorflow transformer translation
Last synced: 15 Nov 2024
https://github.com/Planeshifter/node-word2vec
Node.js interface to the Google word2vec tool.
Last synced: 02 Nov 2024
https://github.com/keiffster/program-y
Python 3.x based AIML 2.0 Chatbot interpreter, framework, related programs and knowledge files
ai aiml aiml2 api chatbot framework nlp nlp-parsing python python3 tutorial virtual virtualassistant
Last synced: 29 Oct 2024
https://github.com/ibm/zshot
Zero and Few shot named entity & relationships recognition
ai deep-learning few-shot few-shot-learning machine-learning named-entity-recognition natural-language-processing natural-language-understanding ned ner nlp nlp-library pytorch relation-extraction relationship-extraction spacy transformer zero-shot zero-shot-learning
Last synced: 14 Oct 2024
https://github.com/DongjunLee/transformer-tensorflow
TensorFlow implementation of 'Attention Is All You Need (2017. 6)'
attention deep-learning experiments hb-experiment nlp tensorflow transformer translation
Last synced: 07 Nov 2024
https://github.com/ymcui/pert
PERT: Pre-training BERT with Permuted Language Model
bert nlp plm pre-trained-model pytorch tensorflow transformers
Last synced: 28 Oct 2024
https://github.com/wagamamaz/tensorlayer-tricks
How to use TensorLayer
computer-vision data-science deep-learning keras lasagne machine-learning natural-language-processing neural-network neural-networks nlp reinforcement-learning tensorboard tensorflow tensorflow-experiments tensorflow-framework tensorflow-library tensorflow-models tensorflow-tutorials tensorlayer tflearn
Last synced: 13 Oct 2024
https://github.com/Nashex/gpt4-playground
Clone of OpenAI's ChatGPT and Playground environments to enable experimenting with API keys.
gpt4 gpt4-api nextjs nlp openai playground
Last synced: 04 Nov 2024
https://github.com/adamlui/ai-web-extensions
🤖 AI userscripts & browser extensions to enhance your web experience
ai amazon artificialintelligence brave chat chatbot chatgpt chrome-extensions duckduckgo firefox-addons google gpt-4 greasemonkey javascript machine-learning ml nlp openai userscripts web-extensions
Last synced: 29 Oct 2024
https://github.com/maif/melusine
📧 Melusine: Use Python to automate your email processing workflow
courriels datascience emails natural-language-processing nlp nlp-machine-learning python python3
Last synced: 16 Nov 2024
https://github.com/MAIF/melusine
📧 Melusine: Use Python to automate your email processing workflow
courriels datascience emails natural-language-processing nlp nlp-machine-learning python python3
Last synced: 03 Nov 2024
https://github.com/explosion/displacy
💥 displaCy.js: An open-source NLP visualiser for the modern web
css javascript natural-language-processing nlp spacy svg visualization
Last synced: 25 Sep 2024
https://github.com/dccuchile/spanish-word-embeddings
Spanish word embeddings computed with different methods and from different corpora
fasttext-embeddings glove-embeddings nlp spanish word-embeddings word2vec-embeddinngs
Last synced: 05 Aug 2024
https://github.com/Koziev/NLP_Datasets
My NLP datasets for Russian language
Last synced: 13 Nov 2024
https://github.com/momegas/megabots
🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵
chatbot faiss fastapi gpt-35-turbo gpt-4 information-retrieval langchain llama natural-language-processing nlp pinecone prompt-engineering python question-answering s3
Last synced: 11 Oct 2024
https://github.com/domluna/memn2n
End-To-End Memory Network using Tensorflow
memory-networks nlp tensorflow
Last synced: 17 Nov 2024
https://github.com/izuna385/entity-linking-recent-trends
Recent trends of Entity Linking, Disambiguation, and Representation.
bert entity-disambiguation entity-language-model entity-linking entity-representation entity-resolution natural-language-processing nlp
Last synced: 18 Oct 2024
https://github.com/OpenBMB/BMList
A List of Big Models
ai api code computer-vision deep-learning natural-language-processing nlp paper pretrained-models speech-recognition visualization
Last synced: 16 Nov 2024
https://github.com/deepset-ai/covid-qa
API & Webapp to answer questions about COVID-19. Using NLP (Question Answering) and trusted data sources.
api corona covid-19 covid-data faq nlp question-answering search
Last synced: 06 Nov 2024
https://github.com/alibaba-edu/simple-effective-text-matching
Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
deep-learning nlp quora-question-pairs snli tensorflow
Last synced: 06 Nov 2024
https://github.com/CogStack/OpenGPT
A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).
chatgpt gpt-4 health healthcare huggingface llm medicine nlp opengpt
Last synced: 16 Nov 2024
https://github.com/jacksonllee/pycantonese
Cantonese Linguistics and NLP
cantonese computational-linguistics jyutping linguistics natural-language-processing nlp part-of-speech-tagging pycantonese python stop-words word-segmentation
Last synced: 04 Aug 2024
https://github.com/graphaware/neo4j-nlp
NLP Capabilities in Neo4j
algorithms graph-database machine-learning neo4j nlp opennlp stanford-corenlp
Last synced: 26 Sep 2024
https://github.com/thisandagain/troll
Language sentiment analysis and neural networks... for trolls.
javascript moderation neural-network nlp sentiment sentiment-analysis
Last synced: 17 Nov 2024
https://github.com/kyzhouhzau/nlpgnn
1. Use BERT, ALBERT, and GPT2 as TensorFlow 2.0 layers. 2. Implement GCN, GAN, GIN, and GraphSAGE based on message passing.
albert albert-ner bert bert-cls bert-ner bilstm-attention gan gcn gin gnn gpt2 graph-classfication graph-convolutional-networks graphsage message-passing nlp tensorflow2 textcnn textgcn tf2
Last synced: 14 Oct 2024
https://github.com/oswaldoludwig/Seq2seq-Chatbot-for-Keras
This repository contains a new generative chatbot model based on seq2seq modeling.
chatbot conversational-agents deep-learning dialogue dialogue-generation gan generative-adversarial-network glove keras nlp seq2seq
Last synced: 02 Nov 2024
https://github.com/xplip/pixel
Research code for pixel-based encoders of language (PIXEL)
deep-learning deep-neural-networks language-model machine-learning nlp pytorch
Last synced: 14 Nov 2024
https://github.com/davidmigloz/langchain_dart
Build LLM-powered Dart/Flutter applications.
ai dart flutter generative-ai llms nlp
Last synced: 03 Nov 2024
https://github.com/wuba/qa_match
A simple, effective toolkit for short text matching
58 ai deep-learning dssm lstm machine-learning nlp qabot qatools tensorflow
Last synced: 16 Nov 2024
https://github.com/yunwei37/covid-19-nlp-vis
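A COVID-19 data visualization and interactive analysis website built with flask + pyecharts, covering epidemic data collection, daily epidemic maps, curve charts, statistical analysis, situational awareness, algorithm design for predicting confirmed case counts, NLP-based public opinion monitoring, and other tasks (deployed at http://covid.yunwei123.tech/)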
covid-19 flask maps nlp pyecharts visualization
Last synced: 17 Nov 2024
https://github.com/shibing624/dialogbot
dialogbot provides search-based, task-based, and generative dialogue models. It supports web-retrieval QA, domain-knowledge QA, task-guided QA, and chit-chat QA, and works out of the box.
chatbot deep-learning dialog dialogbot nlp qa question-answering
Last synced: 12 Nov 2024
https://github.com/machine-learning-apps/issue-label-bot
Code For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"
bigquery bootstrap data-science deep-learning end-to-end-application flask gcp-cloud gharchive github-api-v3 github-app keras kubernetes machine-learning machine-learning-tutorials nlp production-machine-learning tensorflow
Last synced: 29 Sep 2024
https://github.com/machine-learning-apps/Issue-Label-Bot
Code For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"
bigquery bootstrap data-science deep-learning end-to-end-application flask gcp-cloud gharchive github-api-v3 github-app keras kubernetes machine-learning machine-learning-tutorials nlp production-machine-learning tensorflow
Last synced: 25 Oct 2024
https://github.com/discopy/discopy
The Python toolkit for computing with string diagrams.
category-theory diagrams nlp quantum-computing
Last synced: 09 Aug 2024
https://github.com/drahnr/cargo-spellcheck
Checks all your documentation for spelling and grammar mistakes with hunspell and an nlprule-based grammar checker
cargo cargo-plugin cargo-spellcheck grammar grammar-mistakes grammarchecker hacktoberfest hunspell languagetool nlp spellchecker spelling
Last synced: 13 Nov 2024
https://github.com/HIT-SCIR/huozi
Huozi (活字): a general-purpose large language model
fine-tuning large-language-models llm nlp
Last synced: 08 Nov 2024
https://github.com/zjunlp/openue
[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text
bert event-extraction intent-classification named-entity-recognition natural-language-processing nlp nlp-extraction-tasks openue pytorch relation-extraction slot-filling triple-extraction
Last synced: 16 Nov 2024
https://github.com/explosion/prodigy-openai-recipes
✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
annotation-tool few-shot-learning gpt-3 nlp openai openai-api prodigy zero-shot-learning
Last synced: 25 Sep 2024
https://github.com/dpressel/dliss-tutorial
Tutorial for International Summer School on Deep Learning, 2019
deep-learning machine-learning nlp
Last synced: 17 Nov 2024
https://github.com/swhl/ai-competition-collections
A collection of posts on AI competition experience and on training and testing tips (gathering write-ups from a variety of AI competitions)
competition cv data-discovery graph-neural-networks knowledge-graph nlp recommender-system speech
Last synced: 15 Nov 2024
https://github.com/jcrodriguez1989/chatgpt
Interface to ChatGPT from R
assistant chatgpt gpt-3 gpt-4 hacktoberfest llm nlp openai r rstats rstats-package rstatses rstudio rstudio-addin
Last synced: 11 Oct 2024
https://github.com/asahi417/lm-question-generation
Multilingual/multidomain question generation datasets, models, and python library for question generation.
bart nlp pytorch question-answering question-generation t5
Last synced: 04 Nov 2024
https://github.com/cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference
analysis deep-learning language-model language-models machine-learning nlp transformers
Last synced: 06 Aug 2024
https://github.com/hlasse/textdescriptives
A Python library for calculating a large variety of metrics from text
dependency-distance descriptive-statistics nlp python readability readability-scores spacy spacy-extension statistics syntactic-analysis
Last synced: 14 Oct 2024
https://github.com/mcs07/chemdataextractor
Automatically extract chemical information from scientific documents
chemistry information-extraction natural-language-processing nlp python text-mining
Last synced: 14 Nov 2024
https://github.com/xiangking/ark-nlp
A private NLP coding package that quickly implements SOTA solutions.
Last synced: 06 Nov 2024
https://github.com/JetRunner/BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
bert glue model-compression nlp transformers
Last synced: 03 Nov 2024
https://github.com/qiangsiwei/bert_distill
BERT distillation (distillation experiments based on BERT)
bert classification distillation nlp
Last synced: 02 Nov 2024
https://github.com/CUNY-CL/wikipron
Massively multilingual pronunciation mining
computational-linguistics g2p language linguistics nlp phonetics phonology pronunciation python-api scraped-data speech
Last synced: 04 Nov 2024
https://github.com/UKPLab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only an unlabeled corpus and yields a massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
bert domain-adaptation information-retrieval nlp transformers vector-search
Last synced: 05 Aug 2024
https://github.com/xkzhangsan/xk-time
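xk-time is a toolkit for time conversion, time calculation, time formatting, time parsing, calendars, cron expressions, and time-related NLP. It uses Java 8 (JSR-310), is thread-safe and easy to use, offers more than 70 common date-format templates, supports both the Java 8 time classes and Date, and is lightweight with no third-party dependencies.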
calendar cron cron-java8 date datetimeformatter-formatter dateutil formatter java jsr-310 localdate localdatetime nlp time timeconvertion
Last synced: 04 Aug 2024
https://github.com/graykode/ai-docstring
Visual Studio Code extension to quickly generate docstrings for Python functions using AI (NLP) technology.
bert code-summarization docstrings nlp vs-code-extenstion
Last synced: 04 Nov 2024
https://github.com/natasha/yargy
Rule-based facts extraction for Russian language
earley-parser information-extraction morphology nlp python russian tomita tomita-parser
Last synced: 17 Nov 2024
https://github.com/TengHu/ActionWeaver
Make function calling with LLM easier
chatgpt-functions nlp openai-api openai-chatgpt openai-function-call openai-functions python
Last synced: 05 Nov 2024
https://github.com/alibaba-edu/simple-effective-text-matching-pytorch
A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
deep-learning nlp pytorch quora-question-pairs snli
Last synced: 06 Nov 2024
https://github.com/GaoQ1/rasa_nlu_gq
Turn natural language into structured data (supports Chinese; includes N custom models for different scenarios and tasks)
bert bilstm-idcnn jieba natural-language nlp nlu rasa rasa-nlu rasa-nlu-gao tensorflow
Last synced: 02 Nov 2024
https://github.com/HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text
dependency-distance descriptive-statistics nlp python readability readability-scores spacy spacy-extension statistics syntactic-analysis
Last synced: 04 Aug 2024
https://github.com/abhijithneilabraham/tableqa
AI tool for querying tabular data in natural language.
ai csv database machine-learning nl2sql nlp qa querying-natural-language question-answering sql sql-generation sql-query table-qa tableqa tabular-data
Last synced: 13 Nov 2024
https://github.com/abhimishra91/insight
Repository for Project Insight: NLP as a Service
docker fastapi huggingface huggingface-transformer machine-learning microservice natural-language-processing nlp streamlit streamlit-webapp transformer transformers-models
Last synced: 14 Nov 2024
https://github.com/hankcs/multi-criteria-cws
Simple Solution for Multi-Criteria Chinese Word Segmentation
bi-lstm-crf cws dynet multi-criteria-cws nlp
Last synced: 17 Nov 2024
https://github.com/dair-ai/nlp_newsletter
📰Natural language processing (NLP) newsletter
deep-learning machine-learning nlp
Last synced: 10 Nov 2024
https://github.com/textpipe/textpipe
Textpipe: clean and extract metadata from text
language-identification named-entities named-entity-recognition nlp text-analysis text-processing
Last synced: 06 Nov 2024
https://github.com/phospho-app/phospho
Text analytics for LLM apps. PostHog for prompts. Extract evaluations, intents and events from text messages. phospho leverages LLMs (OpenAI, MistralAI, Ollama, etc.)
ai analytics generative-ai llm nextjs nlp ollama python self-hosted typescript
Last synced: 13 Oct 2024
https://github.com/charles9n/bert-sklearn
a sklearn wrapper for Google's BERT model
bert conll-2003 language-model named-entity-recognition natural-language-processing ner nlp pytorch scikit-learn transfer-learning
Last synced: 02 Nov 2024
https://github.com/undertheseanlp/nlp-vietnamese-progress
Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.
Last synced: 11 Nov 2024
https://github.com/kevinlu1248/pyate
PYthon Automated Term Extraction
ai nlp symbolic-ai term-extraction
Last synced: 28 Sep 2024
https://github.com/gagolews/stringi
Fast and portable character string processing in R (with the Unicode ICU)
icu icu4c natural-language-processing nlp r regex regexp string-manipulation stringi stringr text text-processing tidy-data unicode
Last synced: 26 Oct 2024
https://github.com/hankcs/hanlp-lucene-plugin
HanLP Chinese word segmentation plugin for Lucene, supporting Lucene-based systems including Solr
chinese-text-segmentation hanlp lucene nlp solr traditional-chinese
Last synced: 17 Nov 2024
https://github.com/daac-tools/vibrato
🎤 vibrato: Viterbi-based accelerated tokenizer
japanese morphological-analysis nlp rust segmentation tokenization tokenizer
Last synced: 07 Nov 2024
https://github.com/gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
bilingual code-mixed code-mixing code-switch code-switching language nlp papers research speech
Last synced: 08 Nov 2024
https://github.com/jackdh/RasaTalk
A chatbot framework for Rasa NLU
bot botkit bots chatbot chatbot-framework conversational-ai nlp nodejs rasa rasa-nlu react
Last synced: 30 Oct 2024
https://github.com/sekwiatkowski/Komputation
Komputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
artificial-intelligence convolutional-neural-networks cuda framework gpu jvm kotlin machine-learning neural-networks nlp nvidia recurrent-neural-networks seq2seq
Last synced: 02 Nov 2024