Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources
- Last updated: 2024-11-20 00:15:35 UTC
https://github.com/bretttolbert/verbecc-svc
Dockerized Python microservice with a REST API for verb conjugation in French, Spanish and Portuguese
conjugation conjugator french french-language french-nlp linguistics machine-learning natural-language natural-language-processing nlp portuguese-language portuguese-verbs romanian romanian-language scikit-learn spanish-language spanish-verbs verb-conjugation
Last synced: 18 Oct 2024
https://github.com/winkjs/wink-porter2-stemmer
JavaScript implementation of the Porter Stemmer Algorithm V2 by Dr Martin F Porter
natural-language-processing nlp porter-stemmer-algorithm porter-stemmer-v2 stemmer
Last synced: 09 Nov 2024
https://github.com/percevalw/metanno
Annotator building tool for Jupyter
annotator customizable jupyter modular nlp
Last synced: 08 Nov 2024
https://github.com/revdotcom/words2num
Convert words to numbers
inverse-text-normalization nlp
Last synced: 11 Nov 2024
https://github.com/gmontamat/poor-mans-transformers
Implement Transformers (and Deep Learning) from scratch in NumPy
deep-learning from-scratch machine-learning ml-framework neural-network nlp transformers
Last synced: 30 Oct 2024
https://github.com/anthonysigogne/web-search-engine
API - a simple web search engine
api elasticsearch google-search indexing nlp python search-engine
Last synced: 12 Nov 2024
https://github.com/ahammadmejbah/ahammadmejbah
Data Science || Machine Learning || Deep Learning || Computer Vision || NLP Enthusiast. Talks about #datascience, #deeplearning, #dataanalytics, #machinelearning, and #machinelearningalgorithms
artificial-intelligence computer-vision data-science deep-learning machine-learning nlp python
Last synced: 11 Nov 2024
https://github.com/tlkh/t2t-tuner
Convenient Text-to-Text Training for Transformers
gpt huggingface language-model nlp pytorch t5 transformers
Last synced: 07 Nov 2024
https://github.com/vishnunkumar/doc_transformers
Document processing using transformers
Last synced: 16 Nov 2024
https://github.com/wassname/phoneme2grapheme
Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")
cmudict deep-learning deeplearning machine-learning nlp pronunciation spelling
Last synced: 15 Oct 2024
https://github.com/contextlab/abstract2paper
Auto-generate an entire paper from a prompt or abstract using NLP
auto-text gpt-neo nlp notebook-jupyter text-generation
Last synced: 06 Nov 2024
https://github.com/proycon/deepfrog
An NLP-suite powered by deep learning
deep-learning deep-neural-networks dutch folia frog nlp transformers
Last synced: 08 Nov 2024
https://github.com/bramvanroy/astred
An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. It is useful, for instance, for comparing a translation with the original text, finding differences and similarities between two different translations, or seeing how a machine translation differs from a reference translation.
alignment linguistics nlp parallel-corpus parsing spacy stanza translation
Last synced: 14 Oct 2024
https://github.com/hpprc/defsent
DefSent: Sentence Embeddings using Definition Sentences
bert natural-language-processing nlp transformers
Last synced: 27 Oct 2024
https://github.com/bloomberg/mixce-acl2023
Implementation of MixCE method described in ACL 2023 paper by Zhang et al.
language-model machine-learning nlp python pytorch transformer
Last synced: 09 Nov 2024
https://github.com/liyucheng09/llm-compressive
Longitudinal Evaluation of LLMs via Data Compression
benchmark evaluation llm llms nlp
Last synced: 30 Oct 2024
https://github.com/thunlp/babelnet-sememe-prediction
Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"
Last synced: 10 Nov 2024
https://github.com/deepraj1729/tchatbot
A chatbot framework for creating customizable, all-purpose chatbots using NLP, TensorFlow, and speech recognition
artificial-intelligence chatbot-framework conda deep-learning framework git github machine-learning neural-networks nlp nltk numpy pip pypi python3 sklearn speech-recognition tensorflow virtual-environment
Last synced: 14 Oct 2024
https://github.com/alexcg1/easy_text_generator
Generate text from machine-learning models right in your browser
machine-learning nlp python streamlit
Last synced: 27 Oct 2024
https://github.com/study-assist/browser-extension
A tool to help you organise your bookmarks intelligently
bookmarks bookmarks-manager browser-extension data-analysis machine-learning natural-language-processing nlp
Last synced: 06 Nov 2024
https://github.com/artitw/bert_qa
Accelerating the development of question-answering systems based on BERT and TF 2.0
artificial-intelligence bert machine-learning natural-language-processing natural-language-understanding nlp
Last synced: 28 Oct 2024
https://github.com/cmccomb/rust-stop-words
Common stop words in a variety of languages
languages natural-language-procressing nlp nltk rust-crate stopwords
Last synced: 12 Oct 2024
https://github.com/wetneb/pynif
A small Python library for NLP Interchange Format (NIF) for NER(D) systems
entity-linking gerbil named-entity-recognition nif nlp python
Last synced: 28 Oct 2024
https://github.com/azu/nlp-pattern-match
Natural Language pattern matching library for JavaScript.
english japanese javascript morphological-analysis nlcst nlp pos
Last synced: 01 Nov 2024
https://github.com/koenvervloesem/rasa-docker-arm
Rasa Docker image for ARMv7. Runs on a Raspberry Pi.
arm armhf armv7 armv7l bot bot-framework bots chatbot chatbots chatbots-framework docker docker-image machine-learning natural-language-processing nlp nlu rasa raspberry-pi raspberry-pi-4 raspberry-pi-4b
Last synced: 27 Sep 2024
https://github.com/arbox/wlapi
Ruby-based API for the Wortschatz Leipzig project.
computational-linguistics natural-language-processing nlp ruby rubynlp
Last synced: 15 Nov 2024
https://github.com/AnthonyMRios/adversarial-relation-classification
Unsupervised domain adaptation method for relation extraction
bioinformatics biomedical-data-science machine-learning natural-language-processing nlp nlp-machine-learning relation-extraction
Last synced: 15 Nov 2024
https://github.com/tencent-ailab/season
[EMNLP 2022] Salience Allocation as Guidance for Abstractive Summarization
nlp summarization summarization-model
Last synced: 18 Nov 2024
https://github.com/fursovia/geometric_embedding
"Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation
Last synced: 17 Nov 2024
https://github.com/kklemon/flashperceiver
Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.
attention-mechanism deep-learning flash-attention nlp perceiver transformer
Last synced: 19 Nov 2024
https://github.com/fedenunez/tulp
Tulp is a command-line tool that can help you create and process piped content using the power of ChatGPT directly from the terminal.
chatgpt chatgpt-api console llm nlp shell unix-shell
Last synced: 13 Nov 2024
https://github.com/greenelab/preprint-similarity-search
A web app that uses machine learning to recommend the most suitable journals based on the text content of your preprint
journals nlp nlp-machine-learning web-app
Last synced: 13 Nov 2024
https://github.com/gdamdam/sumo
Tool to extract the text from web article URLs and get word frequencies, entity recognition, an automatic summary, and more
automatic-summarization content-extraction entity-recognition nlp nltk semantic-analysis sentence-extraction
Last synced: 14 Nov 2024
https://github.com/vaibhavs10/ml-with-text
[Tutorial] Demystifying Natural Language Processing with Python
machine-learning natural-language-processing nlp python text-classification
Last synced: 25 Oct 2024
https://github.com/KxSystems/nlp
Natural-language processing library
clustering dataset embedpy kdb natural-language-processing nlp parsing python q vector
Last synced: 12 Nov 2024
https://github.com/centrefordigitalhumanities/tscan
T-scan: an analysis tool for Dutch texts to assess the complexity of the text, based on original work by Rogier Kraf
dutch-language feature-extraction nlp text-difficulty
Last synced: 06 Nov 2024
https://github.com/google-research/pangea
Panoramic Graph Environment Annotation toolkit, for collecting audio and text annotations in panoramic graph environments such as Matterport3D and StreetLearn.
annotation-tool computer-vision crowdsourcing nlp
Last synced: 10 Nov 2024
https://github.com/yashdew/assessor
An open-source Resume Analyzer and Ranking tool for recruiters and candidates.
flask hacktoberfest hacktoberfest2021 nextjs nlp python spacy
Last synced: 27 Oct 2024
https://github.com/michellebonat/fed_funds_ml
Use machine learning (NLP) to demonstrate whether Federal Funds rate changes can be accurately predicted using just the meeting minutes of the FOMC (the US Federal Reserve Bank).
ai federal-reserve-bank finance financial-services machine-learning nlp python3
Last synced: 08 Nov 2024
https://github.com/proycon/foliapy
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.
clariah clarin computational-linguistics folia nlp pynlpl xml
Last synced: 31 Oct 2024
https://github.com/varunon9/sentence-type-classifier
Classify English sentences into assertive, negative, interrogative, imperative and exclamatory based on grammar.
english-grammar nlp nlp-machine-learning sentence-classification
Last synced: 27 Oct 2024
https://github.com/hc200ok/manual-data-masking
A lightweight JavaScript library for manual data masking
data-masking dataset dataset-generation ecmascript2020 javascript library manual-data-masking nlp
Last synced: 11 Nov 2024
https://github.com/bububa/jiagu
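Jiagu deep learning natural language processing tool: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keywords, text summarization, text clustering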
chinese-nlp chinese-word-segmentation classification clustering cws ner nlp pos segmentation
Last synced: 08 Nov 2024
https://github.com/mindspore-courses/deepnlp-models-mindspore
MindSpore implementations of various deep NLP models from cs-224n (Stanford University)
deep-learning mindspore nlp tutorial
Last synced: 09 Nov 2024
https://github.com/code-kern-ai/refinery-python-sdk
Official Python SDK for Kern AI refinery.
active-learning data-centric-ai deep-learning labeling labeling-tool machine-learning natural-language-processing neural-search nlp python sdk spacy supervised-learning text-annotation text-classification transformer
Last synced: 01 Nov 2024
https://github.com/neokd/datastorehouse
DataStoreHouse is an open-source project that aims to create a collaborative platform for gathering and sharing a wide variety of datasets. It provides a centralised repository where individuals and organisations can contribute, discover, and collaborate on diverse datasets for various domains.
api csv datasets good-first-issue hacktoberfest hacktoberfest2023 json machinelearning nextjs13 nlp open-source opensource opensource-projects python reactjs
Last synced: 27 Oct 2024
https://github.com/bminixhofer/gerpt2
German small and large versions of GPT2.
common-crawl german gpt2 language-model machine-learning nlp
Last synced: 28 Oct 2024
https://github.com/mmxgn/sprl-spacy
Implementation of Spatial Role Labeling using the spaCy NLP framework.
nlp problog spacy spatial-role-labeling sprl
Last synced: 10 Oct 2024
https://github.com/climbsrocks/empythy
Automated NLP sentiment predictions: batteries included, or use your own data
automated-machine-learning batteries-included machine-learning machinelearning nlp nlp-library nlp-machine-learning nlp-sentiment-classifier sentiment sentiment-classification sentiment-classifier sentiment-predictions
Last synced: 23 Oct 2024
https://github.com/MeiFagundes/PolarisAI
Personal Assistant Engine built with ML.NET.
asp-net-core dotnet machine-learning ml-net mlnet natural-language-processing nlp personal-assistant personal-assistant-engine rest-api
Last synced: 11 Nov 2024
https://github.com/shukur-alom/spam_mail_detector_using_ml
This model can detect any kind of spam mail. Here I use an ML algorithm. If you use my code, please give me credit.
algorithm artificial-intelligence artificial-intelligence-algorithms artificial-intelligence-models artificial-intelligence-projects deep-learning detectany-kind mail ml natural-language-processing nlp nlp-machine-learning python python-3 python3 spam spam-mail tensorflow tensorflow2
Last synced: 12 Oct 2024
https://github.com/anthonysigogne/keyword-mining
API - extract a list of keywords from a text.
docker keyword keyword-extraction nlp python-2 seo
Last synced: 12 Oct 2024
https://github.com/IITGuwahati-AI/Fake-News-Detection
Detecting Fake News using AI
bert clickbait fake-news-articles fake-news-detection huggingface huggingface-transformer natural-language-processing nlp python3 pytorch tensorflowjs transformer
Last synced: 04 Nov 2024
https://github.com/martinthoma/lidtk
Language Identification Toolkit
language-identification language-identification-toolkit machine-learning mit-license nlp nlp-machine-learning python-3 python-3-5
Last synced: 12 Oct 2024
https://github.com/ianramzy/article-summary-deep-learning
📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!
fact-extractor flask named-entity-recognition nlp summarization web-scraping
Last synced: 19 Nov 2024
https://github.com/talmago/spacy_crfsuite
sequence tagging with spaCy and crfsuite
crf crf-model crfsuite entity-extraction entity-extraction-extension entity-tagging nlp sklearn-crfsuite spacy spacy-extension spacy-ner
Last synced: 12 Oct 2024
https://github.com/mfarragher/obsidian-nlp-analytics
Proofs of concept for workflows that augment Obsidian.md knowledge management via NLP analytics & modelling
knowledge-management nlp nlp-machine-learning obsidian-md python
Last synced: 23 Oct 2024
https://github.com/aditya-xq/text-emotion-detection-using-nlp
machine-learning nlp python text-sentiment
Last synced: 11 Nov 2024
https://github.com/liamdugan/summary-qg
Code for the ACL 2022 Paper "A Feasibility Study of Answer-Agnostic Question Generation for Education"
natural-language-processing nlp question-answer-generation question-answering question-generation
Last synced: 27 Oct 2024
https://github.com/igorbenav/clientai
A unified client for seamless interaction with multiple AI providers.
ai api api-rest artificial-intelligence language-model llm nlp ollama ollama-client openai openai-api python replicate replicate-api
Last synced: 28 Oct 2024
https://github.com/spacyturk/spacyturk
spaCyTurk - trained models & pipelines for Turkish
floret nlp nlp-library spacy turkish-nlp
Last synced: 12 Oct 2024
https://github.com/bryanlimy/dnn-dependency-parser
TensorFlow implementation of A Fast and Accurate Dependency Parser using Neural Networks
dependency-parser mlp neural-network nlp tensorflow
Last synced: 23 Oct 2024
https://github.com/yutkin/news-aggregator
Classification and aggregation of Russian news articles. University coursework.
classification coursework machine-learning news news-aggregator nlp university
Last synced: 05 Nov 2024
https://github.com/stanford-oval/schema2qa
Schema2QA Question Answering Dataset
dataset natural-language-processing nlp semantic-parsing voice-assistant
Last synced: 06 Nov 2024
https://github.com/neurotech-hq/pysimilar
A Python library for computing the similarity between two strings (text) based on cosine similarity
cosine-similarity natural-language natural-language-processing natural-language-understanding nlp python-tanzania tanzania
Last synced: 08 Nov 2024
https://github.com/Hoiy/berserker
Berserker - BERt chineSE woRd toKenizER
bert bert-chinese chinese-nlp chinese-word-segmentation nlp sequence-to-sequence state-of-the-art tensorflow tokenizer tpu
Last synced: 02 Nov 2024
https://github.com/systats/textlearnR
A simple collection of well-working NLP models (Keras, H2O, StarSpace) tuned and benchmarked on a variety of datasets.
classification hyperparameter-optimization keras nlp r text-mining
Last synced: 05 Aug 2024
https://github.com/MilaNLProc/bertlang
A web interface to understand language-specific BERT-models
artificial-intelligence bert-model machine-learning nlp nlp-machine-learning
Last synced: 28 Aug 2024
https://github.com/sno2/bertml
Use common pre-trained ML models in Deno!
bert deno machine-learning nlp rust
Last synced: 17 Aug 2024
https://github.com/damo-nlp-sg/bgca
Code and Data for "Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment Analysis" (ACL 2023)
aspect-based-sentiment-analysis natural-language-processing nlp
Last synced: 13 Nov 2024
https://github.com/jakartaresearch/maleo
Wrapper library for text cleansing, preprocessing in NLP
indonesian-language machine-learning nlp nlp-library
Last synced: 15 Nov 2024
https://github.com/spidy20/flask_nlp_chatbot
This is a simple chatbot using NLP, implemented as a Flask web app.
chatbot chatbot-framework chatterbot flask flask-api flask-application nlp nlp-chatbot nlp-machine-learning nltk
Last synced: 15 Nov 2024
https://github.com/nlpie/biomedicus
BioMedICUS: A biomedical and clinical NLP engine.
biomedical-informatics health-informatics natural-language-processing nlp text-analysis
Last synced: 17 Nov 2024
https://github.com/koichiyasuoka/supar-unidic
Tokenizer, POS-tagger, lemmatizer, and dependency-parser for modern and contemporary Japanese with BERT models
dependency-parser japanese-language nlp
Last synced: 16 Nov 2024
https://github.com/amazon-science/pizza-semantic-parsing-dataset
The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, whose semantics cannot be captured by flat slots and intents.
dataset natural-language-processing nlp semantic-parsing
Last synced: 12 Nov 2024
https://github.com/thefcraft/nsfw-prompt-detection-sd
NSFW Prompt Detection for Stable Diffusion
deep-learning lstm nlp nsfw-detection python stable-diffusion tensorflow
Last synced: 13 Nov 2024
https://github.com/daoyuanli2816/transformer-tutorial-cn
A simple Chinese-language tutorial on the transformer model
chinese-simplified huggingface nlp transformer tutorial-code
Last synced: 08 Nov 2024
https://github.com/tomhosking/hercules
Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)
nlp opinion-summarization summarization vq-vae
Last synced: 27 Oct 2024
https://github.com/centre-for-humanities-computing/odycy
A general-purpose NLP pipeline for Ancient Greek
ancient-greek machine-learning natural-language-processing nlp python spacy
Last synced: 09 Nov 2024
https://github.com/manikandanthangavelu/scikitcrf_ner
Python library for custom entity recognition using Sklearn CRF
crfsuite entities entity-extraction entity-recognition named-entity-recognition ner nlp python scikitcrf-ner sklearn sklearn-crfsuite
Last synced: 31 Oct 2024
https://github.com/undertheseanlp/sentiment
Vietnamese Sentiment Analysis
nlp sentiment-analysis vietnamese vietnamese-nlp
Last synced: 11 Nov 2024
https://github.com/caiyinqiong/study_notes
These are my study notes for my PhD in AI, NLP, IR, and more.
information-retrieval mechine-learing nlp notes paper-list
Last synced: 10 Nov 2024
https://github.com/fahdseddik/deeplearning.ai-natural-language-processing-specialization
These are all my notebooks, lab solutions, and assignments for the DeepLearning.AI Natural Language Processing Specialization on Coursera.
attention-model coursera coursera-specialization deeplearning-ai natural-language-processing nlp probabilistic-models sequence-models vector-space-models
Last synced: 07 Nov 2024
https://github.com/megagonlabs/ebe-dataset
Evidence-based Explanation Dataset (AACL-IJCNLP 2020)
dataset japanese-language nlp text-classification text-generation
Last synced: 10 Nov 2024
https://github.com/liaad/tieval
An Evaluation Framework for Temporal Information Extraction Systems
evaluation-framework information-extraction nlp temporal-relations
Last synced: 10 Nov 2024
https://github.com/yuanxiaosc/Deep_dynamic_contextualized_word_representation
TensorFlow code and pre-trained models for A Dynamic Word Representation Model Based on Deep Context. It combines the ideas of the BERT model and ELMo's deep contextual word representation.
Last synced: 02 Nov 2024
https://github.com/innodatalabs/tbert
PyTorch port of BERT ML model
bert-model natural-language-processing neural-network nlp pytorch
Last synced: 02 Nov 2024
https://github.com/jfilter/hyperhyper
🧮 Python package to construct word embeddings for small data using PMI and SVD
embeddings nlp pmi pmi-svd ppmi python python-package word-analogy word-embeddings word-similarity
Last synced: 11 Nov 2024
https://github.com/kavgan/clinical-concepts
Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.
clinical-concepts clinical-nlp clinical-notes concept-graph graph-nlp nlp paper terminologies
Last synced: 30 Oct 2024
https://github.com/mekhyw/cookiebot-telegram-group-bot
Conversational AI group bot for Telegram. It can also schedule posts, combat raiders/spammers, generate memes, scrape images, provide drawing ideas, call all members and more!
google-cloud-platform google-cloud-pubsub llm nlp opencv python telegram
Last synced: 12 Nov 2024
https://github.com/jovotech/snips-nlu-server
An open source natural language understanding (NLU) server
Last synced: 07 Nov 2024
https://github.com/arne-cl/brat-embedded-visualization-examples
minimal examples of brat annotation visualizations
annotation brat javascript nlp visualization
Last synced: 10 Nov 2024
https://github.com/chrislemke/deep-martin
Text simplification for a better world: Deep-Martin Transformer 🤗
deep-learning huggingface nlp python pytorch text-simplification transformers
Last synced: 16 Nov 2024
https://github.com/saransh-cpp/ocred
Clever, simple, and intuitive wrapper functionalities for OCRing specific textual materials.
hacktoberfest image-processing nlp nltk-python ocr python tesseract-ocr
Last synced: 08 Nov 2024
https://github.com/wazzabeee/pyspark-etl-twitter
Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta Lake
delta-lake docker etl etl-pipeline etl-process kafka kafka-consumer kafka-producer kafka-streams mongodb nlp pyspark python sentiment-analysis spark spark-streaming tweet-analysis tweet-classification twitter twitter-sentiment-analysis
Last synced: 13 Nov 2024
https://github.com/anakin87/llama2-haystack
Using Llama2 with Haystack, the NLP/LLM framework.
haystack large-language-models llama llm nlp
Last synced: 23 Oct 2024
https://github.com/hsgodhia/squad_rasor_nn
PyTorch implementation of the RaSoR paper "Learning Recurrent Span Representations for Extractive Question Answering" (Lee et al. 2016) and experiments with various neural components
deep-learning machine-comprehension nlp pytorch
Last synced: 07 Aug 2024