spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization
- Last updated: 2025-05-12 00:26:57 UTC
https://github.com/izuna385/pubtator-multiprocess-parser
Specifically for Entity Linking. Quick demo with MedMentions and NCBI datasets is also included.
allennlp bioinformatics entity-disambiguation entity-linking natural-language-processing pubtator spacy
Last synced: 28 Mar 2025
https://github.com/jonathanfox5/lemon_tizer
LemonTizer is a class that wraps the spacy library to build a lemmatizer for language learning applications.
lemmatization lemmatizer spacy wrapper
Last synced: 10 Apr 2025
https://github.com/aditya172926/text_summarization
Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.
glove machine-learning nlp python scikit-learn spacy
Last synced: 18 Mar 2025
https://github.com/abdiasarsene/analysis_and_findings
Libraries
bert gpt missingno nltk pandas seaborn sklearn spacy statsmodels tensorflow
Last synced: 26 Feb 2025
https://github.com/gugarosa/brainy
🧠 An intelligent Python-inspired Machine Learning API for training NLP-based models.
api machine-learning nlp python spacy
Last synced: 28 Mar 2025
https://github.com/srstevenson/keyword-extractor
Extract keywords from plain text documents
Last synced: 20 Nov 2024
https://github.com/karimosman89/legal-document-nlp
Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements. Use NLP techniques for named entity recognition and text classification. Streamline the review process for legal teams by automating information extraction.
nltk python scikit-learn spacy
Last synced: 19 Feb 2025
https://github.com/sudip-13/nlp
Tutorial repo for an NLP Dialogflow chatbot, with the back end configured.
dialogflow fastapi fasttext mogodb ner regex spacy tf-idf
Last synced: 29 Mar 2025
https://github.com/aiatyourservice/deeplearningforcoders
This repo contains code from the Deep Learning Specialization by Andrew Ng.
deep-learning nltk python pytorch spacy
Last synced: 29 Mar 2025
https://github.com/florensadimer/nlp_ner_soccer_pt-br
Manual Annotation and Comparison with Trained Models
annotation llm machine-learning ner nlp spacy
Last synced: 12 Apr 2025
https://github.com/arjunravi26/chatbot-ai
A chatbot for responding to AI related queries
langchain langchain-community pinecone python rag regrex spacy stramlit
Last synced: 23 Feb 2025
https://github.com/codebasics/ner-resume-parser
A tutorial for NER Resume Parser to get the keywords out of a resume.
mlflow mlflow-tracking nlp python spacy spacy-models spacy-nlp
Last synced: 06 Mar 2025
https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques
Uses a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text.
healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp
Last synced: 05 Apr 2025
https://github.com/serenasensini/medspacy-tutorial
OSDAY 2023 @ Florence - Use case to show medspaCy functionalities.
medspacy nlp nlp-machine-learning spacy spacy-nlp spacy-pipeline
Last synced: 14 Mar 2025
https://github.com/md-emon-hasan/nlp-codebasics
Collection of basic Natural Language Processing examples that cover essential techniques like tokenization, text representation, and text classification.
bag-of-words bow gensim gensim-word2vec lematization nlp nlp-library nlp-machine-learning nltk nltk-python python3 spacy text-classification text-processing tokenization
Last synced: 22 Feb 2025
https://github.com/jash271/youglance
Package for analyzing YouTube videos, from searching by relevant entities to analyzing sentiment and clustering different parts of the video to your liking.
cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling
Last synced: 22 Mar 2025
https://github.com/medspacy/nlp_preprocessor
spaCy component for modifying the string of a doc before tokenizing.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 26 Feb 2025
https://github.com/zevio/pcu
Plateforme de Connaissances Unifiées (PCU) project (i.e., Unified Knowledge Platform)
extraction json keyphrase-extraction kleis knowledge knowledge-extraction langdetect pcu pcu-io pcu-json pcu-keyphrase pcu-language pcu-nlp pcu-pdf pcu-relation pdf python spacy text workflow
Last synced: 24 Feb 2025
https://github.com/omar7tech/text-summarization
This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.
natural-language-processing python spacy text-summarization tokenization
Last synced: 05 Apr 2025
https://github.com/fferegrino/zeldakg
A TLOZ inspired knowledge graph
infobox knowledge-graph nltk pandas python spacy wikidata
Last synced: 15 Dec 2024
https://github.com/geetisha/advanced-eda-and-text-mining
Advanced EDA and Text Mining
jupyter-notebook matplotlib nltk numpy pandas python spacy textblob wordcloud
Last synced: 18 Mar 2025
https://github.com/kailejie/ner
This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.
Last synced: 05 Apr 2025
https://github.com/pyladiesams/nlp-projects-with-spacy-may2024
NLP projects with spaCy
Last synced: 05 Apr 2025
https://github.com/sloev/spacy_onnx_sentiment_english
english sentiment model for spacy
onnx-models sentiment-analysis spacy spacy-pipeline
Last synced: 28 Mar 2025
https://github.com/somenath203/named-entity-recognizer
Click below to checkout the website
huggingface huggingface-spaces named-entity-recognition ner spacy streamlit torch transformers
Last synced: 04 Mar 2025
https://github.com/bonysmoke/speliuk
A more accurate spelling correction for the Ukrainian language.
correction kenlm spacy spelling symspell ukrainian
Last synced: 09 Feb 2025
https://github.com/den1ksk/nlp-with-disastertweets
Kaggle competition
bert data-science deeplearning kaggle machine-learning nlp nltk pytorch spacy transformers xgboost
Last synced: 15 Mar 2025
https://github.com/thyripian/core
This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.
api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite
Last synced: 01 Apr 2025
https://github.com/aadityasivas/spacy-text-summarization
A simple text summarizer built with spaCy
jupyter-notebook nlp python spacy
Last synced: 09 Apr 2025
https://github.com/turbolent/spacy-thrift-docker
spacy-thrift as a Docker container
docker named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy thrift
Last synced: 28 Mar 2025
https://github.com/oroszgy/spacy-tokenizer-benchmark
Quick and dirty scripts to measure the performance of spaCy
benchmark natural-language-processing nlp python spacy tokenizer
Last synced: 28 Mar 2025
https://github.com/vidhi1290/chatbot-with-rasa-nlu-model-and-python
This project builds an intelligent chatbot using Rasa NLU for an E-Commerce business 🛍️. The chatbot can handle user queries like product information, pricing, and order management 💬. With spacy and TensorFlow pipelines 🧠 for training, and MongoDB for storing data 📦, it offers seamless, context-aware conversations
aichatbot artificial-intelligence chatbot jupyter-notebook matplotlib nlu nlu-chatbot pandas pymongo python rasa-chatbot rasa-nlu spacy spacy-nlp tensorflow
Last synced: 09 Apr 2025
https://github.com/sukanyadutta52/sentiment-analysis
An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis
flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard
Last synced: 28 Mar 2025
https://github.com/robgc/sento-processing
A Natural Language Processing tool designed to perform sentiment analysis on tweets and store the results obtained.
async asyncpg nlp python sentiment-analysis spacy spacy2
Last synced: 12 Apr 2025
https://github.com/ajaykumar095/natural_language_processing
Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.
ann nltk-python python rnn spacy tensorflow text-preprocessing textblob
Last synced: 09 Apr 2025
https://github.com/2pa4ul2/mcq-quiz-maker-nlp
Quizzable, a quiz generator for short reviews, built with spaCy and NLTK
flask nlp nltk python question-generation quizapp spacy
Last synced: 05 Apr 2025
https://github.com/whatevery1says/preprocessing
WE1S Preprocessing -- workflow preparing documents for import as WE1S data
digital-humanities humanities news nltk preprocessing spacy topic-modeling
Last synced: 04 Mar 2025
https://github.com/prthd/ai-powered-voice-assisted-object-locator
🔍 Real-time object detection with voice command integration using YOLOv5 (Objects365), OpenCV, MediaPipe, spaCy NLP, and SpeechRecognition. Enhances accessibility by guiding users to locate indoor objects with directional feedback relative to their position. Ideal for smart-home, accessibility tech, and assistive applications.
computer-vision nlp object-detection opencv python real-time-systems spacy speech-recognition voice-assistant yolov5
Last synced: 09 Apr 2025
https://github.com/turbolent/telescope
Go explore
compiler nlp parser question-answering scala spacy sparql
Last synced: 28 Mar 2025
https://github.com/neuledge/spacy-api
A spaCy API service
docker machine-learning microservice nlp python spacy
Last synced: 13 Apr 2025
https://github.com/thjbdvlt/solipcysme
spaCy pipeline for French focused on personal pronouns, fiction, and first-person point-of-view texts.
french french-nlp lemmatization morphological-analysis natural-language-processing nlp nlp-french normalization part-of-speech-tagging pos-tagging spacy spacy-extensions tokenization word-embeddings
Last synced: 06 Mar 2025
https://github.com/inshh04/codealpha_chatbotforfaqs_inshanadeem
The FAQ Chatbot is a Python-based conversational agent designed to interact with users and respond to frequently asked questions. It offers a simple and engaging way to provide automated responses, handle polite interactions like thanking the user, and end conversations gracefully. This project serves as a basic template for building more advanced.
chatbot faqbot faqchatbot faqs keyword-extraction nlp nlp-machine-learning progressive-web-app project python python3 pythonprojects spacy spacy-nlp
Last synced: 05 Apr 2025
https://github.com/lucasspinola/monitorbot-api
API built with FastAPI and spaCy to assist an educational bot with its activities during class.
Last synced: 07 Apr 2025
https://github.com/ivan-kleshnin/spacy-benchmarks
Comparison of Spacy performance with different architectures, corpuses, hyperparams...
clearnlp nlp penn-treebank spacy universal-dependencies universaldependencies
Last synced: 07 Mar 2025
https://github.com/datarohit/nlp-course-files
This repo contains the files for the online NLP course from Udemy.com, which I completed.
nlp nlp-machine-learning nltk numpy panda python sklearn spacy
Last synced: 09 Apr 2025
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 09 Apr 2025
https://github.com/keshabkjha/climasense
ClimaSense is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot, built using Python (Streamlit & spaCy), that responds to weather-related queries.
html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app
Last synced: 31 Mar 2025
https://github.com/araobp/bach-network
J. S. Bach's network with spaCy(NLP)
Last synced: 11 Mar 2025
https://github.com/aitechhero/nonullsense-nlp
Natural Language Processing (NLP) with libraries like spaCy, Transformers, and NLTK.
ai artificial-intelligence huggingface natural-language-processing nlp nltk python spacy text-analysis transformers
Last synced: 26 Feb 2025
https://github.com/anquetos/nasa-apod-database
etl-pipeline galaxy image json nasa-apod object-oriented-programming pandas pillow space spacy
Last synced: 05 Apr 2025
https://github.com/turbolent/spacy-http
spaCy as an HTTP service
api named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy
Last synced: 22 Mar 2025
https://github.com/xsenzaki/automated-essay-checker
A project requirement for the subject 'CS303 - Automata Theory'
aes automated-essay-grading automated-essay-scoring automated-essay-scoring-system cosine-similarity jaccard-similarity natural-language-processing nlp pyqt6 python spacy spacy-nlp
Last synced: 25 Feb 2025
https://github.com/ivan-abernado/news-scrapping-en-es-visualization-sanalysis
News scraping and word clouds (EN/ES), using newspaper3k for news scraping, spaCy for text preprocessing, and WordCloud for word cloud plotting
english matplotlib-pyplot newspaper3k nlp-machine-learning pandas python regex scrapping spacy spanish tqdm wordcloud-library wordcloud-visualization
Last synced: 25 Feb 2025
https://github.com/tbarlow12/wiki-answer
I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions
nlp python question-answering spacy wikipedia
Last synced: 28 Mar 2025
https://github.com/veldhub/veld_chain__mara_load_and_publish_models
Chain velds for publishing self-trained MARA models to huggingface.
Last synced: 14 Mar 2025
https://github.com/viniciusmecosta/cvclassifier
A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.
catboost fastapi python3 sklearn spacy
Last synced: 05 Mar 2025
https://github.com/403errors/ai-docparser
An application framework developed using the latest AI technologies to extract the values of specific pre-defined keys from a given PDF document. It also generates a document summary from the keys and values extracted along the way.
automation csv-export nlp pdf-files python3 regex reinforcement-learning spacy
Last synced: 14 Mar 2025
https://github.com/lfoppiano/docker-image-spacy
Docker image for shipping spacy
Last synced: 05 Apr 2025
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 06 Apr 2025
https://github.com/parthapray/pii_scrubbing_llm
This repo contains code for heuristic PII scrubbing searches performed before calling an LLM (local or remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 06 Apr 2025
https://github.com/arvind-4/summarizer
Simple Summarizer using python
flask flask-app flask-applications gunicorn-flask-webserver gunicorn-service html html5 javascript python python-dotenv pythoon-3 spacy summarizer textblob vanilla-javascript
Last synced: 17 Feb 2025
https://github.com/ashenoooone/semantic-book-analyzer
Web service for extracting keywords from the introductions of discrete mathematics books in PDF format. Frontend: React.js, Webpack, FSD, RTK, TypeScript. Backend: FastAPI, FastAPI Users, SQLAlchemy, Pydantic, Pymorphy3, spaCy. Includes authorization, registration, and request history. 📚🔍
fastapi fastapi-users nlp pymorphy2 pymorphy3 python3 reactjs rtk rtkquery spacy spacy-nlp sqlalchemy typescript
Last synced: 05 Mar 2025
https://github.com/wesslen/textcat-reddit-cooking
spaCy Textcat model on relevant Reddit Cooking
Last synced: 06 Apr 2025
https://github.com/sukanyadutta52/topic_modeling
What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?
climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy
Last synced: 17 Feb 2025
https://github.com/tristan-mcinnis/spacy-models-setup-and-testing
A Python utility for downloading, storing, and testing Spacy language models for English and Chinese NLP tasks.
chinese english nlp python simple-project spacy testing
Last synced: 04 Apr 2025
https://github.com/rafelafrance/angiospermtraiter
Using rule-based parsers to extract information from plant treatments
Last synced: 12 Apr 2025
https://github.com/veldhub/veld_chain__apis_ner_transform_to_gold
Chain velds encapsulating extraction and conversion of gold data.
named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 14 Mar 2025
https://github.com/veldhub/veld_code__spacy
Code velds encapsulating usage of spaCy.
Last synced: 14 Mar 2025
https://github.com/veldhub/veld_chain__akp_ner_inference
A chain veld encapsulating NER inference.
named-entity-recognition ner nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 14 Mar 2025
https://github.com/veldhub/veld_data__akp_ner_linkedcat
Data veld containing machine-inferred named entities and context data.
nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 14 Mar 2025
https://github.com/jwalsh/syntree-generator
A tool for converting French literary text into S-expression syntax trees for linguistic analysis, with visualization capabilities
abstract-syntax-tree constituency-parsing emacs french linguistics literary-analysis nlp org-mode parser proust python s-expression spacy syntax-analysis syntax-tree
Last synced: 05 Mar 2025
https://github.com/thjbdvlt/spacy-french-parser
Syntactic dependency parser for French using spaCy
french nlp nlp-french spacy spacy-parser syntactic-dependency-parsing universal-dependencies
Last synced: 05 Apr 2025
https://github.com/trikztr/gptscrape
GPTScrape: A tool for web scraping that uses spaCy for NLP and GPT4All for converting scraped text into structured JSON.
ai data-extraction data-scraping gpt gpt4all llm npl python scraping spacy spacy-nlp web-scraping
Last synced: 05 Apr 2025
https://github.com/thjbdvlt/spacy-presque
Word normalization (French) for spaCy
french nlp normalization spacy spacy-extensions
Last synced: 05 Apr 2025
https://github.com/coueghlani/nlp
Natural Language Processing and Data Analysis project
mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy
Last synced: 05 Apr 2025
https://github.com/presizhai/rmp-ai-assistant
This project implements a RAG system for a Rate My Professor service, leveraging Pinecone for vector storage and OpenAI for text embeddings. It preprocesses professor reviews using SpaCy for cleaning and sentiment analysis, enabling the AI assistant to provide more nuanced recommendations and insights based on student queries.
generative-ai large-language-model natural-language-processing openai software-development software-engineering spacy
Last synced: 05 Apr 2025
https://github.com/thjbdvlt/spacy-viceverser
French lemmatization with hunspell and spaCy
french hunspell lemmatization nlp nlp-french spacy
Last synced: 05 Apr 2025
https://github.com/atharvapathak/customer_sentiment_analysis
Customer sentiment analysis is the process of using natural language processing (NLP) and machine learning techniques to analyze and understand the feelings, opinions, and attitudes expressed by customers in textual data, such as reviews, feedback, and social media posts.
cnn naive-bayes nlp nltk spacy stemming text-mining tokenization
Last synced: 05 Apr 2025
https://github.com/lexxai/goit_python_ds_hw_12
Module 12. NLP basics.
nlp nlp-machine-learning nlp-spacy nltk nltk-tokenizer spacy spacy-nlp
Last synced: 05 Apr 2025
https://github.com/zofiaqlt/nlp_libraries_tweets_analysis
🎯 Exploration of NLP libraries (nltk, spacy) and tweets analysis - use of Python and JupyterLab (Data collection, Cleaning, EDA, Classification, and Data Visualization)
Last synced: 01 Mar 2025
https://github.com/chewzzz1014/fyp
backend fastapi final-year-project flair fyp machine-learning ner project spacy transformer
Last synced: 22 Mar 2025
https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review
"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"
matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis
Last synced: 28 Mar 2025
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 07 Apr 2025
https://github.com/free-analytics/nlp
Natural language processing
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 26 Feb 2025
https://github.com/stephenombuya/ai-powered-writing-assistant
An advanced writing assistant that helps users improve their writing through grammar checking, style analysis, and intelligent suggestions.
flask-application pytest python3 spacy sqlalchemy sqlite3 textblob-sentiment-analysis writing-assistant
Last synced: 26 Feb 2025
https://github.com/blue-codes-yep/AI.AT
AI-Powered Text-To-Speech Script Generator. This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.
ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp
Last synced: 06 Jan 2025
https://github.com/ahmedabdalkreem/grammer-auto-correct
This project classifies a phrase as correct or incorrect. If the phrase is correct, it is printed as-is; if it is incorrect, it is passed as input to a transfer-learning model and the phrase before correction is printed.
decision-trees logistic-regression machine-learning matplotlib-pyplot naive-bayes-classifier nlp nltk-library pandas-library python random-forest sklearn spacy svm-model transfer-learning
Last synced: 06 Mar 2025