An open API service indexing awesome lists of open source software.

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/pabvald/chatbot

**RelintBot1** - Prototype of a QnA chatbot for the service of International Relations of the University of Valladolid (UVa)

bachelor-thesis chatbot prototype question-answering spacy

Last synced: 23 Mar 2025

https://github.com/bonysmoke/speliuk

A more accurate spelling correction for the Ukrainian language.

correction kenlm spacy spelling symspell ukrainian

Last synced: 09 Feb 2025

https://github.com/sukanyadutta52/sentiment-analysis

An Analysis of How Machines Perceive Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis

flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard

Last synced: 28 Mar 2025

https://github.com/srstevenson/keyword-extractor

Extract keywords from plain text documents

nlp spacy tf-idf

Last synced: 20 Nov 2024

https://github.com/karimosman89/legal-document-nlp

Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements. Use NLP techniques for named entity recognition and text classification. Streamline the review process for legal teams by automating information extraction.

nltk python scikit-learn spacy

Last synced: 19 Feb 2025

https://github.com/fferegrino/zeldakg

A TLOZ-inspired knowledge graph

infobox knowledge-graph nltk pandas python spacy wikidata

Last synced: 15 Dec 2024

https://github.com/debugger404/multilanguage-pos

Named Entity Recognition with SpaCy - 🌐📝 Repository for NER using SpaCy's MultiLanguage module. Supports multiple languages.

multilanguage named-entity-recognition ner python3 spacy

Last synced: 08 Apr 2025

https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system

The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....

cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer

Last synced: 09 Apr 2025

https://github.com/thyripian/core

This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.

api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite

Last synced: 01 Apr 2025

https://github.com/aadityasivas/spacy-text-summarization

A simple text summarizer built with spaCy

jupyter-notebook nlp python spacy

Last synced: 09 Apr 2025

https://github.com/vidhi1290/chatbot-with-rasa-nlu-model-and-python

This project builds an intelligent chatbot using Rasa NLU for an E-Commerce business 🛍️. The chatbot can handle user queries like product information, pricing, and order management 💬. With spaCy and TensorFlow pipelines 🧠 for training, and MongoDB for storing data 📦, it offers seamless, context-aware conversations.

aichatbot artificial-intelligence chatbot jupyter-notebook matplotlib nlu nlu-chatbot pandas pymongo python rasa-chatbot rasa-nlu spacy spacy-nlp tensorflow

Last synced: 09 Apr 2025

https://github.com/ajaykumar095/natural_language_processing

Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.

ann nltk-python python rnn spacy tensorflow text-preprocessing textblob

Last synced: 09 Apr 2025

https://github.com/prthd/ai-powered-voice-assisted-object-locator

🔍 Real-time object detection with voice command integration using YOLOv5 (Objects365), OpenCV, MediaPipe, spaCy NLP, and SpeechRecognition. Enhances accessibility by guiding users to locate indoor objects with directional feedback relative to their position. Ideal for smart-home, accessibility tech, and assistive applications.

computer-vision nlp object-detection opencv python real-time-systems spacy speech-recognition voice-assistant yolov5

Last synced: 09 Apr 2025

https://github.com/datarohit/nlp-course-files

This repo contains the files for the online NLP course from Udemy.com, which I completed.

nlp nlp-machine-learning nltk numpy panda python sklearn spacy

Last synced: 09 Apr 2025

https://github.com/keshabkjha/climasense

ClimaSense is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot, built using Python (Streamlit & spaCy), that responds to weather-related queries.

html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app

Last synced: 31 Mar 2025

https://github.com/araobp/bach-network

J. S. Bach's network with spaCy (NLP)

graphology spacy visjs

Last synced: 11 Mar 2025

https://github.com/isabelleysseric/question-answering

Building a natural-language question-and-answer search engine over a corpus, in Python.

corpus deep-learning nlp qa question-answering spacy whoosh

Last synced: 20 Feb 2025

https://github.com/rkirlew/custom-resume-ner-model-development-with-spacy

I developed a custom Named Entity Recognition (NER) model using spaCy. The process involved manually annotating data, training the model, and evaluating its performance on unseen text. This project provided hands-on experience in working with NLP models, data annotation, and model training pipelines.

machine-learning named-entity-recognition ner spacy spacy-nlp

Last synced: 01 Mar 2025

https://github.com/bghorvath/TextMiningTheBechdelTest

Text mining movie scripts to explore the long-term trend of female representation in movies according to the Bechdel test

bechdel bechdel-test coreference-resolution neuralcoref spacy

Last synced: 09 May 2025

https://github.com/tomhalloin/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 10 May 2025

https://github.com/jtlicardo/process-visualizer-web

Web interface for the process-visualizer project

bert bpmn nlp openai spacy

Last synced: 09 May 2025

https://github.com/randika00/ism-web-automation-y23cp-web

Web scraping refers to the extraction of data from a website. Be it a spreadsheet or an API.

2captcha-api beautifulsoup regex scrapy selenium spacy webdriver

Last synced: 28 Mar 2025

https://github.com/csfelix/nlp-0-spacy-course

💬 Advanced NLP with Spacy Course

natural-language-processing nlp python spacy

Last synced: 26 Mar 2025

https://github.com/lilivalgo/analisis_reportes_onu_cambio_climatico

Web scraping, PDF file manipulation, and NLP with spaCy

beautifulsoup4 pandas pypdf2 python requests spacy wordcloud

Last synced: 28 Mar 2025

https://github.com/jblake1965/elucidoc

Screens legal text and extracts sentences containing user input party name-predicate phrases

excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word

Last synced: 17 Mar 2025

https://github.com/bjam24/agh-natural-language-processing

This repository contains projects made for the NLP course at the AGH UST in 2024/2025. They received the maximum grade of 5.0.

agh elasticsearch language-modeling language-modelling levenshtein llm ner neural-search nlp prompt-enginering question-answering rag regex spacy text-classificaiton text-classification

Last synced: 17 Mar 2025

https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon

An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.

matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud

Last synced: 17 Mar 2025

https://github.com/arjunravi26/chatbot-ai

A chatbot for responding to AI-related queries

langchain langchain-community pinecone python rag regrex spacy stramlit

Last synced: 23 Feb 2025

https://github.com/izuna385/pubtator-multiprocess-parser

Specifically for Entity Linking. Quick demo with MedMentions and NCBI datasets is also included.

allennlp bioinformatics entity-disambiguation entity-linking natural-language-processing pubtator spacy

Last synced: 28 Mar 2025

https://github.com/jonathanfox5/lemon_tizer

LemonTizer is a class that wraps the spacy library to build a lemmatizer for language learning applications.

lemmatization lemmatizer spacy wrapper

Last synced: 10 Apr 2025

https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques

Using a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text

healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp

Last synced: 05 Apr 2025

https://github.com/omar7tech/text-summarization

This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.

natural-language-processing python spacy text-summarization tokenization

Last synced: 05 Apr 2025

https://github.com/md-emon-hasan/nlp-codebasics

Collection of basic Natural Language Processing examples that cover essential techniques like tokenization, text representation, and text classification.

bag-of-words bow gensim gensim-word2vec lematization nlp nlp-library nlp-machine-learning nltk nltk-python python3 spacy text-classification text-processing tokenization

Last synced: 22 Feb 2025

https://github.com/kailejie/ner

This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.

bert ner nltk spacy

Last synced: 05 Apr 2025

https://github.com/henx117/chatbot

My chatbot python project

chatbot python python3 spacy

Last synced: 12 Apr 2025

https://github.com/oroszgy/spacy-tokenizer-benchmark

Quick and dirty scripts to measure the performance of spaCy

benchmark natural-language-processing nlp python spacy tokenizer

Last synced: 28 Mar 2025

https://github.com/robgc/sento-processing

A Natural Language Processing tool designed to perform sentiment analysis on tweets and store the results obtained.

async asyncpg nlp python sentiment-analysis spacy spacy2

Last synced: 12 Apr 2025

https://github.com/2pa4ul2/mcq-quiz-maker-nlp

Quizzable: a quiz generator for short reviews, built with spaCy and NLTK

flask nlp nltk python question-generation quizapp spacy

Last synced: 05 Apr 2025

https://github.com/cllspy/nlp-playground

application to understand key concepts of nlp

ml nlp spacy

Last synced: 02 Apr 2025

https://github.com/prcharan592/social-media-sentiment-analysis

Social media sentiment analysis using tweets involves analyzing tweet data to determine public sentiment (positive, negative, or neutral) using natural language processing (NLP) and machine learning techniques.

data-visualization machine-learning matplotlib nlp nltk numpy pandas python3 sentiment-analysis spacy tweets

Last synced: 09 Apr 2025

https://github.com/yashaswini-lankalapalli/text-summarization

Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.

nlp python spacy transformers

Last synced: 28 Mar 2025

https://github.com/mugambi645/basic-spacy-nlp

Basic NLP with spacy

nlp spacy

Last synced: 05 Mar 2025

https://github.com/ntinouldinho/machine-learning-classification-and-speech-generation

Explored Greek Parliament Proceedings and tried to classify each speech to a corresponding parliamentary political party.

artificial-intelligence classification-machine-learning machine-learning neural-networks pandas python sklearn spacy

Last synced: 29 Mar 2025

https://github.com/arnabd64/spacy-ner-hf-space

A webapp built using Gradio for demonstrating the capabilities of the Spacy NER pipeline.

gradio huggingface-spaces named-entity-recognition nlp spacy spacy-pipeline token-classification

Last synced: 08 Feb 2025

https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods

Text-Summarizer-Using-NLP-and-TF-IDF-Methods

nlp spacy text-summarization

Last synced: 29 Mar 2025

https://github.com/wesslen/textcat-reddit-cooking

spaCy Textcat model on relevant Reddit Cooking

prodigy spacy textcat

Last synced: 06 Apr 2025

https://github.com/vuchkov/tdd-python-llp

TDD in Python with Flask/Rest API and ML/LLP covered by Selenium tests - test-driven development, large language processing, machine learning

flask llp ml python rest-api selenium selenium-python spacy tdd tests

Last synced: 31 Mar 2025

https://github.com/salma-4/nlp-task

Preprocessing using NLTK and spaCy

nltk-library python spacy svm-model

Last synced: 15 Mar 2025

https://github.com/imvladikon/spacy-trankit

💥 Trankit models directly in spaCy💥

nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit

Last synced: 23 Mar 2025

https://github.com/tusharthakur8267/ocr_sentiment_analysis_text_summarization

This project extracts text from images using OCR, analyzes sentiment (positive, negative, neutral), and summarizes text for quick insights. It utilizes Python, Tesseract-OCR, NLP libraries, and Flask/FastAPI for deployment.

flask image-processing nlp opencv pil python sentiment-analysis spacy tesser

Last synced: 19 Apr 2025

https://github.com/ashenoooone/semantic-book-analyzer

Web service for extracting keywords from the introductions of discrete mathematics books in PDF format. Frontend: React.js, Webpack, FSD, RTK, TypeScript. Backend: FastAPI, FastAPI Users, SQLAlchemy, Pydantic, Pymorphy3, spaCy. Includes authentication, registration, and request history. 📚🔍

fastapi fastapi-users nlp pymorphy2 pymorphy3 python3 reactjs rtk rtkquery spacy spacy-nlp sqlalchemy typescript

Last synced: 05 Mar 2025

https://github.com/ayaz-amin/speechpos

A simple Python script that tags speech to parts-of-speech

deep-learning machine-learning python3 spacy

Last synced: 24 Mar 2025

https://github.com/parthapray/pii_scrubbing_llm

This repo contains code for heuristic-search PII scrubbing applied before calling an LLM (local or remote)

chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn

Last synced: 06 Apr 2025

https://github.com/cmll21/nyt

This repository contains solvers for two New York Times puzzles: Wordle and Connections.

connections nlp nltk puzzles solver spacy wordle

Last synced: 01 Mar 2025

https://github.com/vanheemstrasystems/spacy

SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.

spacy

Last synced: 11 May 2025

https://github.com/ahmedkhaled404/ner-with-spacy

Named entity recognition using traditional NLP methods

machine-learning matplotlib ner nlp nlp-machine-learning python spacy

Last synced: 05 Apr 2025

https://github.com/michaelkinfu/hknews-headline-analysis

The Hongkong News headline analysis project was conducted by the Chinese University of Hong Kong Library.

beautifulsoup deep-learning digital-scholarship folium historical-newspapers machine-learning spacy yolov5

Last synced: 05 Apr 2025

https://github.com/asrot0/spacy_ner

SpaCy-based NER🧠 implementation for extracting and classifying entities from text✨

machine-learning ner nlp spacy textclassification

Last synced: 12 May 2025

https://github.com/arkadiuszkaros/nlp-book-pos-extractor

This project focuses on extracting sentences from the text of two popular book series: Harry Potter and Game of Thrones. Using Natural Language Processing (NLP) techniques powered by spaCy, the project aims to identify and analyze the parts of speech (POS) for each word in a sentence.

extractor nlp part-of-speech-tagging python spacy

Last synced: 28 Mar 2025

https://github.com/direct-phonology/phony

phonology in spaCy!

linguistics nlp phonology python spacy

Last synced: 13 Mar 2025

https://github.com/iv4n-ga6l/nlp-chatbot-api

An NLP project leveraging NLTK for extracting weather data.

flask nlp-api nlp-chatbot nltk python spacy transformers

Last synced: 21 Mar 2025

https://github.com/itsdaiton/named-entity-visualizer

NEV, short for Named Entity Visualizer, is a tool built in Python to visualize entities found in unstructured text.

named-entity-linking named-entity-recognition natural-language-processing nlp-pipeline python spacy wikidata

Last synced: 05 Apr 2025

https://github.com/prateekrajsrivastav/question-answering-model

This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.

huggingface-transformers matplotlib nltk numpy pandas seaborn spacy

Last synced: 06 Apr 2025

https://github.com/abdiasarsene/mapping_and_evolution_of_circular_business_models

This repository analyzes trends in circular business models using NLP techniques like LDA for thematic analysis, word clouds for visualization, and spaCy for semantic exploration.

lda-model seaborn sklearn spacy wordcloud-visualization

Last synced: 13 Mar 2025

https://github.com/lfoppiano/docker-image-spacy

Docker image for shipping spacy

docker image spacy

Last synced: 05 Apr 2025

https://github.com/kr1shnasomani/summarai

Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)

natural-language-processing pytextrank pytorch sentencepiece spacy transformers

Last synced: 08 Apr 2025

https://github.com/defrecord/para-spacy-lisp

A bridge between Emacs and spaCy NLP processing via WebSockets

elisp emacs lisp natural-language-processing nlp python spacy websocket

Last synced: 16 Mar 2025

https://github.com/zackakil/nlp-using-word-vectors

Code resources for Central London Data Science Project Nights meetup on word vectors

machine-learning natural-language-processing nlp python spacy word-embeddings word-vectors

Last synced: 05 May 2025

https://github.com/karimosman89/resume-screening

Screen resumes to identify the best candidates. Build a machine learning model that screens resumes and ranks candidates based on job descriptions. Streamline the hiring process for HR departments by automating candidate screening.

machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing

Last synced: 28 Apr 2025

https://github.com/etienne-bobo/information-retreival_project

The goal of this project was to design a model capable of recognizing terms related to our field: INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES.

information-retrieval nlp prodigy spacy

Last synced: 27 Feb 2025

https://github.com/ledsouza/nlp-article-classification

This project aims to develop a machine learning model capable of classifying news articles into different categories based on their titles. Two different word embedding models (CBOW and Skip-gram) are trained and used to vectorize the article titles. These vectorized representations are then used to train a Logistic Regression classifier.

gensim-word2vec natural-language-processing nlp nlp-machine-learning pandas python scikit-learn spacy spacy-nlp

Last synced: 26 Mar 2025

https://github.com/kluiverjh/spacynamedentityrecognition

Example application for named entity recognition (NER) with spaCy that can optionally be packaged as an executable.

extraction keyword ner pyinstaller spacy

Last synced: 07 May 2025

https://github.com/rahul1582/named-entity-recognition

A Keras implementation of Bidirectional-LSTM for Named Entity Recognition.

bidirectional-lstm keras named-entity-recognition spacy tensorflow

Last synced: 31 Mar 2025

https://github.com/viniciusmecosta/CvClassifier

A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.

catboost fastapi python3 sklearn spacy

Last synced: 05 Mar 2025
