An open API service indexing awesome lists of open source software.

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/jromero132/bachelor_thesis_code

Code of my Bachelor Thesis in Computer Science at the University of Havana, Cuba.

artificial-intelligence medline medlineplus natural-language-processing nlp nltk nltk-python ontology-learning python3 spacy spacy-nlp

Last synced: 07 Apr 2025

https://github.com/bonysmoke/speliuk

A more accurate spelling correction for the Ukrainian language.

correction kenlm spacy spelling symspell ukrainian

Last synced: 09 Feb 2025

https://github.com/whatevery1says/preprocessing

WE1S Preprocessing -- workflow preparing documents for import as WE1S data

digital-humanities humanities news nltk preprocessing spacy topic-modeling

Last synced: 04 Mar 2025

https://github.com/md-emon-hasan/nlp-codebasics

Collection of basic Natural Language Processing examples that cover essential techniques like tokenization, text representation, and text classification.

bag-of-words bow gensim gensim-word2vec lematization nlp nlp-library nlp-machine-learning nltk nltk-python python3 spacy text-classification text-processing tokenization

Last synced: 22 Feb 2025

https://github.com/bghorvath/TextMiningTheBechdelTest

Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test

bechdel bechdel-test coreference-resolution neuralcoref spacy

Last synced: 09 May 2025

https://github.com/sukanyadutta52/sentiment-analysis

An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis

flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard

Last synced: 28 Mar 2025

https://github.com/kailejie/ner

This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.

bert ner nltk spacy

Last synced: 05 Apr 2025

https://github.com/izuna385/arxiv-checker

Single Page Application and its deployment for GCE.

docker docker-compose fastapi nginx react react-bootstrap spacy tdd

Last synced: 28 Mar 2025

https://github.com/oroszgy/spacy-tokenizer-benchmark

Quick and dirty scripts to measure the performance of spaCy

benchmark natural-language-processing nlp python spacy tokenizer

Last synced: 28 Mar 2025

https://github.com/inshh04/codealpha_chatbotforfaqs_inshanadeem

The FAQ Chatbot is a Python-based conversational agent designed to interact with users and respond to frequently asked questions. It offers a simple and engaging way to provide automated responses, handle polite interactions like thanking the user, and end conversations gracefully. This project serves as a basic template for building more advanced.

chatbot faqbot faqchatbot faqs keyword-extraction nlp nlp-machine-learning progressive-web-app project python python3 pythonprojects spacy spacy-nlp

Last synced: 05 Apr 2025

https://github.com/srstevenson/keyword-extractor

Extract keywords from plain text documents

nlp spacy tf-idf

Last synced: 20 Nov 2024

https://github.com/karimosman89/legal-document-nlp

Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements.Use NLP techniques for named entity recognition and text classification.Streamline the review process for legal teams by automating information extraction.

nltk python scikit-learn spacy

Last synced: 19 Feb 2025

https://github.com/surajiyer/topic-analysis

Python library to perform topic detection on textual data that are generated over time.

agglomerative-clustering gaussian-mixture-models nlp spacy spectral-clustering textual-data topic-analysis topic-modeling

Last synced: 29 Mar 2025

https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques

using a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text

healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp

Last synced: 05 Apr 2025

https://github.com/omar7tech/text-summarization

This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.

natural-language-processing python spacy text-summarization tokenization

Last synced: 05 Apr 2025

https://github.com/laurenzv/covbot

A small chatbot written as part of my bachelor thesis.

chatbot corenlp covid-19 docker python spacy sqlite vuejs

Last synced: 05 Apr 2025

https://github.com/jonathanfox5/lemon_tizer

LemonTizer is a class that wraps the spacy library to build a lemmatizer for language learning applications.

lemmatization lemmatizer spacy wrapper

Last synced: 10 Apr 2025

https://github.com/izuna385/pubtator-multiprocess-parser

Specifically for Entity Linking. Quick demo with MedMentions and NCBI datasets is also included.

allennlp bioinformatics entity-disambiguation entity-linking natural-language-processing pubtator spacy

Last synced: 28 Mar 2025

https://github.com/florensadimer/nlp_ner_soccer_pt-br

Anotação Manual e Comparação com Modelos Treinados

annotation llm machine-learning ner nlp spacy

Last synced: 12 Apr 2025

https://github.com/snehadharne/vaers-symptomextractionwithai

VAERS Adverse Event Analysis for COVID 19 Vaccine : A hybrid approach combining LLMs (Gemini 1.5 Flash) and statistical methods for enhanced vaccine safety signal detection. Analyzes temporal and associative relationships in VAERS symptom data.

apriori-algorithm associative-analysis gemini-flash ner ollama pandas spacy symptom-analysis temporal-analysis

Last synced: 14 Apr 2025

https://github.com/ccoreilly/spacy-catala-generator

Training and dataset used for the catalan spacy model

catala catalan catalan-language spacy spacy-models

Last synced: 04 Apr 2025

https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system

The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....

cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer

Last synced: 09 Apr 2025

https://github.com/fferegrino/zeldakg

A TLOZ inspired knowledge graph

infobox knowledge-graph nltk pandas python spacy wikidata

Last synced: 15 Dec 2024

https://github.com/arjunravi26/chatbot-ai

A chatbot for responding to AI related queries

langchain langchain-community pinecone python rag regrex spacy stramlit

Last synced: 23 Feb 2025

https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon

An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.

matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud

Last synced: 17 Mar 2025

https://github.com/bjam24/agh-natural-language-processing

This respository contains projects made for the NLP course at the AGH UST in 2024 / 2025. They received maximum grade 5.0.

agh elasticsearch language-modeling language-modelling levenshtein llm ner neural-search nlp prompt-enginering question-answering rag regex spacy text-classificaiton text-classification

Last synced: 17 Mar 2025

https://github.com/debugger404/multilanguage-pos

Named Entity Recognition with SpaCy - 🌐📝 Repository for NER using SpaCy's MultiLanguage module. Supports multiple languages.

multilanguage named-entity-recognition ner python3 spacy

Last synced: 08 Apr 2025

https://github.com/wesslen/spacy-ecfr-ner

spaCy-Prodigy workflow for NER Citation model on eCFR Banking Regulation

nlp prodigy spacy

Last synced: 06 Apr 2025

https://github.com/2pa4ul2/mcq-quiz-maker-nlp

Quizzable a quiz generator for short reviews with Spacy and NLTK

flask nlp nltk python question-generation quizapp spacy

Last synced: 05 Apr 2025

https://github.com/thyripian/core

This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.

api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite

Last synced: 01 Apr 2025

https://github.com/jblake1965/elucidoc

Screens legal text and extracts sentences containing user input party name-predicate phrases

excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word

Last synced: 17 Mar 2025

https://github.com/aadityasivas/spacy-text-summarization

A simple text summarizer built with spaCy

jupyter-notebook nlp python spacy

Last synced: 09 Apr 2025

https://github.com/vidhi1290/chatbot-with-rasa-nlu-model-and-python

This project builds an intelligent chatbot using Rasa NLU for an E-Commerce business 🛍️. The chatbot can handle user queries like product information, pricing, and order management 💬. With spacy and TensorFlow pipelines 🧠 for training, and MongoDB for storing data 📦, it offers seamless, context-aware conversations

aichatbot artificial-intelligence chatbot jupyter-notebook matplotlib nlu nlu-chatbot pandas pymongo python rasa-chatbot rasa-nlu spacy spacy-nlp tensorflow

Last synced: 09 Apr 2025

https://github.com/udit-rawat/whisper-space

An ASR Gradio GUI based project that transcript the audion and provides NLP based analysis.

asr gradio nlp spacy whisper

Last synced: 22 Mar 2025

https://github.com/lilivalgo/analisis_reportes_onu_cambio_climatico

Web Scraping, manipulación de files.PDF, NPL con SpaCy

beautifulsoup4 pandas pypdf2 python requests spacy wordcloud

Last synced: 28 Mar 2025

https://github.com/ajaykumar095/natural_language_processing

Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.

ann nltk-python python rnn spacy tensorflow text-preprocessing textblob

Last synced: 09 Apr 2025

https://github.com/csfelix/nlp-0-spacy-course

💬 Advanced NLP with Spacy Course

natural-language-processing nlp python spacy

Last synced: 26 Mar 2025

https://github.com/prthd/ai-powered-voice-assisted-object-locator

🔍 Real-time object detection with voice command integration using YOLOv5 (Objects365), OpenCV, MediaPipe, spaCy NLP, and SpeechRecognition. Enhances accessibility by guiding users to locate indoor objects with directional feedback relative to their position. Ideal for smart-home, accessibility tech, and assistive applications.

computer-vision nlp object-detection opencv python real-time-systems spacy speech-recognition voice-assistant yolov5

Last synced: 09 Apr 2025

https://github.com/jash271/youglance

Package for analyzing Youtube Videos from searching by relevant entities to analyzing sentiments and clustering different parts of the video according to your liking

cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling

Last synced: 22 Mar 2025

https://github.com/aiatyourservice/deeplearningforcoders

Hey, this repo contains code from deep learning specialization by Andrew NG

deep-learning nltk python pytorch spacy

Last synced: 29 Mar 2025

https://github.com/rkirlew/custom-resume-ner-model-development-with-spacy

I developed a custom Named Entity Recognition (NER) model using spaCy. The process involved manually annotating data, training the model, and evaluating its performance on unseen text. This project provided hands-on experience in working with NLP models, data annotation, and model training pipelines.

machine-learning named-entity-recognition ner spacy spacy-nlp

Last synced: 01 Mar 2025

https://github.com/ayushmaanfcb/resume-and-name-card-entity-detection

This project aims at automating the process of hiring for major companies. Instead of going through the tedious process of manually going through tons of applications, the application aims at recognizing essential entities and information from the resumes.

docker fastapi gradio machine-learning mongodb named-entity-recognition natural-language-processing python spacy

Last synced: 26 Feb 2025

https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect

Модель NER для определения названий компаний, стандартов госта и названий товаров с точностью 97%

named-entity-recognition ner python spacy spacy-models

Last synced: 05 Apr 2025

https://github.com/touradbaba/nlp-notebooks

This repository contains Jupyter notebooks on various NLP techniques, including text processing, classification, sentiment analysis, and topic modeling.

machine-learning nlp nltk sentiment-analysis spacy text-classification text-processing topic-modeling

Last synced: 08 Feb 2025

https://github.com/oroszgy/cookiecutter-ml-flask

Cookiecutter template for training and serving machine learning models with scikit-learn, spacy, Flask and Docker

docker flask flask-application machine-learning nlp rest-api scikit-learn spacy

Last synced: 28 Mar 2025

https://github.com/oroszgy/mltools

Common utility methods and classes to ease the work with sklearn, spacy, pandas, matplotlib

data-science machine-learning nlp pandas sklearn sklearn-compatible spacy tools

Last synced: 28 Mar 2025

https://github.com/cmilamaya/flight-dashboard-app

This project is an application that processes attached PDF documents containing flight information and extracts relevant data. The data is stored in a PostgreSQL database and visualized on a dynamic dashboard using Streamlit.

pandas pdfplumber python spacy

Last synced: 05 Apr 2025

https://github.com/meefs/entseeker

entseeker is a command-line tool for Named Entity Recognition (NER) and web entity searches in text files. It uses spaCy's NLP capabilities for standard named entities and custom rules for web-related entities.

ai named-entity-recognition spacy spacy-nlp text-classification text-processing

Last synced: 05 Apr 2025

https://github.com/sudeatesoglu/nlp-document-processor

An NLP tool for processing documents in different formats with functionalities of similarity score detection, highlighting given pattern and similar words between PDFs, and NER extraction.

nlp spacy text-processing

Last synced: 05 Apr 2025

https://github.com/thjbdvlt/litteralement

schéma de base de données postgresql EAV hybride pour l'analyse de textes en français

eav french nlp nlp-french postgresql spacy sql

Last synced: 05 Apr 2025

https://github.com/wanjage/charles-burney-digital

Digitale Aufbereitung, Anreicherung und Geovisualisierung eines Reiseberichts des Musikhistorikers Charles Burney, mithilfe von Transkribus, Spacy-NER und Nodegoat

geovisualisierung ner nlp nodegoat reisebericht spacy

Last synced: 05 Apr 2025

https://github.com/cano1998/sentiment-analysis-report-for-amazon-product-reviews

Sentiment analysis of Amazon product reviews. The analysis provides insights into customer sentiment and opinions regarding specific products sold on Amazon.

pdf pdf-generation sentiment-analysis spacy text-blob

Last synced: 05 Apr 2025

https://github.com/dmytrovoytko/mlops-spacy-sentiment-analysis

MLOps project Training and Deployment of Spacy model for Sentiment analysis

amazon ml-engineering mlflow mlops nlp prefect sentiment-analysis spacy text-classification

Last synced: 05 Apr 2025

https://github.com/nanditha-prabhu/qa-system-via-srl

Question Answering System via Semantic Role Labeling Using Token Classification and Parsing Techniques

bert qa-system spacy srl

Last synced: 05 Apr 2025

https://github.com/atharvapathak/customer_service_chatbot

Customer Service Chatbot Repository includes a range of features for building custom chatbots that can handle customer service queries and support requests. These features include NLP capabilities and pre-built dialog flows that can help chatbots understand and respond to customer.

chatbot database dialogflow nlp nltk reinforcement-learning restful-api spacy tensorflow

Last synced: 05 Apr 2025

https://github.com/muhammadshavaiz/ai_learning

Google Colab notebooks showcasing PyTorch implementations and experiments. Covers deep learning techniques, including neural networks and NLP concepts.

deep-learning nlp python pytorch spacy

Last synced: 05 Apr 2025

https://github.com/atharvapathak/twitter_sentiment_analysis_project

Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.

api bag-of-words bert cnn data gbm nltk rnn spacy twitter

Last synced: 05 Apr 2025

https://github.com/foxbenjaminfox/simil

CLI for semantic string similarity

glove machine-learning python spacy string-similarity

Last synced: 05 Apr 2025

https://github.com/hansalemaos/spacy2df

converts a spaCy object into a pandas DataFrame

dataframe nlp pandas spacy

Last synced: 05 Apr 2025

https://github.com/shadbalti/simple-chatbot

This is a simple chatbot created using Python and spaCy. The chatbot can respond to common questions and perform specific tasks.

ai bots chatbot python spacy

Last synced: 05 Apr 2025

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 05 Apr 2025

https://github.com/hackerajofficial/chatbot

ChatBot capable of answering user queries while also integrating a conversational form to collect user information such as Name, Email, Phone Number, and Address using Python with Django

chat-application chatbot chatbots chatterbot django hackeraj hackerajofficial spacy spacy-nlp

Last synced: 05 Apr 2025

https://github.com/pavithra-hn/text-summarizer

The Text Summarizer is a web-based application that allows users to input a piece of text and receive a summarized version of that text. The summarization is performed using NLP techniques to extract key information and provide a concise summary.

flask html-css-javascript nlp-library nltk python spacy

Last synced: 05 Apr 2025

https://github.com/karimosman89/resume-screening

Screen resumes to identify the best candidates.Build a machine learning model that screens resumes and ranks candidates based on job descriptions.Streamline the hiring process for HR departments by automating candidate screening.

machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing

Last synced: 28 Apr 2025

https://github.com/abdiasarsene/mapping_and_evolution_of_circular_business_models

This repository analyzes trends in circular business models using NLP techniques like LDA for thematic analysis, word clouds for visualization, and spaCy for semantic exploration.

lda-model seaborn sklearn spacy wordcloud-visualization

Last synced: 13 Mar 2025

https://github.com/itsdaiton/named-entity-visualizer

NEV short for Named Entity Visualizer is a tool to visualize entities found in unstructured text built in Python.

named-entity-linking named-entity-recognition natural-language-processing nlp-pipeline python spacy wikidata

Last synced: 05 Apr 2025

https://github.com/direct-phonology/phony

phonology in spaCy!

linguistics nlp phonology python spacy

Last synced: 13 Mar 2025

https://github.com/asrot0/spacy_ner

SpaCy-based NER🧠 implementation for extracting and classifying entities from text✨

machine-learning ner nlp spacy textclassification

Last synced: 16 Feb 2025

https://github.com/ntinouldinho/machine-learning-classification-and-speech-generation

Explored Greek Parliament Proceedings and tried to classify each speech to a corresponding parliamentary political party.

artificial-intelligence classification-machine-learning machine-learning neural-networks pandas python sklearn spacy

Last synced: 29 Mar 2025

https://github.com/thekartikeyamishra/resumeevaluatorapp

The Automated Resume Evaluator is a Python-based application that helps evaluate resumes against job descriptions. It calculates an Applicant Tracking System (ATS) score, which is the percentage of keywords from the job description found in the resume.

flask machine-learning matplotlib nlp nltk pypdf python scikit-learn spacy textblob

Last synced: 29 Mar 2025

https://github.com/huspacy/demo

HuSpaCy Streamlit Demo

demo huspacy nlp spacy

Last synced: 21 Mar 2025

https://github.com/manik2000/radiohead-lyrics

NLP analysis of Radiohead's songs lyrics.

embeddings huggingface-transformers nlp spacy

Last synced: 04 Apr 2025

https://github.com/dagmawi-22/hotel-ai

Hotel Customer Support Chatbot Rest API

django nltk pyspellchecker python spacy

Last synced: 04 Apr 2025

https://github.com/defrecord/literate-spacy

A literate programming implementation of a spaCy-based NLP tool using Org mode

fastapi literate-programming nlp org-mode python spacy

Last synced: 21 Mar 2025

https://github.com/abhayy-kumar/emotion-detection-system

A machine learning and NLP approach for classifying emotions in text comments.

emotion-detection machine-learning nlp python spacy text-classification

Last synced: 21 Mar 2025

https://github.com/rggh/api-4

Using FastAPI with spaCy to identify entities

docker fastapi python spacy

Last synced: 28 Mar 2025

spaCy Awesome Lists
spaCy Categories