An open API service indexing awesome lists of open source software.

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/explosion/spacy-legacy

🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility

spacy

Last synced: 04 Feb 2025

https://github.com/f1uctus/ttc

✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].

conversation nlp nlp-apis nlp-library spacy spacy-extension spacy-nlp spacy-pipeline speaker-identification

Last synced: 13 Apr 2025

https://github.com/jash271/youglance-extension

A chrome extension that Simplies your youtube video viewing experience.Navigate directly to the part you're interested in by typing it in the Search bar and we'll handle the Rest.Search by important and frequent entities mentioned in the video and gauge an understanding of the overall Sentiment of the video

deep-learning fastapi javascript nlp oops-in-python regex spacy

Last synced: 18 Mar 2025

https://github.com/surajiyer/spacycake

Simple keyphrase extraction extensions and pipeline components for spaCy.

keyphrase-extraction natural-language-processing nlp spacy spacy-extension spacy-pipeline

Last synced: 09 Feb 2025

https://github.com/senisioi/rolegal

A Spacy Package for Romanian Legal Document Processing

floret legal-documents ner romanian-language spacy

Last synced: 15 Feb 2025

https://github.com/chanind/reddit-words

What have Spacy's sense2vec 2019 word vectors learned from Reddit?

sense2vec spacy spacy-nlp word2vec

Last synced: 26 Mar 2025

https://github.com/neurotech-hq/swahili-ner-spacy

Swahili NER model trained using spacy

ner spacy swahili-ner

Last synced: 20 Feb 2025

https://github.com/bemxio/julia-robotczyk

A Facebook Messenger chatbot based on my classmate's messages

facebook markov-chain markovify messenger nlp python spacy

Last synced: 05 Mar 2025

https://github.com/kasakee/spacy-nlp-node

A library that will expose the parse method of SpaCy to Node.js

natural-language-processing nlp node node-js nodejs spacy spacy-nlp spacy-nlp-node spacy-node

Last synced: 15 Feb 2025

https://github.com/diyclassics/la_senter

Repository for training spaCy-compatible sentence segmenter for Latin

latin nlp spacy

Last synced: 28 Mar 2025

https://github.com/gaving/zorya

:grapes: Build NER graphs from YouTube transcripts

neo4j ner spacy youtube-transcripts

Last synced: 07 Apr 2025

https://github.com/spacexnu/job_finder

Automate Job Search & Analysis Using AI

ai automation django job nlp openai python search spacy

Last synced: 11 Apr 2025

https://github.com/eliask93/debertav3-for-aspect-based-sentiment-analysis

Application for training the pretrained transformer model DeBERTaV3 on an Aspect Based Sentiment Analysis task

amazon-reviews aspect-based-sentiment-analysis deberta deberta-v3 nlp simpletransformers spacy

Last synced: 06 Apr 2025

https://github.com/anushadatta/natural-language-processing

📑 NLP applications with NLTK, spaCy and PyTorch.

natural-language-processing nltk pytorch spacy

Last synced: 30 Mar 2025

https://github.com/conflictingtheories/spacy_ws

Websocket example with Spacy.io

nlp spacy spacy-models spacy-ner websocket

Last synced: 11 Apr 2025

https://github.com/bees4ever/seaqube

Semantic Quality Benchmark for Word Embeddings, i.e. Natural Language Models in Python. Acronym `SeaQuBe` or `seaqube`.

augmentation benchmark fasttext gensim nlp spacy spacy-nlp wordembeddings

Last synced: 21 Apr 2025

https://github.com/jtlicardo/spacy-ner

A demo app that extracts process tasks from text

named-entity-recognition spacy streamlit

Last synced: 15 Nov 2024

https://github.com/martinjack/uaddresspacy

🇺🇦 UAddresspacy | Spacy разборка украинского адреса на типы

address nlp parsing spacy spacy-nlp ukraine

Last synced: 19 Jan 2025

https://github.com/sloev/sentimental-onix

sentiment analysis for spacy pipeline in python

onnx sentiment-analysis spacy spacy-pipeline

Last synced: 12 Apr 2025

https://github.com/amrrs/intro_to_nlp_with_spacy

Introduction to NLP with Spacy - Bangpypers October Talk

nlp python spacy

Last synced: 12 Apr 2025

https://github.com/kahngjoonkoh/inkspect

An online Rorschach inkblot test. Uses NLP to code responses and the Exner system to interpret results.

nlp nltk python rorschach spacy web

Last synced: 05 Mar 2025

https://github.com/jbahire/semantic-similarity

This project gives implemetations of semantic similarity using various text embeddings and you can easily compare results using API provided. Go ahead and build your own API for integration in your use case.

bert elmo machine-learning natural-language-processing semantic-similarity spacy word2vec

Last synced: 06 Apr 2025

https://github.com/turbolent/spacyclient

A Swift client for spaCy

client nlp spacy swift

Last synced: 28 Mar 2025

https://github.com/umactually/papanatas

Papanatas Autómata Multiparadigma IV. El bot oficial de mi server de discord, Sociedad de Patanes.

discord discord-bot discord-py ffmpeg pillow pycord python spacy

Last synced: 24 Mar 2025

https://github.com/nanxstats/pdf-word-extraction

Extract meaningful words from a collection of PDF documents and count their frequencies

ftfy natural-language-processing pypdf research-paper spacy wordcloud

Last synced: 22 Apr 2025

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 13 Mar 2025

https://github.com/shubhamjai9/emotion-based-counsellor-bot

An Artificial Intelligence based Chat Bot using python tools like Numpy, Pandas, Spacy etc. Counsellor Bot will mimic the characteristics and emotion interpretation skills of human and generate response on basis of emotion of engager.

chatbot gradient-boosting-classifier machine-learning naive-bayes-classifier nlp numpy pandas python-2 spacy

Last synced: 13 Mar 2025

https://github.com/bram-code/llm-anonymization

This repository provides utilities for anonymizing, pseudonymizing, and simplifying Dutch text using various NLP techniques.

anonymization dutch large-language-models llm named-entity-recognition ner pseudonymisation simplification spacy

Last synced: 01 Feb 2025

https://github.com/direct-phonology/spacy-och

the old chinese language for spaCy

chinese nlp spacy

Last synced: 05 Apr 2025

https://github.com/gtoffoli/commons-textanalysis

Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.

django nlp python spacy text-analysis

Last synced: 07 May 2025

https://github.com/mfkimbell/reviews-nlp-sentiment-analysis

This project investigates various NLP tools, compares them, and then uses the NLP tool to add a sentiment field to a PostgreSQL database in an efficient batch format.

asyncio asyncpg docker nltk pdm postgresql spacy tensorflow toml transformers yml

Last synced: 16 Mar 2025

https://github.com/etdds/redditquotebot

A Reddit comment bot for detecting and replying to famous quotes.

bot chatbot natural-language-processing nlp praw python reddit spacy

Last synced: 17 Mar 2025

https://github.com/5hirish/django_adam_qas

ADAM - QA -- Front-end using Django and Material Design.

django natural-language-processing python3 question-answering spacy

Last synced: 26 Feb 2025

https://github.com/yarosj/prestige-of-districts

:mag_right: This application parses sites and retrieves data associated with failures of public services to display districts' prestige

amqp apollo-client apollo-server docker-compose graphql mapbox-gl ner neural-network nlp nodejs parsing pika python3 rabbitmq react scraping semantic-ui-react spacy taskscheduler webpack

Last synced: 12 Mar 2025

https://github.com/populated/compare

A simple Python-based code to compare texts for similarities.

comparsion nlp numpy spacy text

Last synced: 29 Mar 2025

https://github.com/tritonix711/ai-content-verifier

AI Content Verifier is a tool that finds out if text is written by AI or humans. It uses machine learning and natural language processing to give clear results and confidence scores. With an easy-to-use interface, it helps everyone from researchers to content creators check if the content is real or not.

git machine-learning nlp nltk numpy pandas python scikit-learn spacy tkinter

Last synced: 09 Jan 2025

https://github.com/yash22222/terrorist-activity-forecasting-and-risk-assessment-system

In an era marked by global security challenges, the "TAFRAS" emerges as a cutting-edge solution to tackle the ever-evolving threat of terrorism. The project is grounded in the urgent need for predictive systems that can anticipate, assess, and mitigate potential terrorist activities.

corpora data-vizualisation folium-maps gensim global-terrorism-database lda machine-learning matplotlib networkx nltk nmf numpy pandas python random-forest-classifier seaborn sklearn spacy textblob vader-sentiment-analysis

Last synced: 24 Feb 2025

https://github.com/teakulo/eventime-app

Eventime App is an event management platform using Angular, Spring Boot, Flask, and PostgreSQL. It offers AI-powered event recommendations, social features, and secure authentication. Users can manage events, chat with a chatbot, and view their calendar.

ai angular authentication calendar chatbot event flask lemmatization nlp nltk postgresql spacy springboot

Last synced: 09 Apr 2025

https://github.com/adriaanbd/kamtutecs-api

A Dockerized API for OCR and NLP using Tesseract, OpenCV, and spaCy.

docker fastapi nlp ocr spacy tesseract translate

Last synced: 09 Apr 2025

https://github.com/inanyan/spacy_pat_match_dsl

A simple DSL for creating spaCy pattern matchers

dsl nlp python spacy

Last synced: 22 Mar 2025

https://github.com/acdh-oeaw/acdh-prodigy-utils

custom loaders for spaCy's prodigy

prodigy spacy

Last synced: 16 Mar 2025

https://github.com/ayushsubedi/choto

CLI tool to generate a summary of news/articles right on your terminal. Also a pip package.

articles bert choto cli click gensim news pip python spacy summary

Last synced: 12 Apr 2025

https://github.com/ljvmiranda921/ud-tagalog-spacy

Training a POS Tagger and Dependency Parser for a Low-Resource Language (Tagalog)

low-resource-languages machine-learning nlp spacy tagalog

Last synced: 28 Mar 2025

https://github.com/crock/forum-ai

Analyzing the internet's web forums with machine learning one site at a time

ai gatsbyjs ml python scrapy spacy

Last synced: 28 Feb 2025

https://github.com/pyladiesams/nlp-beginner-nov2020

Intro to NLP with NLTK, spaCy, and gensim

gensim nlp nlp-machine-learning nltk python spacy

Last synced: 22 Feb 2025

https://github.com/samestrin/llm-services-api

A FastAPI-powered REST API offering a comprehensive suite of natural language processing services using machine learning models with PyTorch and Transformers, packaged in a Docker container to run efficiently.

api docker fastapi hugging-face hugging-face-transformers huggingface-transformers keybert llm openai-compatible-api python python3 pytorch rest rest-api spacy torch transformers uvicorn

Last synced: 05 Apr 2025

https://github.com/kr1shnasomani/webscrub

Python code which extracts the html content, converts it to clean text and pre-processes the text

beautifulsoup html2text natural-language-processing pypi scikit-learn selenium spacy

Last synced: 07 Apr 2025

https://github.com/marmg/moviener

Code for the NER demo. Prepare data, train and extract entities from movie reviews.

extract-entities movie-reviews ner spacy

Last synced: 13 Mar 2025

https://github.com/surajiyer/spacybert

BERT inference (with similar function to hanxiao/bert-as-service) for spaCy with custom extension attributes

bert huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch pytorch-model spacy spacy-extension spacy-pipeline

Last synced: 22 Mar 2025

https://github.com/timuroeztuerk/data-science-lecture-s24

This is the webpage of the Data Science course offered by VWL 7 for the summer semester 2024.

economics natural-language-processing nltk spacy text-classification

Last synced: 24 Feb 2025

https://github.com/herambvd/spoken2written

A source of python package which converts language styles in speech to its equivalent written form.

artificial-intelligence entity machine-learning named-entity-recognition natural-language-processing spacy speech-recognition token-matcher

Last synced: 12 Apr 2025

https://github.com/ajaykumar095/natural_language_processing

Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.

ann nltk-python python rnn spacy tensorflow text-preprocessing textblob

Last synced: 09 Apr 2025

https://github.com/rkirlew/custom-resume-ner-model-development-with-spacy

I developed a custom Named Entity Recognition (NER) model using spaCy. The process involved manually annotating data, training the model, and evaluating its performance on unseen text. This project provided hands-on experience in working with NLP models, data annotation, and model training pipelines.

machine-learning named-entity-recognition ner spacy spacy-nlp

Last synced: 01 Mar 2025

https://github.com/ivan-kleshnin/spacy-benchmarks

Comparison of Spacy performance with different architectures, corpuses, hyperparams...

clearnlp nlp penn-treebank spacy universal-dependencies universaldependencies

Last synced: 07 Mar 2025

https://github.com/isabelleysseric/question-answering

Building a Natural Language Question & Answer Search Engine with corpus in Python language.

corpus deep-learning nlp qa question-answering spacy whoosh

Last synced: 20 Feb 2025

https://github.com/prthd/ai-powered-voice-assisted-object-locator

🔍 Real-time object detection with voice command integration using YOLOv5 (Objects365), OpenCV, MediaPipe, spaCy NLP, and SpeechRecognition. Enhances accessibility by guiding users to locate indoor objects with directional feedback relative to their position. Ideal for smart-home, accessibility tech, and assistive applications.

computer-vision nlp object-detection opencv python real-time-systems spacy speech-recognition voice-assistant yolov5

Last synced: 09 Apr 2025

https://github.com/florensadimer/nlp_ner_soccer_pt-br

Anotação Manual e Comparação com Modelos Treinados

annotation llm machine-learning ner nlp spacy

Last synced: 12 Apr 2025

https://github.com/datarohit/nlp-course-files

The files in this Repo are files for the online NLP-Course from Udemy.com which I completed.

nlp nlp-machine-learning nltk numpy panda python sklearn spacy

Last synced: 09 Apr 2025

https://github.com/keshabkjha/climasense

ClimaSense is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot , built using Python (Streamlit & SpaCy), that responds to weather-related queries.

html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app

Last synced: 31 Mar 2025

https://github.com/araobp/bach-network

J. S. Bach's network with spaCy(NLP)

graphology spacy visjs

Last synced: 11 Mar 2025

https://github.com/lucasspinola/monitorbot-api

API feita com FastApi e Spacy para auxiliar Bot Educacional em suas atividades durante a aula.

fastapi nlp pln spacy

Last synced: 07 Apr 2025

https://github.com/laurenzv/covbot

A small chatbot written as part of my bachelor thesis.

chatbot corenlp covid-19 docker python spacy sqlite vuejs

Last synced: 05 Apr 2025

https://github.com/jtlicardo/process-visualizer-web

Web interface for the process-visualizer project

bert bpmn nlp openai spacy

Last synced: 15 Nov 2024

https://github.com/wesslen/spacy-ecfr-ner

spaCy-Prodigy workflow for NER Citation model on eCFR Banking Regulation

nlp prodigy spacy

Last synced: 06 Apr 2025

https://github.com/snehadharne/vaers-symptomextractionwithai

VAERS Adverse Event Analysis for COVID 19 Vaccine : A hybrid approach combining LLMs (Gemini 1.5 Flash) and statistical methods for enhanced vaccine safety signal detection. Analyzes temporal and associative relationships in VAERS symptom data.

apriori-algorithm associative-analysis gemini-flash ner ollama pandas spacy symptom-analysis temporal-analysis

Last synced: 14 Apr 2025

https://github.com/srstevenson/keyword-extractor

Extract keywords from plain text documents

nlp spacy tf-idf

Last synced: 20 Nov 2024

https://github.com/karimosman89/legal-document-nlp

Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements.Use NLP techniques for named entity recognition and text classification.Streamline the review process for legal teams by automating information extraction.

nltk python scikit-learn spacy

Last synced: 19 Feb 2025

https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques

using a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text

healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp

Last synced: 05 Apr 2025

https://github.com/giuliosmall/twitter-trending-topics-pipeline

This project demonstrates trending topic detection using Apache Spark and MinIO. It processes Twitter JSON data with PySpark, leveraging distributed data processing and cloud storage. The entire project is containerized with Docker for easy deployment across architectures.

docker minio nlp pyspark pytest spacy spark streamlit

Last synced: 30 Mar 2025

https://github.com/navaneethelite/ner_streamlit

A genreal purpose Named Entity Recognition model using Spacy v3. This web app was built using streamlit and deployed to Heroku.

heroku-app nlp spacy

Last synced: 22 Mar 2025

https://github.com/omar7tech/text-summarization

This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.

natural-language-processing python spacy text-summarization tokenization

Last synced: 05 Apr 2025

https://github.com/innerdoc/spacy-for-datashare

Let spaCy do the parsing of Named Entities for documents in the Datashare platform

datashare elasticsearch named-entity-recognition natural-language-processing spacy

Last synced: 14 Mar 2025

https://github.com/whatevery1says/preprocessing

WE1S Preprocessing -- workflow preparing documents for import as WE1S data

digital-humanities humanities news nltk preprocessing spacy topic-modeling

Last synced: 04 Mar 2025

https://github.com/alessandromonolo/descriptive-texts-classification-by-usage-purposes-of-estate-properties

The project aims to identify the best model for the classification of texts derived from descriptions of assets subject to Italian judicial auctions. The employed models include both conventional models, such as Logistic Regression, Naive Bayes, SVM, and XGBoost, and neural network models, such as Fasttext and XLM-Roberta.

fasttext logistic-regression naive-bayes nlp python pytorch scikit-learn seaborn spacy svm text-classification tfidf tokenizer xgboost xlm-roberta

Last synced: 18 Feb 2025

https://github.com/aditya172926/text_summarization

Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.

glove machine-learning nlp python scikit-learn spacy

Last synced: 18 Mar 2025

https://github.com/md-emon-hasan/nlp-codebasics

Collection of basic Natural Language Processing examples that cover essential techniques like tokenization, text representation, and text classification.

bag-of-words bow gensim gensim-word2vec lematization nlp nlp-library nlp-machine-learning nltk nltk-python python3 spacy text-classification text-processing tokenization

Last synced: 22 Feb 2025

https://github.com/jonathanfox5/lemon_tizer

LemonTizer is a class that wraps the spacy library to build a lemmatizer for language learning applications.

lemmatization lemmatizer spacy wrapper

Last synced: 10 Apr 2025

https://github.com/izuna385/pubtator-multiprocess-parser

Specifically for Entity Linking. Quick demo with MedMentions and NCBI datasets is also included.

allennlp bioinformatics entity-disambiguation entity-linking natural-language-processing pubtator spacy

Last synced: 28 Mar 2025

https://github.com/kailejie/ner

This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.

bert ner nltk spacy

Last synced: 05 Apr 2025

https://github.com/arjunravi26/chatbot-ai

A chatbot for responding to AI related queries

langchain langchain-community pinecone python rag regrex spacy stramlit

Last synced: 23 Feb 2025

https://github.com/ccoreilly/spacy-catala-generator

Training and dataset used for the catalan spacy model

catala catalan catalan-language spacy spacy-models

Last synced: 04 Apr 2025

spaCy Awesome Lists
spaCy Categories