An open API service indexing awesome lists of open source software.

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/explosion/spacy-legacy

🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility

spacy

Last synced: 04 Feb 2025

https://github.com/surajiyer/spacycake

Simple keyphrase extraction extensions and pipeline components for spaCy.

keyphrase-extraction natural-language-processing nlp spacy spacy-extension spacy-pipeline

Last synced: 09 Feb 2025

https://github.com/cloudera/cml_amp_spacy_entity_extraction

A Jupyter notebook demonstrating entity extraction on headlines with SpaCy.

entity-extraction named-entity-recognition nlp spacy

Last synced: 13 Apr 2025

https://github.com/f1uctus/p4a-recipes

📱 🐍 A collection of recipes for p4a (Python for Android).

android blis docker numpy p4a python python-for-android spacy

Last synced: 16 Jan 2025

https://github.com/jparedesds/tensorflow-twitter-sentiment-analysis

Sentiment analysis study through tweets with TensorFlow

matplotlib nltk pandas regex seaborn sklearn spacy tensorflow workcloud

Last synced: 13 May 2025

https://github.com/jbahire/semantic-similarity

This project gives implemetations of semantic similarity using various text embeddings and you can easily compare results using API provided. Go ahead and build your own API for integration in your use case.

bert elmo machine-learning natural-language-processing semantic-similarity spacy word2vec

Last synced: 06 Apr 2025

https://github.com/gaving/zorya

:grapes: Build NER graphs from YouTube transcripts

neo4j ner spacy youtube-transcripts

Last synced: 07 Apr 2025

https://github.com/conflictingtheories/spacy_ws

Websocket example with Spacy.io

nlp spacy spacy-models spacy-ner websocket

Last synced: 11 Apr 2025

https://github.com/neurotech-hq/swahili-ner-spacy

Swahili NER model trained using spacy

ner spacy swahili-ner

Last synced: 20 Feb 2025

https://github.com/eliask93/debertav3-for-aspect-based-sentiment-analysis

Application for training the pretrained transformer model DeBERTaV3 on an Aspect Based Sentiment Analysis task

amazon-reviews aspect-based-sentiment-analysis deberta deberta-v3 nlp simpletransformers spacy

Last synced: 06 Apr 2025

https://github.com/amrrs/intro_to_nlp_with_spacy

Introduction to NLP with Spacy - Bangpypers October Talk

nlp python spacy

Last synced: 12 Apr 2025

https://github.com/bees4ever/seaqube

Semantic Quality Benchmark for Word Embeddings, i.e. Natural Language Models in Python. Acronym `SeaQuBe` or `seaqube`.

augmentation benchmark fasttext gensim nlp spacy spacy-nlp wordembeddings

Last synced: 21 Apr 2025

https://github.com/anushadatta/natural-language-processing

📑 NLP applications with NLTK, spaCy and PyTorch.

natural-language-processing nltk pytorch spacy

Last synced: 30 Mar 2025

https://github.com/senisioi/rolegal

A Spacy Package for Romanian Legal Document Processing

floret legal-documents ner romanian-language spacy

Last synced: 15 Feb 2025

https://github.com/chanind/reddit-words

What have Spacy's sense2vec 2019 word vectors learned from Reddit?

sense2vec spacy spacy-nlp word2vec

Last synced: 26 Mar 2025

https://github.com/martinjack/uaddresspacy

🇺🇦 UAddresspacy | Spacy разборка украинского адреса на типы

address nlp parsing spacy spacy-nlp ukraine

Last synced: 19 Jan 2025

https://github.com/kasakee/spacy-nlp-node

A library that will expose the parse method of SpaCy to Node.js

natural-language-processing nlp node node-js nodejs spacy spacy-nlp spacy-nlp-node spacy-node

Last synced: 15 Feb 2025

https://github.com/jtlicardo/spacy-ner

A demo app that extracts process tasks from text

named-entity-recognition spacy streamlit

Last synced: 09 May 2025

https://github.com/bemxio/julia-robotczyk

A Facebook Messenger chatbot based on my classmate's messages

facebook markov-chain markovify messenger nlp python spacy

Last synced: 05 Mar 2025

https://github.com/sloev/sentimental-onix

sentiment analysis for spacy pipeline in python

onnx sentiment-analysis spacy spacy-pipeline

Last synced: 12 Apr 2025

https://github.com/spacexnu/job_finder

Automate Job Search & Analysis Using AI

ai automation django job nlp openai python search spacy

Last synced: 11 Apr 2025

https://github.com/diyclassics/la_senter

Repository for training spaCy-compatible sentence segmenter for Latin

latin nlp spacy

Last synced: 28 Mar 2025

https://github.com/kahngjoonkoh/inkspect

An online Rorschach inkblot test. Uses NLP to code responses and the Exner system to interpret results.

nlp nltk python rorschach spacy web

Last synced: 05 Mar 2025

https://github.com/yarosj/prestige-of-districts

:mag_right: This application parses sites and retrieves data associated with failures of public services to display districts' prestige

amqp apollo-client apollo-server docker-compose graphql mapbox-gl ner neural-network nlp nodejs parsing pika python3 rabbitmq react scraping semantic-ui-react spacy taskscheduler webpack

Last synced: 12 Mar 2025

https://github.com/marmg/moviener

Code for the NER demo. Prepare data, train and extract entities from movie reviews.

extract-entities movie-reviews ner spacy

Last synced: 13 Mar 2025

https://github.com/pyladiesams/nlp-beginner-nov2020

Intro to NLP with NLTK, spaCy, and gensim

gensim nlp nlp-machine-learning nltk python spacy

Last synced: 22 Feb 2025

https://github.com/direct-phonology/spacy-och

the old chinese language for spaCy

chinese nlp spacy

Last synced: 05 Apr 2025

https://github.com/acdh-oeaw/acdh-prodigy-utils

custom loaders for spaCy's prodigy

prodigy spacy

Last synced: 16 Mar 2025

https://github.com/adriaanbd/kamtutecs-api

A Dockerized API for OCR and NLP using Tesseract, OpenCV, and spaCy.

docker fastapi nlp ocr spacy tesseract translate

Last synced: 09 Apr 2025

https://github.com/teakulo/eventime-app

Eventime App is an event management platform using Angular, Spring Boot, Flask, and PostgreSQL. It offers AI-powered event recommendations, social features, and secure authentication. Users can manage events, chat with a chatbot, and view their calendar.

ai angular authentication calendar chatbot event flask lemmatization nlp nltk postgresql spacy springboot

Last synced: 09 Apr 2025

https://github.com/5hirish/django_adam_qas

ADAM - QA -- Front-end using Django and Material Design.

django natural-language-processing python3 question-answering spacy

Last synced: 26 Feb 2025

https://github.com/samestrin/llm-services-api

A FastAPI-powered REST API offering a comprehensive suite of natural language processing services using machine learning models with PyTorch and Transformers, packaged in a Docker container to run efficiently.

api docker fastapi hugging-face hugging-face-transformers huggingface-transformers keybert llm openai-compatible-api python python3 pytorch rest rest-api spacy torch transformers uvicorn

Last synced: 05 Apr 2025

https://github.com/tritonix711/ai-content-verifier

AI Content Verifier is a tool that finds out if text is written by AI or humans. It uses machine learning and natural language processing to give clear results and confidence scores. With an easy-to-use interface, it helps everyone from researchers to content creators check if the content is real or not.

git machine-learning nlp nltk numpy pandas python scikit-learn spacy tkinter

Last synced: 09 Jan 2025

https://github.com/ayushsubedi/choto

CLI tool to generate a summary of news/articles right on your terminal. Also a pip package.

articles bert choto cli click gensim news pip python spacy summary

Last synced: 12 Apr 2025

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 13 Mar 2025

https://github.com/gtoffoli/commons-textanalysis

Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.

django nlp python spacy text-analysis

Last synced: 07 May 2025

https://github.com/populated/compare

A simple Python-based code to compare texts for similarities.

comparsion nlp numpy spacy text

Last synced: 29 Mar 2025

https://github.com/shubhamjai9/emotion-based-counsellor-bot

An Artificial Intelligence based Chat Bot using python tools like Numpy, Pandas, Spacy etc. Counsellor Bot will mimic the characteristics and emotion interpretation skills of human and generate response on basis of emotion of engager.

chatbot gradient-boosting-classifier machine-learning naive-bayes-classifier nlp numpy pandas python-2 spacy

Last synced: 13 Mar 2025

https://github.com/kr1shnasomani/webscrub

Python code which extracts the html content, converts it to clean text and pre-processes the text

beautifulsoup html2text natural-language-processing pypi scikit-learn selenium spacy

Last synced: 07 Apr 2025

https://github.com/bram-code/llm-anonymization

This repository provides utilities for anonymizing, pseudonymizing, and simplifying Dutch text using various NLP techniques.

anonymization dutch large-language-models llm named-entity-recognition ner pseudonymisation simplification spacy

Last synced: 01 Feb 2025

https://github.com/herambvd/spoken2written

A source of python package which converts language styles in speech to its equivalent written form.

artificial-intelligence entity machine-learning named-entity-recognition natural-language-processing spacy speech-recognition token-matcher

Last synced: 12 Apr 2025

https://github.com/umactually/papanatas

Papanatas Autómata Multiparadigma IV. El bot oficial de mi server de discord, Sociedad de Patanes.

discord discord-bot discord-py ffmpeg pillow pycord python spacy

Last synced: 24 Mar 2025

https://github.com/turbolent/spacyclient

A Swift client for spaCy

client nlp spacy swift

Last synced: 28 Mar 2025

https://github.com/mfkimbell/reviews-nlp-sentiment-analysis

This project investigates various NLP tools, compares them, and then uses the NLP tool to add a sentiment field to a PostgreSQL database in an efficient batch format.

asyncio asyncpg docker nltk pdm postgresql spacy tensorflow toml transformers yml

Last synced: 16 Mar 2025

https://github.com/surajiyer/spacybert

BERT inference (with similar function to hanxiao/bert-as-service) for spaCy with custom extension attributes

bert huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch pytorch-model spacy spacy-extension spacy-pipeline

Last synced: 22 Mar 2025

https://github.com/crock/forum-ai

Analyzing the internet's web forums with machine learning one site at a time

ai gatsbyjs ml python scrapy spacy

Last synced: 28 Feb 2025

https://github.com/inanyan/spacy_pat_match_dsl

A simple DSL for creating spaCy pattern matchers

dsl nlp python spacy

Last synced: 22 Mar 2025

https://github.com/nanxstats/pdf-word-extraction

Extract meaningful words from a collection of PDF documents and count their frequencies

ftfy natural-language-processing pypdf research-paper spacy wordcloud

Last synced: 22 Apr 2025

https://github.com/yash22222/terrorist-activity-forecasting-and-risk-assessment-system

In an era marked by global security challenges, the "TAFRAS" emerges as a cutting-edge solution to tackle the ever-evolving threat of terrorism. The project is grounded in the urgent need for predictive systems that can anticipate, assess, and mitigate potential terrorist activities.

corpora data-vizualisation folium-maps gensim global-terrorism-database lda machine-learning matplotlib networkx nltk nmf numpy pandas python random-forest-classifier seaborn sklearn spacy textblob vader-sentiment-analysis

Last synced: 24 Feb 2025

https://github.com/etdds/redditquotebot

A Reddit comment bot for detecting and replying to famous quotes.

bot chatbot natural-language-processing nlp praw python reddit spacy

Last synced: 17 Mar 2025

https://github.com/timuroeztuerk/data-science-lecture-s24

This is the webpage of the Data Science course offered by VWL 7 for the summer semester 2024.

economics natural-language-processing nltk spacy text-classification

Last synced: 24 Feb 2025

https://github.com/ljvmiranda921/ud-tagalog-spacy

Training a POS Tagger and Dependency Parser for a Low-Resource Language (Tagalog)

low-resource-languages machine-learning nlp spacy tagalog

Last synced: 28 Mar 2025

https://github.com/2pa4ul2/mcq-quiz-maker-nlp

Quizzable a quiz generator for short reviews with Spacy and NLTK

flask nlp nltk python question-generation quizapp spacy

Last synced: 05 Apr 2025

https://github.com/oroszgy/spacy-tokenizer-benchmark

Quick and dirty scripts to measure the performance of spaCy

benchmark natural-language-processing nlp python spacy tokenizer

Last synced: 28 Mar 2025

https://github.com/kailejie/ner

This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.

bert ner nltk spacy

Last synced: 05 Apr 2025

https://github.com/omar7tech/text-summarization

This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.

natural-language-processing python spacy text-summarization tokenization

Last synced: 05 Apr 2025

https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques

using a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text

healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp

Last synced: 05 Apr 2025

https://github.com/arjunravi26/chatbot-ai

A chatbot for responding to AI related queries

langchain langchain-community pinecone python rag regrex spacy stramlit

Last synced: 23 Feb 2025

https://github.com/randika00/ism-web-automation-y23cp-web

Web scraping refers to the extraction of data from a website. Be it a spreadsheet or an API.

2captcha-api beautifulsoup regex scrapy selenium spacy webdriver

Last synced: 28 Mar 2025

https://github.com/jtlicardo/process-visualizer-web

Web interface for the process-visualizer project

bert bpmn nlp openai spacy

Last synced: 09 May 2025

https://github.com/tomhalloin/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 10 May 2025

https://github.com/bghorvath/TextMiningTheBechdelTest

Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test

bechdel bechdel-test coreference-resolution neuralcoref spacy

Last synced: 09 May 2025

https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system

The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....

cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer

Last synced: 09 Apr 2025

https://github.com/sukanyadutta52/sentiment-analysis

An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis

flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard

Last synced: 28 Mar 2025

https://github.com/bonysmoke/speliuk

A more accurate spelling correction for the Ukrainian language.

correction kenlm spacy spelling symspell ukrainian

Last synced: 09 Feb 2025

https://github.com/sudip-13/nlp

This repo for tutorial NLP dialog flow chat bot back end configured

dialogflow fastapi fasttext mogodb ner regex spacy tf-idf

Last synced: 29 Mar 2025

https://github.com/gugarosa/brainy

🧠 An intelligent Python-inspired Machine Learning API for training NLP-based models.

api machine-learning nlp python spacy

Last synced: 28 Mar 2025

https://github.com/Keshabkjha/ClimaSense

ClimaSense is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot , built using Python (Streamlit & SpaCy), that responds to weather-related queries.

html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app

Last synced: 13 Mar 2025

https://github.com/charlesyuan02/named_entity_recognition

Utilizing Spacy and Tensorflow to train custom Named Entity Recognizers.

conll-2003 named-entity-recognition ner nlp spacy transformer

Last synced: 06 Apr 2025

https://github.com/mbfakourii/nlp-ner

Implement Ner in nlp

ner nlp python spacy spacy-nlp

Last synced: 29 Mar 2025

https://github.com/whatevery1says/preprocessing

WE1S Preprocessing -- workflow preparing documents for import as WE1S data

digital-humanities humanities news nltk preprocessing spacy topic-modeling

Last synced: 04 Mar 2025

https://github.com/izuna385/arxiv-checker

Single Page Application and its deployment for GCE.

docker docker-compose fastapi nginx react react-bootstrap spacy tdd

Last synced: 28 Mar 2025

https://github.com/surajiyer/topic-analysis

Python library to perform topic detection on textual data that are generated over time.

agglomerative-clustering gaussian-mixture-models nlp spacy spectral-clustering textual-data topic-analysis topic-modeling

Last synced: 29 Mar 2025

https://github.com/snehadharne/vaers-symptomextractionwithai

VAERS Adverse Event Analysis for COVID 19 Vaccine : A hybrid approach combining LLMs (Gemini 1.5 Flash) and statistical methods for enhanced vaccine safety signal detection. Analyzes temporal and associative relationships in VAERS symptom data.

apriori-algorithm associative-analysis gemini-flash ner ollama pandas spacy symptom-analysis temporal-analysis

Last synced: 14 Apr 2025

https://github.com/ccoreilly/spacy-catala-generator

Training and dataset used for the catalan spacy model

catala catalan catalan-language spacy spacy-models

Last synced: 04 Apr 2025

https://github.com/jash271/youglance

Package for analyzing Youtube Videos from searching by relevant entities to analyzing sentiments and clustering different parts of the video according to your liking

cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling

Last synced: 22 Mar 2025

https://github.com/aiatyourservice/deeplearningforcoders

Hey, this repo contains code from deep learning specialization by Andrew NG

deep-learning nltk python pytorch spacy

Last synced: 29 Mar 2025

https://github.com/udit-rawat/whisper-space

An ASR Gradio GUI based project that transcript the audion and provides NLP based analysis.

asr gradio nlp spacy whisper

Last synced: 22 Mar 2025

https://github.com/wesslen/spacy-ecfr-ner

spaCy-Prodigy workflow for NER Citation model on eCFR Banking Regulation

nlp prodigy spacy

Last synced: 06 Apr 2025

https://github.com/florensadimer/nlp_ner_soccer_pt-br

Anotação Manual e Comparação com Modelos Treinados

annotation llm machine-learning ner nlp spacy

Last synced: 12 Apr 2025

spaCy Awesome Lists
spaCy Categories