Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. Itโ€™s designed specifically for production use and helps you build applications that process and โ€œunderstandโ€ large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/oarriaga/luvina

High-level Natural Language Processing (NLP) for Python.

natural-language-processing nlp nltk python spacy

Last synced: 14 Oct 2024

https://github.com/autonomio/signs

A suite of tools for text preparation, vectorization and processing for deep learning with Keras.

embeddings fasttext gensim glove keras spacy word2vec

Last synced: 14 Oct 2024

https://github.com/dcavar/spacy-json-nlp

spaCy wrapper for JSON-NLP.

json natural-language-processing nlp spacy

Last synced: 18 Oct 2024

https://github.com/chaitjo/knowledge-graphs

Building Knowledge Graphs from Unstructured Text

knowledge-graph networkx neuralcoref spacy unstructured-data wikipedia

Last synced: 25 Oct 2024

https://github.com/gatoreducator/gatorminer

A visualized text mining and analysis tool for student markdown reflection documents based on Natural language processing in the Dept of CS at Allegheny College.

nlp spacy streamlit textmining

Last synced: 12 Oct 2024

https://github.com/d-one/nlpeasy

Easy Peasy Language Squeezy

datascience elasticsearch kibana nlp spacy

Last synced: 14 Oct 2024

https://github.com/fako/spacy_arguing_lexicon

A spaCy extension wrapping around the arguing lexicon by MPQA

argument-mining argumentation spacy spacy-extension

Last synced: 14 Oct 2024

https://github.com/bikatr7/kudasai

Streamlining Japanese-English Translation with Advanced Preprocessing and Integrated Translation Technologies

auto-translation chatgpt deepl gemini japanese-english japanese-english-translation japanese-translation machine-learning machine-translation nlp-preprocessing python spacy text-processing translation

Last synced: 18 Oct 2024

https://github.com/joshday/spacy.jl

Get up and running with Python's spaCy inside Julia

julia natural-language-processing python spacy

Last synced: 11 Oct 2024

https://github.com/jdagdelen/mondigy

A small component for using Mongodb databases with Prodigy annotation applications.

annotations mongodb natural-language-processing prodigy spacy spacy-nlp

Last synced: 14 Oct 2024

https://github.com/andrewrosss/rake-spacy

Python implementation of the Rapid Automatic Keyword Extraction algorithm using spaCy

algorithm keyword-extraction ml nlp python rake rake-nltk spacy

Last synced: 14 Oct 2024

https://github.com/martinomensio/it_vectors_wiki_spacy

Word embeddings for Italian language, spacy2 prebuilt model

embeddings glove italian model pretrained spacy spacy2 wordvectors

Last synced: 19 Oct 2024

https://github.com/wjbmattingly/tap-2024-spacy-llms

This is the repository for my 2024 Tap Institute Course on spaCy with LLMs

nlp spacy

Last synced: 14 Oct 2024

https://github.com/wjbmattingly/keyword-spacy

Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.

keyword-extraction nlp spacy

Last synced: 14 Oct 2024

https://github.com/davebulaval/spacy-language-detection

Fully customizable language detection for spaCy pipeline

language-detection nlp spacy spacy-extension

Last synced: 30 Sep 2024

https://github.com/lll-lll-lll-lll/sent-pattern

sent-pattern package categorizes English sentences into one of five basic sentence patterns.

japanese nlp portfolio python spacy

Last synced: 14 Oct 2024

https://github.com/wjbmattingly/bagpipes-spacy

Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.

nlp spacy

Last synced: 10 Oct 2024

https://github.com/nikhiljsk/preprocess_nlp

A fast framework for pre-processing (Cleaning text, Reduction of vocabulary, Feature extraction and Vectorization). Implemented with parallel processing using custom number of processes.

cleaning-data feature-extraction glove natural-language-processing nlp parallel-processing preprocess python3 reduction spacy stages tfidf vectorization word2vec

Last synced: 14 Oct 2024

https://github.com/centre-for-humanities-computing/conspiracies

A python package for discovering and examining conspiracies using NLP.

conspiracies conspiracy knowledge-graph nlp spacy

Last synced: 14 Oct 2024

https://github.com/sskorol/ner-spacy-doccano

NER using Doccano / Spacy EN

doccano ner python spacy

Last synced: 12 Oct 2024

https://github.com/cyclecycle/role-pattern-nlp

Build and match patterns for semantic role labelling / information extraction with SpaCy

nlp python semantic-role-labeling spacy

Last synced: 12 Oct 2024

https://github.com/cyclecycle/visualise-spacy-tree

Create dependency tree plots from SpaCy Doc objects

nlp python spacy

Last synced: 14 Oct 2024

https://github.com/nineinchnick/displacy

Python port of https://github.com/explosion/displacy

css natural-language-processing nlp python spacy svg visualization

Last synced: 13 Oct 2024

https://github.com/cvcio/mediawatch

Empowering news organizations to fight disinformation

ai elas golang grpc kafka misinformation neo4j network-analysis nodejs python spacy transformers

Last synced: 23 Oct 2024

https://github.com/plandes/nlparse

Natural language processing parsing and tool library

natural-language-processing nlp-machine-learning pypi-badge pypi-link spacy spacy-nlp

Last synced: 12 Oct 2024

https://github.com/ljvmiranda921/spacy-span-analyzer

Simple tool to analyze spans in your dataset. Implementation of Papay et al's work (EMNLP 2020) on span performance prediction

machine-learning natural-language-processing nlp spacy

Last synced: 30 Sep 2024

https://github.com/turbolent/spacykit

Industrial-strength Natural Language Processing (NLP) with Swift

natural-language-processing nlp spacy swift

Last synced: 19 Oct 2024

https://github.com/papachristoumarios/capbib

:book: Bibliography transformations made easier with NLP

bibtex nlp spacy

Last synced: 11 Oct 2024

https://github.com/opensemanticsearch/spacy-services.deb

Debian & Ubuntu package for REST microservices for spaCy natural language processing and machine learning framework for named entity recognition

api debian debian-packages named-entity-recognition natural-language-processing nlp-machine-learning python spacy spacy-nlp

Last synced: 11 Oct 2024

https://github.com/johnfraney/django-ner-trainer

Tools for training spaCy Named Entity Recognition models in Django

django django-rest-framework named-entity-recognition natural-language-processing spacy

Last synced: 14 Oct 2024

https://github.com/explosion/spacy-legacy

๐Ÿ•ธ๏ธ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility

spacy

Last synced: 07 Oct 2024

https://github.com/ninadpatil09/nlp-notebooks

Explore NLP tasks with Python using NLTK, SpaCy & scikit-learn: Tokenization, Normalization, NER, POS tagging, Encoding, Word embedding.

natural-language-processing nlp nlp-machine-learning nltk python spacy

Last synced: 14 Oct 2024

https://github.com/wjbmattingly/number-spacy

Number spaCy is a custom spaCy pipeline component that enhances the identification of number entities in text and fetches the parsed numeric values using spaCy's token extensions.

nlp spacy

Last synced: 12 Oct 2024

https://github.com/kanishk3813/intel_sentiment_analysis

Intel Review Analyzer is a powerful tool designed to help businesses understand customer sentiments through automated analysis of reviews. This project leverages state-of-the-art NLP techniques to classify reviews, highlight key sentiments, generate word clouds, and visualize trends over time.

axios bert-model cors deep-learning flask pandas python react spacy

Last synced: 14 Oct 2024

https://github.com/bikatr7/kairyou

Quickly preprocesses Japanese text using NLP/NER from SpaCy for Japanese translation or other NLP tasks.

japanese ner nlp preprocess spacy

Last synced: 18 Oct 2024

https://github.com/surajiyer/spacycake

Simple keyphrase extraction extensions and pipeline components for spaCy.

keyphrase-extraction natural-language-processing nlp spacy spacy-extension spacy-pipeline

Last synced: 10 Oct 2024

https://github.com/jbahire/semantic-similarity

This project gives implemetations of semantic similarity using various text embeddings and you can easily compare results using API provided. Go ahead and build your own API for integration in your use case.

bert elmo machine-learning natural-language-processing semantic-similarity spacy word2vec

Last synced: 31 Oct 2024

https://github.com/gtoffoli/spacy-ar_core_news_md

Unofficial Arabic language model for spaCy

arabic-language camel nlp python spacy spacy-pipeline tokenizer

Last synced: 14 Oct 2024

https://github.com/chanind/reddit-words

What have Spacy's sense2vec 2019 word vectors learned from Reddit?

sense2vec spacy spacy-nlp word2vec

Last synced: 15 Oct 2024

https://github.com/nluninja/text-mining-dataviz

Data Visualization and Text Mining course - UNICATT

embeddings lstm nlp spacy text-mining transformers

Last synced: 25 Oct 2024

https://github.com/bees4ever/seaqube

Semantic Quality Benchmark for Word Embeddings, i.e. Natural Language Models in Python. Acronym `SeaQuBe` or `seaqube`.

augmentation benchmark fasttext gensim nlp spacy spacy-nlp wordembeddings

Last synced: 18 Oct 2024

https://github.com/diyclassics/la_senter

Repository for training spaCy-compatible sentence segmenter for Latin

latin nlp spacy

Last synced: 19 Oct 2024

https://github.com/louisguitton/spacy-lancedb-linker

spaCy pipeline component for ANN Entity Linking using LanceDB

ann entity-linking lancedb spacy spacy-pipeline

Last synced: 12 Oct 2024

https://github.com/sloev/sentimental-onix

sentiment analysis for spacy pipeline in python

onnx sentiment-analysis spacy spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/herambvd/spoken2written

A source of python package which converts language styles in speech to its equivalent written form.

artificial-intelligence entity machine-learning named-entity-recognition natural-language-processing spacy speech-recognition token-matcher

Last synced: 14 Oct 2024

https://github.com/nickcrews/spacy-address

Parse oneline US addresses using a spaCy NER model trained on OSM data

address address-parsing osm osm-data spacy spacy-nlp usaddress

Last synced: 20 Oct 2024

https://github.com/kasakee/spacy-nlp-node

A library that will expose the parse method of SpaCy to Node.js

natural-language-processing nlp node node-js nodejs spacy spacy-nlp spacy-nlp-node spacy-node

Last synced: 12 Oct 2024

https://github.com/martinjack/uaddresspacy

๐Ÿ‡บ๐Ÿ‡ฆ UAddresspacy | Spacy ั€ะฐะทะฑะพั€ะบะฐ ัƒะบั€ะฐะธะฝัะบะพะณะพ ะฐะดั€ะตัะฐ ะฝะฐ ั‚ะธะฟั‹

address nlp parsing spacy spacy-nlp ukraine

Last synced: 26 Sep 2024

https://github.com/senisioi/rolegal

A Spacy Package for Romanian Legal Document Processing

floret legal-documents ner romanian-language spacy

Last synced: 12 Oct 2024

https://github.com/direct-phonology/spacy-och

the old chinese language for spaCy

chinese nlp spacy

Last synced: 12 Oct 2024

https://github.com/adriaanbd/kamtutecs-api

A Dockerized API for OCR and NLP using Tesseract, OpenCV, and spaCy.

docker fastapi nlp ocr spacy tesseract translate

Last synced: 27 Oct 2024

https://github.com/riyajha2305/healthcare-diagnosis-chatbot-ms-hackathon

Built a chatbot capable of diagnosing common medical conditions based on user symptoms input. Utilized pre-trained machine learning models such NLP and NER from Huggingface and Spacy, trained on medical data to provide accurate suggestions and recommendations for further action.

hackathon healthcare-chatbot huggingface machine-learning ner nlp python spacy tkinter

Last synced: 09 Oct 2024

https://github.com/surajiyer/spacybert

BERT inference (with similar function to hanxiao/bert-as-service) for spaCy with custom extension attributes

bert huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch pytorch-model spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/timuroeztuerk/data-science-lecture-s24

This is the webpage of the Data Science course offered by VWL 7 for the summer semester 2024.

economics natural-language-processing nltk spacy text-classification

Last synced: 14 Oct 2024

https://github.com/turbolent/spacyclient

A Swift client for spaCy

client nlp spacy swift

Last synced: 19 Oct 2024

https://github.com/populated/compare

A simple Python-based code to compare texts for similarities.

comparsion nlp numpy spacy text

Last synced: 14 Oct 2024

https://github.com/jamnicki/metin2_vision_bot

Automatic MMORPG Bot for Dungeons Massive Passing based on Windows API, YoloV8 object detection, statistical methods from OpenCV, Tesseract-OCR and spaCy virtualised on Hyper-V, Win11+CUDA

computer-vision object-detection opencv spacy tesseract-ocr torchvision ultralytics win32

Last synced: 03 Nov 2024

https://github.com/inanyan/spacy_pat_match_dsl

A simple DSL for creating spaCy pattern matchers

dsl nlp python spacy

Last synced: 14 Oct 2024

https://github.com/gtoffoli/commons-textanalysis

Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.

django nlp python spacy text-analysis

Last synced: 07 Aug 2024

https://github.com/gaving/zorya

:grapes: Build NER graphs from YouTube transcripts

neo4j ner spacy youtube-transcripts

Last synced: 27 Oct 2024

https://github.com/ljvmiranda921/ud-tagalog-spacy

Training a POS Tagger and Dependency Parser for a Low-Resource Language (Tagalog)

low-resource-languages machine-learning nlp spacy tagalog

Last synced: 19 Oct 2024

https://github.com/teakulo/eventime-app

Eventime App is an event management platform using Angular, Spring Boot, Flask, and PostgreSQL. It offers AI-powered event recommendations, social features, and secure authentication. Users can manage events, chat with a chatbot, and view their calendar.

ai angular authentication calendar chatbot event flask lemmatization nlp nltk postgresql spacy springboot

Last synced: 14 Oct 2024

https://github.com/ayushsubedi/choto

CLI tool to generate a summary of news/articles right on your terminal. Also a pip package.

articles bert choto cli click gensim news pip python spacy summary

Last synced: 14 Oct 2024

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 14 Oct 2024

https://github.com/ccoreilly/spacy-catala-generator

Training and dataset used for the catalan spacy model

catala catalan catalan-language spacy spacy-models

Last synced: 30 Oct 2024

https://github.com/laurenzv/covbot

A small chatbot written as part of my bachelor thesis.

chatbot corenlp covid-19 docker python spacy sqlite vuejs

Last synced: 31 Oct 2024

https://github.com/jblake1965/elucidoc

Screens legal text and extracts sentences containing user input party name-predicate phrases

excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word

Last synced: 12 Oct 2024

https://github.com/henx117/chatbot

My chatbot python project

chatbot python python3 spacy

Last synced: 14 Oct 2024

https://github.com/omar7tech/text-summarization

This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.

natural-language-processing python spacy text-summarization tokenization

Last synced: 31 Oct 2024

https://github.com/aiatyourservice/deeplearningforcoders

Hey, this repo contains code from deep learning specialization by Andrew NG

deep-learning nltk python pytorch spacy

Last synced: 14 Oct 2024

https://github.com/fferegrino/zeldakg

A TLOZ inspired knowledge graph

infobox knowledge-graph nltk pandas python spacy wikidata

Last synced: 28 Oct 2024

https://github.com/lucasspinola/monitorbot-api

API feita com FastApi e Spacy para auxiliar Bot Educacional em suas atividades durante a aula.

fastapi nlp pln spacy

Last synced: 03 Nov 2024

https://github.com/jash271/youglance

Package for analyzing Youtube Videos from searching by relevant entities to analyzing sentiments and clustering different parts of the video according to your liking

cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling

Last synced: 14 Oct 2024

https://github.com/farahibrar/programming-in-python

Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.

beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow

Last synced: 15 Oct 2024

https://github.com/sudip-13/nlp

This repo for tutorial NLP dialog flow chat bot back end configured

dialogflow fastapi fasttext mogodb ner regex spacy tf-idf

Last synced: 14 Oct 2024

https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system

The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....

cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer

Last synced: 14 Oct 2024

https://github.com/gugarosa/brainy

๐Ÿง  An intelligent Python-inspired Machine Learning API for training NLP-based models.

api machine-learning nlp python spacy

Last synced: 18 Oct 2024

https://github.com/aadityasivas/spacy-text-summarization

A simple text summarizer built with spaCy

jupyter-notebook nlp python spacy

Last synced: 05 Nov 2024

https://github.com/tomhalloin/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 03 Aug 2024

https://github.com/toshimelonhead/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 13 Aug 2024

https://github.com/bghorvath/TextMiningTheBechdelTest

Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test

bechdel bechdel-test coreference-resolution neuralcoref spacy

Last synced: 03 Aug 2024

spaCy Awesome Lists
spaCy Categories