Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It's designed specifically for production use and helps you build applications that process and "understand" large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization
- Last updated: 2024-11-05 00:28:59 UTC
https://github.com/pythainlp/spacy-pythainlp
PyThaiNLP For spaCy
nlp-library python spacy spacy-extensions
Last synced: 14 Oct 2024
https://github.com/riccorl/ipa
NLP Preprocessing Pipeline Wrappers
lemmatization model natural-language-processing nlp part-of-speech-tagger pipeline preprocessing spacy stanza tagging token tokenizer wrapper
Last synced: 14 Oct 2024
https://github.com/oarriaga/luvina
High-level Natural Language Processing (NLP) for Python.
natural-language-processing nlp nltk python spacy
Last synced: 14 Oct 2024
https://github.com/autonomio/signs
A suite of tools for text preparation, vectorization and processing for deep learning with Keras.
embeddings fasttext gensim glove keras spacy word2vec
Last synced: 14 Oct 2024
https://github.com/dcavar/spacy-json-nlp
spaCy wrapper for JSON-NLP.
json natural-language-processing nlp spacy
Last synced: 18 Oct 2024
https://github.com/explosion/spacy-loggers
Logging utilities for spaCy
logging machine-learning natural-language-processing nlp python spacy
Last synced: 07 Oct 2024
https://github.com/chaitjo/knowledge-graphs
Building Knowledge Graphs from Unstructured Text
knowledge-graph networkx neuralcoref spacy unstructured-data wikipedia
Last synced: 25 Oct 2024
https://github.com/gatoreducator/gatorminer
A visualized text mining and analysis tool for student markdown reflection documents based on Natural language processing in the Dept of CS at Allegheny College.
nlp spacy streamlit textmining
Last synced: 12 Oct 2024
https://github.com/d-one/nlpeasy
Easy Peasy Language Squeezy
datascience elasticsearch kibana nlp spacy
Last synced: 14 Oct 2024
https://github.com/fako/spacy_arguing_lexicon
A spaCy extension wrapping around the arguing lexicon by MPQA
argument-mining argumentation spacy spacy-extension
Last synced: 14 Oct 2024
https://github.com/bikatr7/kudasai
Streamlining Japanese-English Translation with Advanced Preprocessing and Integrated Translation Technologies
auto-translation chatgpt deepl gemini japanese-english japanese-english-translation japanese-translation machine-learning machine-translation nlp-preprocessing python spacy text-processing translation
Last synced: 18 Oct 2024
https://github.com/joshday/spacy.jl
Get up and running with Python's spaCy inside Julia
julia natural-language-processing python spacy
Last synced: 11 Oct 2024
https://github.com/jdagdelen/mondigy
A small component for using Mongodb databases with Prodigy annotation applications.
annotations mongodb natural-language-processing prodigy spacy spacy-nlp
Last synced: 14 Oct 2024
https://github.com/tc64/spacyss
Sentence Segmentation for Spacy
sentence-boundary-detection sentence-segmentation spacy spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/explosion/thinc_gpu_ops
GPU kernels for Thinc
ai artificial-intelligence deep-learning machine-learning natural-language-processing nlp python spacy thinc
Last synced: 28 Sep 2024
https://github.com/andrewrosss/rake-spacy
Python implementation of the Rapid Automatic Keyword Extraction algorithm using spaCy
algorithm keyword-extraction ml nlp python rake rake-nltk spacy
Last synced: 14 Oct 2024
https://github.com/martinomensio/it_vectors_wiki_spacy
Word embeddings for Italian language, spacy2 prebuilt model
embeddings glove italian model pretrained spacy spacy2 wordvectors
Last synced: 19 Oct 2024
https://github.com/kernel-loophole/kg-graph
Knowledge graph from unstructured text
knowledge-graph ml nlp-machine-learning nltk pagerank search-algorithm spacy text text-mining
Last synced: 14 Oct 2024
https://github.com/wjbmattingly/tap-2024-spacy-llms
This is the repository for my 2024 Tap Institute Course on spaCy with LLMs
Last synced: 14 Oct 2024
https://github.com/wjbmattingly/keyword-spacy
Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.
Last synced: 14 Oct 2024
https://github.com/davebulaval/spacy-language-detection
Fully customizable language detection for spaCy pipeline
language-detection nlp spacy spacy-extension
Last synced: 30 Sep 2024
https://github.com/wjbmattingly/bagpipes-spacy
Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.
Last synced: 10 Oct 2024
https://github.com/nikhiljsk/preprocess_nlp
A fast framework for pre-processing (Cleaning text, Reduction of vocabulary, Feature extraction and Vectorization). Implemented with parallel processing using custom number of processes.
cleaning-data feature-extraction glove natural-language-processing nlp parallel-processing preprocess python3 reduction spacy stages tfidf vectorization word2vec
Last synced: 14 Oct 2024
https://github.com/centre-for-humanities-computing/conspiracies
A python package for discovering and examining conspiracies using NLP.
conspiracies conspiracy knowledge-graph nlp spacy
Last synced: 14 Oct 2024
https://github.com/cyclecycle/role-pattern-nlp
Build and match patterns for semantic role labelling / information extraction with SpaCy
nlp python semantic-role-labeling spacy
Last synced: 12 Oct 2024
https://github.com/cyclecycle/visualise-spacy-tree
Create dependency tree plots from SpaCy Doc objects
Last synced: 14 Oct 2024
https://github.com/fastent/fastent
custom models for named-entity recognition
data-annotation data-generation named-entities named-entity-recognition natural-language-processing nlp spacy
Last synced: 12 Oct 2024
https://github.com/turbolent/spacy-thrift
spaCy as a service using Thrift
named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy thrift
Last synced: 14 Oct 2024
https://github.com/nineinchnick/displacy
Python port of https://github.com/explosion/displacy
css natural-language-processing nlp python spacy svg visualization
Last synced: 13 Oct 2024
https://github.com/cvcio/mediawatch
Empowering news organizations to fight disinformation
ai elas golang grpc kafka misinformation neo4j network-analysis nodejs python spacy transformers
Last synced: 23 Oct 2024
https://github.com/machinelearningzh/zix_understandability-index
Get a pragmatic assessment of how understandable a German text is.
cefr-prediction llms machine-learning natural-language-processing nlp nlp-dataset nlp-library python spacy textdescriptives understandability
Last synced: 14 Oct 2024
https://github.com/asyml/forte-wrappers
Forte wrapper of third-party toolkits.
allennlp casl deep-learning elasticsearch forte huggingface machine-learning nlp nlp-library nltk processors spacy stanza
Last synced: 11 Oct 2024
https://github.com/plandes/nlparse
Natural language processing parsing and tool library
natural-language-processing nlp-machine-learning pypi-badge pypi-link spacy spacy-nlp
Last synced: 12 Oct 2024
https://github.com/ljvmiranda921/spacy-span-analyzer
Simple tool to analyze spans in your dataset. Implementation of Papay et al.'s work (EMNLP 2020) on span performance prediction
machine-learning natural-language-processing nlp spacy
Last synced: 30 Sep 2024
https://github.com/turbolent/spacykit
Industrial-strength Natural Language Processing (NLP) with Swift
natural-language-processing nlp spacy swift
Last synced: 19 Oct 2024
https://github.com/papachristoumarios/capbib
:book: Bibliography transformations made easier with NLP
Last synced: 11 Oct 2024
https://github.com/opensemanticsearch/spacy-services.deb
Debian & Ubuntu package for REST microservices for spaCy natural language processing and machine learning framework for named entity recognition
api debian debian-packages named-entity-recognition natural-language-processing nlp-machine-learning python spacy spacy-nlp
Last synced: 11 Oct 2024
https://github.com/johnfraney/django-ner-trainer
Tools for training spaCy Named Entity Recognition models in Django
django django-rest-framework named-entity-recognition natural-language-processing spacy
Last synced: 14 Oct 2024
https://github.com/explosion/spacy-legacy
Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
Last synced: 07 Oct 2024
https://github.com/ninadpatil09/nlp-notebooks
Explore NLP tasks with Python using NLTK, SpaCy & scikit-learn: Tokenization, Normalization, NER, POS tagging, Encoding, Word embedding.
natural-language-processing nlp nlp-machine-learning nltk python spacy
Last synced: 14 Oct 2024
https://github.com/wjbmattingly/number-spacy
Number spaCy is a custom spaCy pipeline component that enhances the identification of number entities in text and fetches the parsed numeric values using spaCy's token extensions.
Last synced: 12 Oct 2024
https://github.com/kanishk3813/intel_sentiment_analysis
Intel Review Analyzer is a powerful tool designed to help businesses understand customer sentiments through automated analysis of reviews. This project leverages state-of-the-art NLP techniques to classify reviews, highlight key sentiments, generate word clouds, and visualize trends over time.
axios bert-model cors deep-learning flask pandas python react spacy
Last synced: 14 Oct 2024
https://github.com/bikatr7/kairyou
Quickly preprocesses Japanese text using NLP/NER from SpaCy for Japanese translation or other NLP tasks.
japanese ner nlp preprocess spacy
Last synced: 18 Oct 2024
https://github.com/surajiyer/spacycake
Simple keyphrase extraction extensions and pipeline components for spaCy.
keyphrase-extraction natural-language-processing nlp spacy spacy-extension spacy-pipeline
Last synced: 10 Oct 2024
https://github.com/jbahire/semantic-similarity
This project gives implementations of semantic similarity using various text embeddings, and you can easily compare results using the provided API. Go ahead and build your own API for integration in your use case.
bert elmo machine-learning natural-language-processing semantic-similarity spacy word2vec
Last synced: 31 Oct 2024
https://github.com/gtoffoli/spacy-ar_core_news_md
Unofficial Arabic language model for spaCy
arabic-language camel nlp python spacy spacy-pipeline tokenizer
Last synced: 14 Oct 2024
https://github.com/chanind/reddit-words
What have Spacy's sense2vec 2019 word vectors learned from Reddit?
sense2vec spacy spacy-nlp word2vec
Last synced: 15 Oct 2024
https://github.com/nluninja/text-mining-dataviz
Data Visualization and Text Mining course - UNICATT
embeddings lstm nlp spacy text-mining transformers
Last synced: 25 Oct 2024
https://github.com/bees4ever/seaqube
Semantic Quality Benchmark for Word Embeddings, i.e. Natural Language Models in Python. Acronym `SeaQuBe` or `seaqube`.
augmentation benchmark fasttext gensim nlp spacy spacy-nlp wordembeddings
Last synced: 18 Oct 2024
https://github.com/diyclassics/la_senter
Repository for training spaCy-compatible sentence segmenter for Latin
Last synced: 19 Oct 2024
https://github.com/louisguitton/spacy-lancedb-linker
spaCy pipeline component for ANN Entity Linking using LanceDB
ann entity-linking lancedb spacy spacy-pipeline
Last synced: 12 Oct 2024
https://github.com/sloev/sentimental-onix
sentiment analysis for spacy pipeline in python
onnx sentiment-analysis spacy spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/herambvd/spoken2written
Source for a Python package that converts language styles in speech to their equivalent written form.
artificial-intelligence entity machine-learning named-entity-recognition natural-language-processing spacy speech-recognition token-matcher
Last synced: 14 Oct 2024
https://github.com/nickcrews/spacy-address
Parse oneline US addresses using a spaCy NER model trained on OSM data
address address-parsing osm osm-data spacy spacy-nlp usaddress
Last synced: 20 Oct 2024
https://github.com/kasakee/spacy-nlp-node
A library that will expose the parse method of SpaCy to Node.js
natural-language-processing nlp node node-js nodejs spacy spacy-nlp spacy-nlp-node spacy-node
Last synced: 12 Oct 2024
https://github.com/senisioi/rolegal
A Spacy Package for Romanian Legal Document Processing
floret legal-documents ner romanian-language spacy
Last synced: 12 Oct 2024
https://github.com/direct-phonology/spacy-och
the old chinese language for spaCy
Last synced: 12 Oct 2024
https://github.com/riyajha2305/healthcare-diagnosis-chatbot-ms-hackathon
Built a chatbot capable of diagnosing common medical conditions based on user symptoms input. Utilized pre-trained machine learning models such as NLP and NER from Huggingface and Spacy, trained on medical data to provide accurate suggestions and recommendations for further action.
hackathon healthcare-chatbot huggingface machine-learning ner nlp python spacy tkinter
Last synced: 09 Oct 2024
https://github.com/surajiyer/spacybert
BERT inference (with similar function to hanxiao/bert-as-service) for spaCy with custom extension attributes
bert huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch pytorch-model spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/timuroeztuerk/data-science-lecture-s24
This is the webpage of the Data Science course offered by VWL 7 for the summer semester 2024.
economics natural-language-processing nltk spacy text-classification
Last synced: 14 Oct 2024
https://github.com/populated/compare
A simple Python-based code to compare texts for similarities.
comparsion nlp numpy spacy text
Last synced: 14 Oct 2024
https://github.com/jamnicki/metin2_vision_bot
Automatic MMORPG Bot for Dungeons Massive Passing based on Windows API, YoloV8 object detection, statistical methods from OpenCV, Tesseract-OCR and spaCy virtualised on Hyper-V, Win11+CUDA
computer-vision object-detection opencv spacy tesseract-ocr torchvision ultralytics win32
Last synced: 03 Nov 2024
https://github.com/inanyan/spacy_pat_match_dsl
A simple DSL for creating spaCy pattern matchers
Last synced: 14 Oct 2024
https://github.com/izuna385/scispacy-candidate-generator
Generating Candidate Entities with ScispaCy
allennlp entity-linking natural-language-processing scispacy spacy
Last synced: 18 Oct 2024
https://github.com/gtoffoli/commons-textanalysis
Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.
django nlp python spacy text-analysis
Last synced: 07 Aug 2024
https://github.com/gaving/zorya
:grapes: Build NER graphs from YouTube transcripts
neo4j ner spacy youtube-transcripts
Last synced: 27 Oct 2024
https://github.com/ljvmiranda921/ud-tagalog-spacy
Training a POS Tagger and Dependency Parser for a Low-Resource Language (Tagalog)
low-resource-languages machine-learning nlp spacy tagalog
Last synced: 19 Oct 2024
https://github.com/teakulo/eventime-app
Eventime App is an event management platform using Angular, Spring Boot, Flask, and PostgreSQL. It offers AI-powered event recommendations, social features, and secure authentication. Users can manage events, chat with a chatbot, and view their calendar.
ai angular authentication calendar chatbot event flask lemmatization nlp nltk postgresql spacy springboot
Last synced: 14 Oct 2024
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 14 Oct 2024
https://github.com/ccoreilly/spacy-catala-generator
Training and dataset used for the catalan spacy model
catala catalan catalan-language spacy spacy-models
Last synced: 30 Oct 2024
https://github.com/sloev/spacy_onnx_sentiment_english
english sentiment model for spacy
onnx-models sentiment-analysis spacy spacy-pipeline
Last synced: 19 Oct 2024
https://github.com/turbolent/telescope
Go explore
compiler nlp parser question-answering scala spacy sparql
Last synced: 19 Oct 2024
https://github.com/turbolent/spacy-thrift-docker
spacy-thrift as a Docker container
docker named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy thrift
Last synced: 19 Oct 2024
https://github.com/plandes/mimic
MIMIC III Corpus Parsing
mimic-iii natural-language-processing parsing-library spacy
Last synced: 12 Oct 2024
https://github.com/jblake1965/elucidoc
Screens legal text and extracts sentences containing user input party name-predicate phrases
excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word
Last synced: 12 Oct 2024
https://github.com/omar7tech/text-summarization
This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.
natural-language-processing python spacy text-summarization tokenization
Last synced: 31 Oct 2024
https://github.com/jonasrenault/cprex
Chemical Properties Relation Extraction
chemistry crawler deep-learning information-extraction machine-learning named-entity-recognition nlp pubchem relation-extraction scientific-articles spacy transformers
Last synced: 14 Oct 2024
https://github.com/public-health-scotland/dose_instruction_parser
Parsing prescription dose instructions using Named Entity Recognition and rules
dose-instructions drug machine-learning named-entity-recognition natural-language-processing nhs prescribing prescriptions public-health scotland spacy
Last synced: 14 Oct 2024
https://github.com/aiatyourservice/deeplearningforcoders
Hey, this repo contains code from deep learning specialization by Andrew NG
deep-learning nltk python pytorch spacy
Last synced: 14 Oct 2024
https://github.com/fferegrino/zeldakg
A TLOZ inspired knowledge graph
infobox knowledge-graph nltk pandas python spacy wikidata
Last synced: 28 Oct 2024
https://github.com/lucasspinola/monitorbot-api
API built with FastApi and Spacy to assist an educational bot with its activities during class.
Last synced: 03 Nov 2024
https://github.com/jash271/youglance
Package for analyzing Youtube Videos from searching by relevant entities to analyzing sentiments and clustering different parts of the video according to your liking
cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling
Last synced: 14 Oct 2024
https://github.com/farahibrar/programming-in-python
Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.
beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow
Last synced: 15 Oct 2024
https://github.com/sudip-13/nlp
This repo contains the configured back end for an NLP Dialogflow chatbot tutorial
dialogflow fastapi fasttext mogodb ner regex spacy tf-idf
Last synced: 14 Oct 2024
https://github.com/somenath203/named-entity-recognizer
Click below to checkout the website
huggingface huggingface-spaces named-entity-recognition ner spacy streamlit torch transformers
Last synced: 14 Oct 2024
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 14 Oct 2024
https://github.com/gugarosa/brainy
An intelligent Python-inspired Machine Learning API for training NLP-based models.
api machine-learning nlp python spacy
Last synced: 18 Oct 2024
https://github.com/aadityasivas/spacy-text-summarization
A simple text summarizer built with spaCy
jupyter-notebook nlp python spacy
Last synced: 05 Nov 2024
https://github.com/tomhalloin/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 03 Aug 2024
https://github.com/toshimelonhead/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 13 Aug 2024
https://github.com/bghorvath/TextMiningTheBechdelTest
Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test
bechdel bechdel-test coreference-resolution neuralcoref spacy
Last synced: 03 Aug 2024