Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2024-12-27 00:23:22 UTC
- JSON Representation
https://gitlab.com/tangibleai/community/qary-cli
Command Line Interface for developers of qary -- the open source, teachable AI assistant that truly assists, rather than manipulating you.
BERT ELIZA GPT3 HACKTOBERFEST2022 NLP Virtual Assistant chatbot datasets hacktoberfest machine learning pytorch sklearn spacy torch
Last synced: 01 Nov 2024
https://github.com/yashdew/assessor
An open-source Resume Analyzer and Ranking tool for recruiters and candidates.
flask hacktoberfest hacktoberfest2021 nextjs nlp python spacy
Last synced: 27 Oct 2024
https://github.com/code-kern-ai/refinery-python-sdk
Official Python SDK for Kern AI refinery.
active-learning data-centric-ai deep-learning labeling labeling-tool machine-learning natural-language-processing neural-search nlp python sdk spacy supervised-learning text-annotation text-classification transformer
Last synced: 01 Nov 2024
https://github.com/mmxgn/sprl-spacy
Implementation of Spatial Role Labeling using the Spacy NLP framework.
nlp problog spacy spatial-role-labeling sprl
Last synced: 10 Oct 2024
https://github.com/talmago/spacy_crfsuite
sequence tagging with spaCy and crfsuite
crf crf-model crfsuite entity-extraction entity-extraction-extension entity-tagging nlp sklearn-crfsuite spacy spacy-extension spacy-ner
Last synced: 12 Oct 2024
https://github.com/spacyturk/spacyturk
spaCyTurk - trained models & pipelines for Turkish
floret nlp nlp-library spacy turkish-nlp
Last synced: 12 Oct 2024
https://github.com/centre-for-humanities-computing/odycy
A general-purpose NLP pipeline for Ancient Greek
ancient-greek machine-learning natural-language-processing nlp python spacy
Last synced: 09 Nov 2024
https://github.com/izuna385/wikia-and-wikipedia-el-dataset-creator
You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wiki are available!
deep-learning entity-linking named-entity-recognition natural-language-processing natural-language-understanding spacy wikipedia
Last synced: 08 Nov 2024
https://github.com/megagonlabs/ginza-transformers
Use custom tokenizers in spacy-transformers
ginza natural-language-processing nlp spacy spacy-transformers sudachitra tokenizers transformers
Last synced: 14 Oct 2024
https://github.com/ccoreilly/spacy-catala
Spacy NLP Model for the Catalan language
catalan catalan-language nlp nlp-model nlu nlu-model spacy
Last synced: 23 Oct 2024
https://github.com/aayushpatel007/topicrankpy
A Python package to get useful information from documents using TopicRank Algorithm.
data-preprocessing email-parsing graph-algorithms hierarchical-clustering keyphrase-extraction keywords-extraction named-entity-recognition network-x nlp pagerank-python phone-parse spacy text-cleaning textrank topicrank
Last synced: 12 Oct 2024
https://github.com/maxbot-ai/maxbot
Maxbot is an open source library and framework for creating conversational apps
bot botkit chatbot chatbot-framework conversational-ai conversational-apps maxbot nlp rasa spacy text-bot voice-bot
Last synced: 18 Oct 2024
https://github.com/tokestermw/spacy_kenlm
:game_die: KenLM extension for spaCy 2.0.
kenlm language-model nlp spacy spacy-extension spacy-nlp
Last synced: 14 Oct 2024
https://github.com/wadaboa/ner-annotator
GUI useful to manually annotate text for Named Entity Recognition purposes
named-entity-recognition ner nlp pyqt5 spacy
Last synced: 12 Oct 2024
https://github.com/pythainlp/spacy-pythainlp
PyThaiNLP For spaCy
nlp-library python spacy spacy-extensions
Last synced: 14 Oct 2024
https://github.com/eea/eea.corpus
Machine Learning and Natural Language Processing of the EEA Corpus via spaCy, Textacy and pyLDAvis and other useful NLP algorithms.
data-visualization eea-corpus environment latent-dirichlet-allocation lda-visualisation machine-learning natural-language-processing nlp spacy text-mining topic-modeling
Last synced: 24 Nov 2024
https://github.com/ghosthamlet/CHN
Hacker news on Console with auto classifer and recommender in reactjs style code
console-ui deeplearning hacker-news hackernews machinelearning reactjs sklearn spacy word2vec
Last synced: 18 Nov 2024
https://github.com/riccorl/ipa
NLP Preprocessing Pipeline Wrappers
lemmatization model natural-language-processing nlp part-of-speech-tagger pipeline preprocessing spacy stanza tagging token tokenizer wrapper
Last synced: 14 Oct 2024
https://github.com/oarriaga/luvina
High-level Natural Language Processing (NLP) for Python.
natural-language-processing nlp nltk python spacy
Last synced: 14 Oct 2024
https://github.com/inphyt/imdb_sentiment_analysis_bert
BERT Sentiment Classification on the IMDb Large Movie Review Dataset.
bert bert-model data-mining data-mining-algorithms data-mining-python data-science machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning scikit-learn sentiment-analysis sentiment-classification spacy spacy-models spacy-nlp
Last synced: 12 Nov 2024
https://github.com/autonomio/signs
A suite of tools for text preparation, vectorization and processing for deep learning with Keras.
embeddings fasttext gensim glove keras spacy word2vec
Last synced: 14 Oct 2024
https://github.com/medspacy/sectionizer
A rule-based Python module for spitting documents into sections.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 11 Nov 2024
https://github.com/dcavar/spacy-json-nlp
spaCy wrapper for JSON-NLP.
json natural-language-processing nlp spacy
Last synced: 18 Oct 2024
https://github.com/mertguvencli/keyword-extractor
This project aims to find "what are the trending techs on Data Science jobs?" using NER.
data-science machine-learning ner nlp python spacy
Last synced: 09 Dec 2024
https://github.com/explosion/spacy-loggers
📟 Logging utilities for spaCy
logging machine-learning natural-language-processing nlp python spacy
Last synced: 25 Dec 2024
https://github.com/GatorEducator/GatorMiner
A visualized text mining and analysis tool for student markdown reflection documents based on Natural language processing in the Dept of CS at Allegheny College.
nlp spacy streamlit textmining
Last synced: 25 Dec 2024
https://github.com/chaitjo/knowledge-graphs
Building Knowledge Graphs from Unstructured Text
knowledge-graph networkx neuralcoref spacy unstructured-data wikipedia
Last synced: 25 Oct 2024
https://github.com/gatoreducator/gatorminer
A visualized text mining and analysis tool for student markdown reflection documents based on Natural language processing in the Dept of CS at Allegheny College.
nlp spacy streamlit textmining
Last synced: 12 Oct 2024
https://github.com/d-one/nlpeasy
Easy Peasy Language Squeezy
datascience elasticsearch kibana nlp spacy
Last synced: 14 Oct 2024
https://github.com/joshday/spacy.jl
Get up and running with Python's spaCy inside Julia
julia natural-language-processing python spacy
Last synced: 11 Oct 2024
https://github.com/fako/spacy_arguing_lexicon
A spaCy extension wrapping around the arguing lexicon by MPQA
argument-mining argumentation spacy spacy-extension
Last synced: 14 Oct 2024
https://github.com/sematext/activate
Examples for the Activate conference
activate entity nlp opennlp recognition sematext solr spacy tagger
Last synced: 11 Nov 2024
https://github.com/bikatr7/kudasai
Streamlining Japanese-English Translation with Advanced Preprocessing and Integrated Translation Technologies
auto-translation chatgpt deepl gemini japanese-english japanese-english-translation japanese-translation machine-learning machine-translation nlp-preprocessing python spacy text-processing translation
Last synced: 18 Oct 2024
https://github.com/chrislemke/nlp-text-classifier
NLP for classifying text. Using word Word2Vec word embedding and a neural net with bidirectional LSTM to categorize sentences provided by the user 🤔
colab-notebook jupyter-notebook matplotlib natural-language-processing neural-network nlp-machine-learning nltk pandas philosophy spacy tensorflow2 visualize-data word2vec
Last synced: 16 Nov 2024
https://github.com/worldbank/wb-nlp-tools
Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
gensim langdetect nlp nltk pdf2text python spacy text-mining
Last synced: 10 Nov 2024
https://github.com/tc64/spacyss
Sentence Segmentation for Spacy
sentence-boundary-detection sentence-segmentation spacy spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/jdagdelen/mondigy
A small component for using Mongodb databases with Prodigy annotation applications.
annotations mongodb natural-language-processing prodigy spacy spacy-nlp
Last synced: 14 Oct 2024
https://github.com/explosion/thinc_gpu_ops
🔮 GPU kernels for Thinc
ai artificial-intelligence deep-learning machine-learning natural-language-processing nlp python spacy thinc
Last synced: 28 Sep 2024
https://github.com/martinomensio/it_vectors_wiki_spacy
Word embeddings for Italian language, spacy2 prebuilt model
embeddings glove italian model pretrained spacy spacy2 wordvectors
Last synced: 19 Oct 2024
https://github.com/andrewrosss/rake-spacy
Python implementation of the Rapid Automatic Keyword Extraction algorithm using spaCy
algorithm keyword-extraction ml nlp python rake rake-nltk spacy
Last synced: 14 Oct 2024
https://github.com/nikhiljsk/preprocess_nlp
A fast framework for pre-processing (Cleaning text, Reduction of vocabulary, Feature extraction and Vectorization). Implemented with parallel processing using custom number of processes.
cleaning-data feature-extraction glove natural-language-processing nlp parallel-processing preprocess python3 reduction spacy stages tfidf vectorization word2vec
Last synced: 14 Oct 2024
https://github.com/wjbmattingly/keyword-spacy
Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.
Last synced: 14 Oct 2024
https://github.com/kernel-loophole/kg-graph
Knowledge graph from unstructured text
knowledge-graph ml nlp-machine-learning nltk pagerank search-algorithm spacy text text-mining
Last synced: 14 Oct 2024
https://github.com/wjbmattingly/tap-2024-spacy-llms
This is the repository for my 2024 Tap Institute Course on spaCy with LLMs
Last synced: 14 Oct 2024
https://github.com/wjbmattingly/bagpipes-spacy
Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.
Last synced: 10 Oct 2024
https://github.com/davebulaval/spacy-language-detection
Fully customizable language detection for spaCy pipeline
language-detection nlp spacy spacy-extension
Last synced: 30 Sep 2024
https://github.com/fastent/fastent
custom models for named-entity recognition
data-annotation data-generation named-entities named-entity-recognition natural-language-processing nlp spacy
Last synced: 12 Oct 2024
https://github.com/shivam5992/spacy_nlp
:clipboard: introduction about implementation and usage of spacy - fast industrial strength natural language processing library
Last synced: 24 Dec 2024
https://github.com/cyclecycle/role-pattern-nlp
Build and match patterns for semantic role labelling / information extraction with SpaCy
nlp python semantic-role-labeling spacy
Last synced: 12 Oct 2024
https://github.com/centre-for-humanities-computing/conspiracies
A python package for discovering and examining conspiracies using NLP.
conspiracies conspiracy knowledge-graph nlp spacy
Last synced: 14 Oct 2024
https://github.com/thepanacealab/annotated_twitter_covid19_dataset
A Biomedically Oriented automatically annotated Twitter COVID-19 Dataset
annotated-datasets covid-19 covid19-data medacy medspa scispacy spacy
Last synced: 12 Nov 2024
https://github.com/cyclecycle/visualise-spacy-tree
Create dependency tree plots from SpaCy Doc objects
Last synced: 14 Oct 2024
https://github.com/cvcio/mediawatch
Empowering news organizations to fight disinformation
ai elas golang grpc kafka misinformation neo4j network-analysis nodejs python spacy transformers
Last synced: 09 Nov 2024
https://github.com/turbolent/spacy-thrift
spaCy as a service using Thrift
named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy thrift
Last synced: 14 Oct 2024
https://github.com/nineinchnick/displacy
Python port of https://github.com/explosion/displacy
css natural-language-processing nlp python spacy svg visualization
Last synced: 13 Oct 2024
https://github.com/machinelearningzh/zix_understandability-index
Get a pragmatic assessment how understandable a German text is.
cefr-prediction llms machine-learning natural-language-processing nlp nlp-dataset nlp-library python spacy textdescriptives understandability
Last synced: 14 Oct 2024
https://github.com/ljvmiranda921/spacy-span-analyzer
Simple tool to analyze spans in your dataset. Implementation of Papay et al's work (EMNLP 2020) on span performance prediction
machine-learning natural-language-processing nlp spacy
Last synced: 30 Sep 2024
https://github.com/plandes/nlparse
Natural language processing parsing and tool library
natural-language-processing nlp-machine-learning pypi-badge pypi-link spacy spacy-nlp
Last synced: 12 Oct 2024
https://github.com/turbolent/spacykit
Industrial-strength Natural Language Processing (NLP) with Swift
natural-language-processing nlp spacy swift
Last synced: 19 Oct 2024
https://github.com/johnfraney/django-ner-trainer
Tools for training spaCy Named Entity Recognition models in Django
django django-rest-framework named-entity-recognition natural-language-processing spacy
Last synced: 14 Oct 2024
https://github.com/dcavar/antisemitismdatathon2020
This is project material for the Antisemitism Datathon and Hackathon 2020 at Indiana University
antisemitism corpus-data flair hatespeech machine-learning nltk python pytorch social-media spacy tensorflow twitter
Last synced: 07 Nov 2024
https://github.com/opensemanticsearch/spacy-services.deb
Debian & Ubuntu package for REST microservices for spaCy natural language processing and machine learning framework for named entity recognition
api debian debian-packages named-entity-recognition natural-language-processing nlp-machine-learning python spacy spacy-nlp
Last synced: 11 Oct 2024
https://github.com/asyml/forte-wrappers
Forte wrapper of third-party toolkits.
allennlp casl deep-learning elasticsearch forte huggingface machine-learning nlp nlp-library nltk processors spacy stanza
Last synced: 11 Oct 2024
https://github.com/papachristoumarios/capbib
:book: Bibliography transformations made easier with NLP
Last synced: 11 Oct 2024
https://github.com/wjbmattingly/number-spacy
Number spaCy is a custom spaCy pipeline component that enhances the identification of number entities in text and fetches the parsed numeric values using spaCy's token extensions.
Last synced: 12 Oct 2024
https://github.com/f1uctus/ttc
✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].
conversation nlp nlp-apis nlp-library spacy spacy-extension spacy-nlp spacy-pipeline speaker-identification
Last synced: 15 Nov 2024
https://github.com/reyesgeorge/vigil
Vigil is an analysis dashboard created using the visualization framework Dash
dash dashboard growth-hacking knowledge-graph python scrapy spacy twitter-api
Last synced: 13 Nov 2024
https://github.com/ninadpatil09/nlp-notebooks
Explore NLP tasks with Python using NLTK, SpaCy & scikit-learn: Tokenization, Normalization, NER, POS tagging, Encoding, Word embedding.
natural-language-processing nlp nlp-machine-learning nltk python spacy
Last synced: 14 Oct 2024
https://github.com/bikatr7/kairyou
Quickly preprocesses Japanese text using NLP/NER from SpaCy for Japanese translation or other NLP tasks.
japanese ner nlp preprocess spacy
Last synced: 18 Oct 2024
https://github.com/explosion/spacy-legacy
🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
Last synced: 07 Oct 2024
https://github.com/o19s/bad-libs
:memo: Automatically converts any book into a Mad-Libs style game of silliness using spaCy. Free Charles Dickens included!
Last synced: 20 Nov 2024
https://github.com/kazkozdev/novelgenerator
📚 NovelGenerator - AI-powered fiction book generator that uses Ollama's LLMs to create complete novels with coherent plot structures, developed characters and multiple writing styles.
ai-novels airesearch aiwriting fiction-generator localllm machinelearning nlp novel-writing ollama python spacy text-generation
Last synced: 21 Dec 2024
https://github.com/jash271/youglance-extension
A chrome extension that Simplies your youtube video viewing experience.Navigate directly to the part you're interested in by typing it in the Search bar and we'll handle the Rest.Search by important and frequent entities mentioned in the video and gauge an understanding of the overall Sentiment of the video
deep-learning fastapi javascript nlp oops-in-python regex spacy
Last synced: 24 Nov 2024
https://github.com/kanishk3813/intel_sentiment_analysis
Intel Review Analyzer is a powerful tool designed to help businesses understand customer sentiments through automated analysis of reviews. This project leverages state-of-the-art NLP techniques to classify reviews, highlight key sentiments, generate word clouds, and visualize trends over time.
axios bert-model cors deep-learning flask pandas python react spacy
Last synced: 14 Oct 2024
https://github.com/surajiyer/spacycake
Simple keyphrase extraction extensions and pipeline components for spaCy.
keyphrase-extraction natural-language-processing nlp spacy spacy-extension spacy-pipeline
Last synced: 10 Oct 2024
https://github.com/senisioi/rolegal
A Spacy Package for Romanian Legal Document Processing
floret legal-documents ner romanian-language spacy
Last synced: 12 Oct 2024
https://github.com/weihanchen/google-colab-python-learn
📚 Learn Google Colab、Python、ML、OpenAI、Whisper、spaCy、NLP、HuggingFace
colab-notebook huggingface matplotlib natural-language-processing nlp openai pandas python spacy whisper
Last synced: 11 Nov 2024
https://github.com/miladnouriezade/ktrain-biobert_ner
This repository contains data and BioBert based NER model monologg/biobert_v1.1_pubmed from community-uploaded Hugging Face models for detecting entities such as chemical and disease.
biobert biomedical bionlp disease fasttext huggingface ktrain name named-entity-recognition ner nlp python spacy
Last synced: 03 Dec 2024
https://github.com/eliask93/debertav3-for-aspect-based-sentiment-analysis
Application for training the pretrained transformer model DeBERTaV3 on an Aspect Based Sentiment Analysis task
aspect-based-sentiment-analysis deberta nlp simpletransformers spacy
Last synced: 13 Nov 2024
https://github.com/louisguitton/spacy-lancedb-linker
spaCy pipeline component for ANN Entity Linking using LanceDB
ann entity-linking lancedb spacy spacy-pipeline
Last synced: 12 Oct 2024
https://github.com/jtlicardo/spacy-ner
A demo app that extracts process tasks from text
named-entity-recognition spacy streamlit
Last synced: 15 Nov 2024
https://github.com/kasakee/spacy-nlp-node
A library that will expose the parse method of SpaCy to Node.js
natural-language-processing nlp node node-js nodejs spacy spacy-nlp spacy-nlp-node spacy-node
Last synced: 12 Oct 2024
https://github.com/cloudera/cml_amp_spacy_entity_extraction
A Jupyter notebook demonstrating entity extraction on headlines with SpaCy.
entity-extraction named-entity-recognition nlp spacy
Last synced: 07 Nov 2024
https://github.com/amrrs/intro_to_nlp_with_spacy
Introduction to NLP with Spacy - Bangpypers October Talk
Last synced: 15 Nov 2024
https://github.com/nickcrews/spacy-address
Parse oneline US addresses using a spaCy NER model trained on OSM data
address address-parsing osm osm-data spacy spacy-nlp usaddress
Last synced: 20 Oct 2024
https://github.com/anushadatta/natural-language-processing
📑 NLP applications with NLTK, spaCy and PyTorch.
natural-language-processing nltk pytorch spacy
Last synced: 11 Dec 2024
https://github.com/chanind/reddit-words
What have Spacy's sense2vec 2019 word vectors learned from Reddit?
sense2vec spacy spacy-nlp word2vec
Last synced: 05 Dec 2024
https://github.com/gaving/zorya
:grapes: Build NER graphs from YouTube transcripts
neo4j ner spacy youtube-transcripts
Last synced: 21 Dec 2024
https://github.com/bees4ever/seaqube
Semantic Quality Benchmark for Word Embeddings, i.e. Natural Language Models in Python. Acronym `SeaQuBe` or `seaqube`.
augmentation benchmark fasttext gensim nlp spacy spacy-nlp wordembeddings
Last synced: 18 Oct 2024
https://github.com/jbahire/semantic-similarity
This project gives implemetations of semantic similarity using various text embeddings and you can easily compare results using API provided. Go ahead and build your own API for integration in your use case.
bert elmo machine-learning natural-language-processing semantic-similarity spacy word2vec
Last synced: 19 Dec 2024
https://github.com/sloev/sentimental-onix
sentiment analysis for spacy pipeline in python
onnx sentiment-analysis spacy spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/diyclassics/la_senter
Repository for training spaCy-compatible sentence segmenter for Latin
Last synced: 08 Dec 2024
https://github.com/farahibrar/programming-in-python
Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.
beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow
Last synced: 06 Dec 2024
https://github.com/redraw/sqlite-ner
sqlite tool to extract entities into a new table using spaCy
Last synced: 26 Nov 2024