Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization
- Last updated: 2024-11-05 00:28:59 UTC
https://github.com/explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
ai artificial-intelligence cython data-science deep-learning entity-linking machine-learning named-entity-recognition natural-language-processing neural-network neural-networks nlp nlp-library python spacy text-classification tokenization
Last synced: 26 Oct 2024
https://github.com/explosion/spacy
💫 Industrial-strength Natural Language Processing (NLP) in Python
ai artificial-intelligence cython data-science deep-learning entity-linking machine-learning named-entity-recognition natural-language-processing neural-network neural-networks nlp nlp-library python spacy text-classification tokenization
Last synced: 28 Oct 2024
https://github.com/spacy-io/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
ai artificial-intelligence cython data-science deep-learning entity-linking machine-learning named-entity-recognition natural-language-processing neural-network neural-networks nlp nlp-library python spacy text-classification tokenization
Last synced: 22 Aug 2024
https://github.com/rasahq/rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
bot bot-framework botkit bots chatbot chatbots chatbots-framework conversation-driven-development conversational-agents conversational-ai conversational-bots machine-learning machine-learning-library mitie natural-language-processing nlp nlu rasa spacy wit
Last synced: 01 Nov 2024
https://github.com/RasaHQ/rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
bot bot-framework botkit bots chatbot chatbots chatbots-framework conversation-driven-development conversational-agents conversational-ai conversational-bots machine-learning machine-learning-library mitie natural-language-processing nlp nlu rasa spacy wit
Last synced: 25 Oct 2024
https://github.com/RasaHQ/rasa_nlu
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
bot bot-framework botkit bots chatbot chatbots chatbots-framework conversation-driven-development conversational-agents conversational-ai conversational-bots machine-learning machine-learning-library mitie natural-language-processing nlp nlu rasa spacy wit
Last synced: 03 Aug 2024
https://github.com/huggingface/neuralcoref
✨ Fast Coreference Resolution in spaCy with Neural Networks
coreference coreference-resolution machine-learning neural-networks nlp python pytorch spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/explosion/thinc
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
ai artificial-intelligence deep-learning functional-programming jax machine-learning machine-learning-library mxnet natural-language-processing nlp python pytorch spacy tensorflow type-checking
Last synced: 28 Oct 2024
https://github.com/dipanjanS/practical-machine-learning-with-python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
classification clustering computer-vision convolutional-neural-networks deep-learning jupyter jupyter-notebook keras machine-learning natural-language-processing nltk notebook pandas prophet python scikit-learn spacy statsmodels tensorflow time-series-analysis
Last synced: 29 Oct 2024
https://github.com/dipanjans/practical-machine-learning-with-python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
classification clustering computer-vision convolutional-neural-networks deep-learning jupyter jupyter-notebook keras machine-learning natural-language-processing nltk notebook pandas prophet python scikit-learn spacy statsmodels tensorflow time-series-analysis
Last synced: 09 Oct 2024
https://github.com/explosion/spacy-course
👩‍🏫 Advanced NLP with spaCy: A free online course
binder course dependency-parsing gatsby gatsbyjs jupyter machine-learning named-entity-recognition natural-language-processing nlp online-course part-of-speech-tagging spacy word-vectors
Last synced: 07 Oct 2024
https://github.com/chartbeat-labs/textacy
NLP, before and after spaCy
natural-language-processing nlp python spacy
Last synced: 14 Oct 2024
https://github.com/keithrozario/Klayers
Python Packages as AWS Lambda Layers
aws-lambda lambda lambda-layer python spacy
Last synced: 02 Nov 2024
https://github.com/DerwenAI/pytextrank
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
graph-algorithms machine-learning natural-language natural-language-processing nlp python spacy spacy-extension summarization textgraphs textrank
Last synced: 29 Oct 2024
https://github.com/derwenai/pytextrank
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
graph-algorithms machine-learning natural-language natural-language-processing nlp python spacy spacy-extension summarization textgraphs textrank
Last synced: 29 Oct 2024
https://github.com/keithrozario/klayers
Python Packages as AWS Lambda Layers
aws-lambda lambda lambda-layer python spacy
Last synced: 14 Oct 2024
https://github.com/dipanjans/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
clustering gensim natural-language natural-language-processing nltk pattern python scikit-learn semantic sentiment sentiment-analysis spacy stanford-nlp text-analytics text-classification text-summarization
Last synced: 10 Oct 2024
https://github.com/dipanjanS/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
clustering gensim natural-language natural-language-processing nltk pattern python scikit-learn semantic sentiment sentiment-analysis spacy stanford-nlp text-analytics text-classification text-summarization
Last synced: 02 Aug 2024
https://github.com/explosion/sense2vec
🦆 Contextually-keyed word vectors
gensim gensim-word2vec machine-learning natural-language-processing nlp python sense2vec spacy word2vec
Last synced: 07 Oct 2024
https://github.com/explosion/spacy-models
💫 Models for the spaCy Natural Language Processing (NLP) library
machine-learning machine-learning-models models natural-language-processing nlp spacy spacy-models statistical-models
Last synced: 07 Oct 2024
https://github.com/allenai/scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
bioinformatics biomedical custom-pipes nlp scientific-documents spacy
Last synced: 14 Oct 2024
https://allenai.github.io/scispacy/
A full spaCy pipeline and models for scientific/biomedical documents.
bioinformatics biomedical custom-pipes nlp scientific-documents spacy
Last synced: 04 Aug 2024
https://github.com/code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
active-learning annotations artificial-intelligence data-centric-ai data-labeling data-science deep-learning human-in-the-loop labeling labeling-tool machine-learning natural-language-processing neural-search nlp python spacy supervised-learning text-annotation text-classification transformers
Last synced: 14 Oct 2024
https://github.com/DragonComputer/Dragonfire
The open-source virtual assistant for Ubuntu-based Linux distributions
artificial-intelligence chatbot kaldi linux machine-learning nlp personal-assistant spacy speech-recognition speech-to-text text-to-speech ubuntu virtual-assistant
Last synced: 01 Aug 2024
https://github.com/dragoncomputer/dragonfire
The open-source virtual assistant for Ubuntu-based Linux distributions
artificial-intelligence chatbot kaldi linux machine-learning nlp personal-assistant spacy speech-recognition speech-to-text text-to-speech ubuntu virtual-assistant
Last synced: 09 Oct 2024
https://github.com/explosion/spacy-transformers
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
bert google gpt-2 huggingface language-model machine-learning natural-language-processing natural-language-understanding nlp openai pytorch pytorch-model spacy spacy-extension spacy-pipeline transfer-learning xlnet
Last synced: 01 Nov 2024
https://github.com/explosion/projects
🪐 End-to-end NLP workflows from prototype to production
annotations datasets natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/explosion/spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
anthropic claude cohere dolly falcon gpt-3 gpt-4 large-language-models llama llm machine-learning named-entity-recognition natural-language-processing nlp openai prompt-engineering spacy text-classification
Last synced: 07 Oct 2024
https://github.com/NorskRegnesentral/skweak
skweak: A software toolkit for weak supervision applied to NLP tasks
data-science distant-supervision natural-language-processing nlp-library nlp-machine-learning python spacy training-data weak-supervision
Last synced: 26 Oct 2024
https://github.com/norskregnesentral/skweak
skweak: A software toolkit for weak supervision applied to NLP tasks
data-science distant-supervision natural-language-processing nlp-library nlp-machine-learning python spacy training-data weak-supervision
Last synced: 14 Oct 2024
https://github.com/explosion/spacy-streamlit
👑 spaCy building blocks and visualizers for Streamlit apps
dependency-parsing machine-learning named-entity-recognition natural-language-processing ner nlp part-of-speech-tagging spacy streamlit text-classification tokenization visualizer visualizers word-vectors
Last synced: 07 Oct 2024
https://github.com/openeventdata/mordecai
Full text geoparsing as a Python library
geocoding geonames geoparsing nlp spacy toponym-resolution
Last synced: 14 Oct 2024
https://github.com/explosion/spacy-stanza
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
corenlp data-science machine-learning natural-language-processing nlp spacy spacy-pipeline stanford-corenlp stanford-machine-learning stanford-nlp stanza
Last synced: 01 Nov 2024
https://github.com/nirantk/nlp_quickbook
NLP in Python with Deep Learning
ensemble language-processing natural-language natural-language-processing nlp practitioners spacy spacy-nlp spell-correction text-classification tutorial-code
Last synced: 14 Oct 2024
https://github.com/NirantK/NLP_Quickbook
NLP in Python with Deep Learning
ensemble language-processing natural-language natural-language-processing nlp practitioners spacy spacy-nlp spell-correction text-classification tutorial-code
Last synced: 02 Aug 2024
https://github.com/tecoholic/ner-annotator
Named Entity Recognition (NER) annotation tool for spaCy. Generates training data as JSON that is ready to use.
Last synced: 14 Oct 2024
https://github.com/medspacy/medspacy
Library for clinical NLP with spaCy.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 14 Oct 2024
https://github.com/microsoft/cookiecutter-spacy-fastapi
Cookiecutter API for creating Custom Skills for Azure Search using Python and Docker
azure-search cognitive-search fastapi natural-language-processing spacy
Last synced: 07 Oct 2024
https://github.com/phantominsights/subreddit-analyzer
A comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.
matplotlib nlp pandas python3 seaborn spacy wordcloud
Last synced: 30 Oct 2024
https://github.com/phantominsights/mexican-government-report
Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.
geopandas matplotlib nlp numpy pandas seaborn spacy
Last synced: 30 Oct 2024
https://github.com/explosion/prodigy-recipes
🍳 Recipes for Prodigy, our fully scriptable annotation tool
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/explosion/wasabi
🍣 A lightweight console printing and formatting toolkit
console console-color formatting python python-2 python-3 spacy
Last synced: 01 Nov 2024
https://github.com/KristiyanVachev/Question-Generation
Generating multiple choice questions from text using Machine Learning.
ai cosine-similarity machine-learning naive-bayes nlp question-generation question-generator questions-and-answers quiz spacy spacy-nlp word-embeddings
Last synced: 02 Aug 2024
https://github.com/kristiyanvachev/question-generation
Generating multiple choice questions from text using Machine Learning.
ai cosine-similarity machine-learning naive-bayes nlp question-generation question-generator questions-and-answers quiz spacy spacy-nlp word-embeddings
Last synced: 30 Oct 2024
https://github.com/nlpatvcu/medacy
🏥 Medical Text Mining and Information Extraction with spaCy
clinical-text-processing information-extraction machine-learning medical-natural-language-processing medical-text-mining metamap natural-language-processing spacy
Last synced: 14 Oct 2024
https://github.com/erre-quadro/spikex
SpikeX - SpaCy Pipes for Knowledge Extraction
abbreviations-detection acronym-recognition clustering entity-linking named-entity-recognition nlp noun-phrase-extract sentence-splitting spacy spacy-pipes verb-phrase-extract wikigraph wikipedia wikipedia-graph
Last synced: 14 Oct 2024
https://github.com/r1j1t/contextualspellcheck
✔️ Contextual word checker for better suggestions
bert chatbot help-wanted natural-language-processing nlp oov preprocessing python python-spelling-corrector spacy spacy-extension spellcheck spellchecker spelling-correction spelling-corrections
Last synced: 14 Oct 2024
https://github.com/tomaarsen/spanmarkerner
SpanMarker for Named Entity Recognition
huggingface ner nlp spacy spacy-extension transformers
Last synced: 14 Oct 2024
https://github.com/msg-systems/holmes-extractor
Information extraction from English and German texts based on predicate logic
information-extraction machine-learning nlp ontology python semantics spacy spacy-extension
Last synced: 31 Oct 2024
https://github.com/xxyzz/worddumb
A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.
anki books calibre calibre-plugin dictionary ebook epub kindle language-learning python spacy wikidata wikipedia wiktionary
Last synced: 14 Oct 2024
https://github.com/ironman5366/W.I.L.L
A personal assistant written in Python
ai assistant json-api pa personal-assistant plugins python spacy telegram
Last synced: 07 Aug 2024
https://github.com/xxyzz/WordDumb
A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.
anki books calibre calibre-plugin dictionary ebook epub kindle language-learning python spacy wikidata wikipedia wiktionary
Last synced: 09 Oct 2024
https://github.com/ironman5366/w.i.l.l
A personal assistant written in Python
ai assistant json-api pa personal-assistant plugins python spacy telegram
Last synced: 31 Oct 2024
https://github.com/5hirish/adam_qas
ADAM - A Question Answering System. Inspired by IBM Watson
adam elasticsearch gensim natural-language-processing pandas python question-answering scikit-learn spacy spacy-extension wikipedia
Last synced: 25 Sep 2024
https://github.com/ibm/zshot
Zero and Few shot named entity & relationships recognition
ai deep-learning few-shot few-shot-learning machine-learning named-entity-recognition natural-language-processing natural-language-understanding ned ner nlp nlp-library pytorch relation-extraction relationship-extraction spacy transformer zero-shot zero-shot-learning
Last synced: 14 Oct 2024
https://github.com/explosion/displacy
💥 displaCy.js: An open-source NLP visualiser for the modern web
css javascript natural-language-processing nlp spacy svg visualization
Last synced: 25 Sep 2024
https://github.com/PKSHATechnology-Research/camphr
Camphr - NLP library for creating pipeline components
Last synced: 01 Aug 2024
https://github.com/pkshatechnology-research/camphr
Camphr - NLP library for creating pipeline components
Last synced: 14 Oct 2024
https://github.com/hlasse/textdescriptives
A Python library for calculating a large variety of metrics from text
dependency-distance descriptive-statistics nlp python readability readability-scores spacy spacy-extension statistics syntactic-analysis
Last synced: 14 Oct 2024
https://github.com/HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text
dependency-distance descriptive-statistics nlp python readability readability-scores spacy spacy-extension statistics syntactic-analysis
Last synced: 04 Aug 2024
https://github.com/explosion/spacy-notebooks
💫 Jupyter notebooks for spaCy examples and tutorials
jupyter jupyter-notebook spacy tutorials
Last synced: 25 Sep 2024
https://github.com/explosion/floret
🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
fasttext fasttext-embeddings spacy subword-embeddings word-embeddings word-vectors
Last synced: 30 Sep 2024
https://github.com/phantominsights/summarizer
A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.
nlp praw python3 reddit-bot spacy web-scraper wordcloud
Last synced: 31 Oct 2024
https://github.com/jenojp/negspacy
spaCy pipeline object for negating concepts in text
negation negation-phrases negex nlp python spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/jgontrum/spacy-api-docker
spaCy REST API, wrapped in a Docker container.
docker microservice natural-language-processing parsing restful-api spacy
Last synced: 31 Oct 2024
https://github.com/bjascob/lemminflect
A python module for English lemmatization and inflection.
inflection lemmatization nlp nlp-machine-learning python spacy spacy-extensions
Last synced: 14 Oct 2024
https://github.com/quanteda/spacyr
R wrapper to spaCy NLP
extract-entities nlp r spacy speech-tagging
Last synced: 14 Oct 2024
https://github.com/argilla-io/spacy-wordnet
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
Last synced: 14 Oct 2024
https://github.com/gandersen101/spaczz
Fuzzy matching and more functionality for spaCy.
ai artificial-intelligence data-science fuzzy-matching natural-language-processing nlp nlp-library python rapidfuzz regex spacy spacy-extension spacy-extensions
Last synced: 14 Oct 2024
https://github.com/davidberenstein1957/concise-concepts
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
few-shot-classifcation gensim hacktoberfest machine-learning natural-language-processing ner nlp spacy
Last synced: 01 Nov 2024
https://github.com/mpuig/spacy-lookup
Named Entity Recognition based on dictionaries
named-entity-recognition natural-language-processing ner nlp spacy spacy-extension spacy-pipeline
Last synced: 31 Oct 2024
https://github.com/explosion/spacy-services
💫 REST microservices for various spaCy-related tasks
falcon natural-language-processing nlp rest-api rest-microservice spacy
Last synced: 25 Sep 2024
https://github.com/digiteinfotech/kairon
Conversational AI Platform to build effective Proactive Digital Assistants using Visual LLM Chaining
bot bot-framework botkit bots chatbot chatbot-framework chatbots conversational-agents conversational-ai conversational-bots gpt-3-5-turbo llm machine-learning machine-learning-library natural-language-understanding nlp nlu rasa rasa-nlu spacy
Last synced: 14 Oct 2024
https://github.com/bjascob/amrlib
A python library that makes AMR parsing, generation and visualization simple.
abstract-meaning-representation amr amr-graphs amr-parser amr-parsing neural-network python pytorch spacy spacy-extension text-generation transformer
Last synced: 14 Oct 2024
https://github.com/mmxgn/spacy-clausie
Implementation of the ClausIE information extraction system for python+spacy
clausie information-extraction nlp problog python-spacy spacy
Last synced: 30 Sep 2024
https://github.com/jaidevd/numerizer
A Python module to convert natural language numerics into ints and floats.
information-extraction nlp regular-expressions spacy spacy-extension
Last synced: 14 Oct 2024
https://github.com/statsmaths/cleanNLP
R package providing annotators and a normalized data model for natural language processing
corenlp natural-language-processing r-package spacy
Last synced: 25 Oct 2024
https://github.com/statsmaths/cleannlp
R package providing annotators and a normalized data model for natural language processing
corenlp natural-language-processing r-package spacy
Last synced: 14 Oct 2024
https://github.com/janlukasschroeder/nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spaCy, LexNLP, NLTK, tokenization, stemming, sentence detection, named entity recognition
cheat-sheet dependency-parsing introduction lemmatization lexnlp machine-learning named-entity-recognition nlp nltk pos-tagging python sentence-similarity spacy spacy-nlp spans starter-kit tokenization
Last synced: 14 Oct 2024
https://github.com/davidberenstein1957/classy-classification
This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classification with Huggingface.
few-shot-classifcation hacktoberfest machine-learning natural-language-processing nlp nlu sentence-transformers spacy text-classification
Last synced: 14 Oct 2024
https://github.com/explosion/displacy-ent
💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
css javascript named-entities natural-language-processing nlp spacy visualization
Last synced: 25 Sep 2024
https://github.com/explosion/jupyterlab-prodigy
🧬 A JupyterLab extension for annotating data with Prodigy
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science jupyter jupyterlab labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/ines/spacy-js
🎀 JavaScript API for spaCy with Python REST API
javascript natural-language-processing nlp python rest-api spacy
Last synced: 30 Oct 2024
https://github.com/d5555/tageditor
🏖 TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 14 Oct 2024
https://github.com/d5555/TagEditor
🏖 TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 03 Aug 2024
https://github.com/explosion/spacymoji
💙 Emoji handling and meta data for spaCy with custom extension attributes
emoji emoji-unicode emojis natural-language-processing nlp spacy spacy-extension spacy-pipeline
Last synced: 07 Oct 2024
https://github.com/maxent-ai/converse
Conversational text Analysis using various NLP techniques
callcenter-analysis conversational-ai emotion-recognition huggingface machine-learning nlp nlu pytorch scikit-learn sentiment-analysis spacy speech-to-text text text-mining topic-modeling transformers
Last synced: 01 Nov 2024
https://github.com/martinomensio/spacy-universal-sentence-encoder
Google USE (Universal Sentence Encoder) for spaCy
models nlp spacy tensorflow-hub use
Last synced: 30 Oct 2024
https://github.com/explosion/wheelwright
🎡 Automated build repo for Python wheels and source packages
azure-pipelines linux macos manylinux manylinux1 multibuild python spacy wheels windows
Last synced: 07 Oct 2024
https://github.com/Dadmatech/DadmaTools
DadmaTools is a Persian NLP toolkit developed by Dadmatech Co.
chunker constituency-parser dataset-loader dependency-parser embedding-vectors embeddings lemmatizer natural-language-processing ner nlptoolkit persian persian-nlp postagger spacy tokenizer
Last synced: 04 Aug 2024
https://github.com/microsoft/presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
deep-learning flair machine-learning named-entity-recognition natural-language-processing ner nlp pii privacy spacy transformers
Last synced: 07 Oct 2024
https://github.com/dadmatech/dadmatools
DadmaTools is a Persian NLP toolkit developed by Dadmatech Co.
chunker constituency-parser dataset-loader dependency-parser embedding-vectors embeddings lemmatizer natural-language-processing ner nlptoolkit persian persian-nlp postagger spacy tokenizer
Last synced: 14 Oct 2024
https://github.com/takelab/spacy-udpipe
spaCy + UDPipe
natural-language-processing nlp nlp-library python spacy udpipe universal-dependencies wrapper-library
Last synced: 31 Oct 2024
https://github.com/chambliss/multilingual_ner
Applying BERT to named entity recognition in English and Russian.
bert english-language named-entity-recognition nlp-machine-learning pytorch russian-language spacy
Last synced: 14 Oct 2024
https://github.com/huspacy/huspacy
HuSpaCy: industrial-strength Hungarian natural language processing
dependency-parsing hungarian hunlp huspacy information-extraction lemmatization machine-learning morphological-analysis named-entity-recognition natural-language-processing ner nlp pos-tagger python spacy spacy-models spacy-pipeline text-mining universal-dependencies
Last synced: 30 Oct 2024
https://github.com/rominf/profanity-filter
A Python library for detecting and filtering profanity
english-profanity filter filter-profanity filtering language lib library profanity profanity-detection profanity-filter profanityfilter python python3 russian-profanity spacy spacy-extension spacy-nlp
Last synced: 25 Sep 2024
https://github.com/anasaito/skillner
A (smart) rule based NLP module to extract job skills from text
ner nlp python rule-based skillner skills spacy
Last synced: 30 Oct 2024
https://github.com/lucaterre/spacyfishing
A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
entity-disambiguation entity-linking natural-language-processing nlp python3 spacy spacy-extension spacy-extensions wikidata
Last synced: 31 Oct 2024