Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2024-12-27 00:23:22 UTC
- JSON Representation
https://github.com/anasaito/skillner
A (smart) rule based NLP module to extract job skills from text
ner nlp python rule-based skillner skills spacy
Last synced: 22 Dec 2024
https://github.com/alisonmitchell/stock-prediction
Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.
beautifulsoup bert gensim huggingface keras-tensorflow machine-learning matplotlib mplfinance nlp nltk numpy pandas plotly python scikit-learn scipy seaborn spacy textblob yfinance
Last synced: 19 Dec 2024
https://github.com/rominf/profanity-filter
A Python library for detecting and filtering profanity
english-profanity filter filter-profanity filtering language lib library profanity profanity-detection profanity-filter profanityfilter python python3 russian-profanity spacy spacy-extension spacy-nlp
Last synced: 25 Sep 2024
https://github.com/lucaterre/spacyfishing
A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
entity-disambiguation entity-linking natural-language-processing nlp python3 spacy spacy-extension spacy-extensions wikidata
Last synced: 24 Dec 2024
https://github.com/kennethenevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
augmentation natural-language-processing nlp nlproc python spacy spacy-extension spacy-nlp text-augmentation text-classification training-data
Last synced: 22 Dec 2024
https://github.com/richardpaulhudson/holmes-extractor
Information extraction from English and German texts based on predicate logic
information-extraction machine-learning natural-language-processing nlp ontology python semantics spacy spacy-extension text-classification
Last synced: 25 Dec 2024
https://github.com/thepanacealab/smmt
Social Media Mining Toolkit (SMMT) main repository
annotation data-acquisition data-annotation data-preprocessing gathering spacy tweets twitter-api
Last synced: 19 Dec 2024
https://github.com/dipanjans/nlp_workshop_odsc_europe20
Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and Topic Models.
deep-learning gensim jupyter-notebook machine-learning natural-language-processing nltk python pytorch scikit-learn spacy tensorflow transfer-learning transformers
Last synced: 14 Oct 2024
https://github.com/explosion/spacy-dev-resources
💫 Scripts, tools and resources for developing spaCy
natural-language-processing nlp python spacy
Last synced: 25 Sep 2024
https://github.com/eellak/nlpbuddy
A text analysis application for performing common NLP tasks through a web dashboard interface and an API
fasttext gensim natural-language-processing spacy text-analysis text-classification
Last synced: 14 Oct 2024
https://github.com/norskregnesentral/weak-supervision-for-ner
Framework to learn Named Entity Recognition models without labelled data using weak supervision.
domain-adaptation hidden-markov-models named-entity-recognition natural-language-processing nlp python spacy weak-supervision
Last synced: 25 Sep 2024
https://github.com/brucewlee/lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
discourse feature-extraction flesch-kincaid lexical-analysis linguistic-analysis natural-language-processing nlp readability-metrics readability-scores semantic-analysis spacy syntactic-analysis text-classification text-simplification
Last synced: 14 Oct 2024
https://github.com/tiesdekok/python_nlp_tutorial
This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
computational-linguistics natural-language-processing nlp nltk python research spacy text-mining textblob textual-analysis
Last synced: 14 Oct 2024
https://github.com/aphp/edsnlp
Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.
clinical-data-warehouse deep-learning fast french medical multi-task nlp pytorch rule-based spacy text-mining
Last synced: 21 Dec 2024
https://github.com/brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
bea-workshop feature-extraction handcrafted-features linguistic-features natural-language-processing python readability-scores reading-time spacy text-analysis word-difficulty
Last synced: 24 Dec 2024
https://github.com/kennethenevoldsen/asent
Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.
interpretability natural-language-processing nlp python3 sentiment-analysis spacy spacy-extensions
Last synced: 23 Dec 2024
https://github.com/koichiyasuoka/deplacy
CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis
dependency-visualizer nlp-cube spacy stanza
Last synced: 25 Dec 2024
https://github.com/martinomensio/spacy-sentence-bert
Sentence transformers models for SpaCy
bert models nlp sentence-bert sentence-transformers spacy
Last synced: 19 Dec 2024
https://github.com/martinomensio/spacy-dbpedia-spotlight
A spaCy wrapper for DBpedia Spotlight
dbpedia-spotlight hacktoberfest natural-language-processing nlp spacy
Last synced: 26 Dec 2024
https://github.com/kororo/excelcy
Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.
entity excel nlp python python3 spacy spacy-extensions spacy-nlp spacy-pipeline training xlsx
Last synced: 19 Dec 2024
https://github.com/davidberenstein1957/crosslingual-coreference
A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.
coreference coreference-resolution hacktoberfest natural-language-processing nlp python spacy
Last synced: 13 Dec 2024
https://github.com/baderlab/saber
Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress. Many things are broken, and the codebase is not stable.
bioinformatics biomedical-named-entity-recognition biomedical-text-mining deep-learning information-extraction machine-learning spacy
Last synced: 14 Oct 2024
https://github.com/yash1994/dframcy
Dataframe Integration with spaCy.
dataframe pandas-dataframe python3 spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/d99kris/spacy-cpp
C++ wrapper library for the NLP library spaCy
c-plus-plus linux nlp nlp-libraries spacy
Last synced: 14 Oct 2024
https://github.com/explosion/spacy-lookups-data
📂 Additional lookup tables and data resources for spaCy
lemmatization machine-learning natural-language-processing nlp spacy
Last synced: 21 Dec 2024
https://github.com/agile-ts/agile
🌌 Global State and Logic Library for JavaScript/Typescript applications
agiel-ts agile api functional-reactive-programming global-state javascript modular react-global-state react-native react-state-management reactive reactjs redux-alternative replace-redux simple spacy state state-management typescript vue
Last synced: 01 Nov 2024
https://github.com/explosion/spacy-experimental
🧪 Cutting-edge experimental spaCy components and features
lemmatizer machine-learning natural-language-processing nlp spacy spacy-extension spacy-pipeline tokenizer
Last synced: 23 Dec 2024
https://github.com/eellak/gsoc2018-spacy
[GSOC] Greek language support for spacy.io python NLP software
greek greek-language gsoc-2018 lemmatization natural-language-processing python spacy
Last synced: 08 Nov 2024
https://github.com/explosion/talks
💥 Browser-based slides or PDFs of our talks and presentations
presentations slides spacy talks
Last synced: 25 Sep 2024
https://github.com/tokestermw/spacy_hunspell
:pencil2: Hunspell extension for spaCy 2.0.
hunspell hunspell-extension nlp spacy spacy-extension spell-check spellchecker spelling spelling-correction
Last synced: 21 Dec 2024
https://github.com/mratsim/arch-data-science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
archlinux cuda cudnn data-science deep-learning lightgbm machine-learning mkl mxnet natural-language-processing natural-language-understanding nervana opencv package pandas pytorch scikit-learn spacy tensorflow xgboost
Last synced: 09 Nov 2024
https://github.com/abhijit-2592/spacy-langdetect
A fully customisable language detection pipeline for spaCy
googletrans langdetect language-detection pycld2 spacy spacy-extension
Last synced: 27 Dec 2024
https://github.com/mratsim/Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
archlinux cuda cudnn data-science deep-learning lightgbm machine-learning mkl mxnet natural-language-processing natural-language-understanding nervana opencv package pandas pytorch scikit-learn spacy tensorflow xgboost
Last synced: 27 Nov 2024
https://github.com/ub-mannheim/spacyopentapioca
A spaCy wrapper of OpenTapioca for named entity linking on Wikidata
entity-linking named-entity-linking spacy spacy-extensions spacy-pipeline wikidata
Last synced: 19 Dec 2024
https://github.com/bjascob/pyinflect
A python module for word inflections designed for use with spaCy.
inflection nlp python spacy spacy-extension
Last synced: 19 Dec 2024
https://github.com/explosion/thinc-apple-ops
🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library
Last synced: 20 Dec 2024
https://github.com/centre-for-humanities-computing/dacy
DaCy: The State of the Art Danish NLP pipeline using SpaCy
danish-language natural-language-processing reproducible-workflows spacy
Last synced: 24 Dec 2024
https://github.com/ELS-RD/anonymisation
Anonymization of legal cases (Fr) based on Flair embeddings
anonymization bert camembert dataset-augmentation entities flair legal legal-cases ner pseudo-anonymization spacy transformers
Last synced: 19 Nov 2024
https://github.com/explosion/healthsea
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
healthcare machine-learning named-entity-recognition natural-language-processing pipeline spacy text-classification
Last synced: 07 Oct 2024
https://github.com/ines/spacy-graphql
🤹♀️ Query spaCy's linguistic annotations using GraphQL
flask flask-api graphql natural-language-processing nlp python spacy
Last synced: 10 Dec 2024
https://github.com/bramvanroy/spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
conll conll-u data-science machine-learning natural-language-processing nlp pandas parser python spacy spacy-extension spacy-pipeline stanford-machine-learning stanford-nlp stanza udpipe
Last synced: 20 Dec 2024
https://github.com/sorenlind/lemmy
🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪
danish lemma lemmatizer nlp spacy swedish
Last synced: 12 Oct 2024
https://github.com/rokbenko/ai-playground
Code from tutorials presented on the "Code AI with Rok" YouTube channel
fetchai gemini-api llama-index milvus openai-api spacy
Last synced: 12 Nov 2024
https://github.com/ahalterman/mordecai3
Full text geoparsing/toponym resolution with event geolocation
geolocation geoparsing spacy toponym-resolution
Last synced: 22 Dec 2024
https://github.com/ahmedbesbes/anonymization-api
How to build and deploy an anonymization API with FastAPI and SpaCy
aws docker docker-compose ec2 fastapi github-actions named-entity-recognition ner nlp python spacy
Last synced: 23 Nov 2024
https://github.com/cuinjune/text2video
A software tool that converts text to video for more engaging learning experience
aws-polly ffmpeg-server flask learning nlp-keywords-extraction pixabay-api polly-tts spacy text text-to-image text-to-speech text-to-video text-tools text2video video
Last synced: 25 Nov 2024
https://github.com/opennyai/opennyai
Opennyai : An efficient NLP Pipeline for Indian Legal documents
indian-laws indian-legal-judgements legalnlp machine-learning natural-language-processing nlp python spacy
Last synced: 19 Dec 2024
https://github.com/deneutoy/spacy-vis
A visualisation tool for Spacy using Hierplane.
dependency-parsing nlp spacy visualization
Last synced: 01 Nov 2024
https://github.com/sammous/spacy-lefff
Custom French POS and lemmatizer based on Lefff for spacy
dataesr eig-2018 entrepreneur-interet-general french french-pos lemmatizer nlp pos-tagging python spacy spacy-extensions
Last synced: 19 Dec 2024
https://github.com/zaibacu/rita-dsl
A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format
dsl language natural-language-processing nlp parsing python regex rule-based spacy
Last synced: 12 Oct 2024
https://github.com/yohasebe/ruby-spacy
A wrapper module for using spaCy natural language processing library from the Ruby programming language via PyCall
gpt natural-language nlp openai parsing ruby spacy word-embeddings
Last synced: 21 Dec 2024
https://github.com/liuzl/ling
Natural Language Processing Toolkit in Golang
corenlp lemmatization nlp normalization opencc spacy tokenization
Last synced: 12 Oct 2024
https://github.com/wjbmattingly/spacyex
SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.
Last synced: 31 Oct 2024
https://github.com/samedwardes/spacytextblob
A TextBlob sentiment analysis pipeline component for spaCy.
natural-language-processing nlp python spacy
Last synced: 21 Dec 2024
https://github.com/argilla-io/adept-augmentations
A Python library aimed at dissecting and augmenting NER training data.
dataset datasets few-shot-learning machine-learning natural-language-processing nlp spacy
Last synced: 18 Oct 2024
https://github.com/thomasthiebaud/spacy-fastlang
Language detection using Spacy and Fasttext
fasttext fasttext-python language-detection spacy spacy-extensions
Last synced: 19 Dec 2024
https://github.com/SamEdwardes/spacytextblob
A TextBlob sentiment analysis pipeline component for spaCy.
natural-language-processing nlp python spacy
Last synced: 20 Nov 2024
https://github.com/explosion/spacy-ray
☄️ Parallel and distributed training with spaCy and Ray
distributed-computing machine-learning natural-language-processing parallel-training ray spacy training
Last synced: 07 Oct 2024
https://github.com/ahalterman/multiuser_prodigy
Running Prodigy for a team of annotators
Last synced: 08 Nov 2024
https://github.com/paulrinckens/timexy
A spaCy custom component that extracts and normalizes temporal expressions
date-parser datetime natural-language-processing nlp python spacy spacy-extension timeml timex3
Last synced: 14 Oct 2024
https://github.com/jenojp/extractacy
Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)
entity-extraction entity-linking ner nlp pattern-matching spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/d5555/neuralgym
🚀GUI for training spaCy models
neural-networks nlp-machine-learning spacy spacy-gui spacy-models training-neural-net
Last synced: 09 Nov 2024
https://github.com/ljvmiranda921/calamancy
NLP pipelines for Tagalog using spaCy
computational-linguistics low-resource-languages low-resource-nlp machine-learning natural-language-processing ner nlp spacy
Last synced: 24 Dec 2024
https://github.com/kootenpv/spacy_api
Server/Client around Spacy to load spacy only once
api machine-learning nlp spacy
Last synced: 14 Oct 2024
https://github.com/kennethenevoldsen/spacy-wrap
spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to include existing fine-tuned models within your SpaCy workflow.
deep-learning huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch spacy spacy-extension spacy-extensions spacy-models spacy-nlp spacy-pipeline spacy-transformers text-classification transformers
Last synced: 12 Oct 2024
https://github.com/explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
datasets machine-learning machine-learning-datasets spacy testing thinc
Last synced: 07 Oct 2024
https://github.com/machinelearningzh/simply-simplify-language
Use machine learning to make your institutional communication more understandable and inclusive.
anthropic einfachesprache leichtesprache llm llms mistral mistralai natural-language-processing nlp openai plainlanguage python spacy streamlit
Last synced: 19 Dec 2024
https://github.com/explosion/assets
💥 Explosion Assets
machine-learning nlp spacy spacy-nlp
Last synced: 22 Dec 2024
https://github.com/explosion/spacy-huggingface-hub
🤗 Push your spaCy pipelines to the Hugging Face Hub
huggingface machine-learning ml-models models natural-language-processing nlp spacy
Last synced: 07 Oct 2024
https://github.com/tokestermw/spacy_grammar
:black_nib: Language Tool style grammar handling with spaCy 2.0
Last synced: 07 Nov 2024
https://github.com/ecohealthalliance/epitator
EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and EIDR Connect.
disease-surveillance epidemiology geonames nlp spacy toponym-resolution
Last synced: 14 Oct 2024
https://github.com/aajanki/spacy-fi
Experimental Finnish language model for SpaCy
finnish-language-analysis spacy spacy-models
Last synced: 25 Dec 2024
https://github.com/tlkh/m1-cpu-benchmarks
accelerate benchmark cpu m1 m1-max numpy pandas python spacy
Last synced: 07 Nov 2024
https://github.com/liebeck/spacy-sentiws
German sentiment scores with SentiWS as extension for spaCy
nlp spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/adirthaborgohain/ner-re
A Named Entity Recognition + Entity Linker + Relation Extraction Pipeline built using spacy v3. Given a text, the pipeline will extract entities from the text as trained and will disambiguate the entities to its normalized form through an Entity Linker connected to a Knowledge Base and will assign a relation between the entities, if any.
named-entity-recognition nlp relation-extraction spacy transformers
Last synced: 09 Nov 2024
https://github.com/cyclecycle/spacy-pattern-builder
Reverse engineer patterns for use with SpaCy's DependencyMatcher
Last synced: 10 Oct 2024
https://github.com/writer/replacy
spaCy match and replace, maintaining conjugation
Last synced: 01 Nov 2024
https://github.com/ucrel/pymusas
Python Multilingual Ucrel Semantic Analysis System
natural-language-processing nlp python spacy spacy-pipeline
Last synced: 24 Dec 2024
https://github.com/talmago/spacy_ke
Keyword extraction with spaCy
keyword-extraction keyword-extractor positionrank spacy spacy-extension spacy-nlp spacy-pipeline textrank topicrank yake
Last synced: 14 Oct 2024
https://github.com/dadosabertosdefeira/tomba
Identifique endereços, bairros e outras localizações brasileiras em um texto 🏘
brasil hacktoberfest nlp spacy
Last synced: 12 Oct 2024
https://github.com/explosion/vscode-prodigy
🧬 A VS Code extension for annotating data with Prodigy
annotation-tool data-annotation data-labeling data-labeling-tools data-science labeling-tool nlp prodigy spacy vscode vscode-extension
Last synced: 07 Oct 2024
https://github.com/sarthakjshetty/pyresearchinsights
End-to-end NLP tool to analyze research publications. Published in Ecology & Evolution 2021.
gensim natural-language-processing nlp python scientific-analysis spacy text-mining
Last synced: 26 Dec 2024
https://github.com/dpalmasan/trunajod2.0
An easy-to-use library to extract indices from texts.
coherence cohesion entity-graph lexical-diversity natural-language-processing readability-metrics semantic-measurements spacy spacy-extensions text-analysis text-mining text-processing ttr type-token-ratio
Last synced: 12 Oct 2024
https://github.com/fractalego/pynsett
A programmable relation extraction tool
extract-relationships nlp relation-extraction spacy wikidata-knowledge
Last synced: 12 Oct 2024
https://github.com/fredriko/bert-tensorflow-pytorch-spacy-conversion
Instructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through uses DeepPavlov's RuBERT as example.
bert bert-model how-to keras nlp pytorch-transformers spacy spacy-models spacy-nlp spacy-package spacy-pytorch-transformers tensorflow
Last synced: 27 Nov 2024
https://github.com/autonomio/autonomio
Core functionality for the Autonomio augmented intelligence workbench.
artificial-intelligence datascience deep-learning hyperscan keras lstm machine-learning mlp neural-network spacy tensorflow wrangling
Last synced: 06 Nov 2024
https://github.com/vasisouv/tweets-preprocessor
Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
nltk preprocessing python spacy spacy-nlp twitter
Last synced: 15 Dec 2024
https://github.com/seannaren/cord-19-ann
ANN Search through the COVID CORD-19 Dataset using SBERT.
cord-19 covid-19 machine-learning pytorch scispacy spacy transformer
Last synced: 29 Oct 2024
https://github.com/tlack/hairytext
A data labeling and NLP tool for Elixir (uses Spacy)
elixir entity-recognition nlp nlp-machine-learning phoenix-live-view spacy text-classification
Last synced: 28 Oct 2024
https://github.com/ahmedbesbes/anonymizer
Text Anonymization app with Streamlit and Spacy
heroku nlp nlproc spacy st-annotated-text streamlit streamlit-component
Last synced: 23 Nov 2024
https://github.com/liebeck/spacy-iwnlp
German lemmatization with IWNLP as extension for spaCy
nlp spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/iclrandd/case2vec
A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated Council of Law Reporting for England & Wales (https://www.iclr.co.uk).
caselaw gensim-word2vec natural-language-processing sense2vec spacy word2vec
Last synced: 29 Oct 2024
https://github.com/ryandsilva/medical-ner
:hospital: Clinical NER with UMLS lookup :hospital:
flask flutter medical-natural-language-processing optical-character-recognition python scispacy spacy umls
Last synced: 08 Nov 2024
https://github.com/doccano/spacy-partial-tagger
A simple library for training named entity recognition model from partially annotated data
named-entity-recognition natural-language-processing nlp spacy weak weak-supervision weakly-supervised-learning
Last synced: 10 Oct 2024
https://github.com/explosion/spacy-benchmarks
💫 Runtime performance comparison of spaCy against other NLP libraries
benchmarking benchmarks natural-language-processing nlp spacy
Last synced: 25 Sep 2024
https://github.com/oroszgy/hungarian-text-mining-workshop
Materials for the Text Mining workshop held in the HuNLP meetup, June 2017
classification hungarian information-extraction keyword-extraction machine-learning meetup natural-language-processing nlp python scikit-learn sentiment-analysis spacy spacy-models text-mining text-mining-workshop textacy tutorial workshop
Last synced: 08 Nov 2024
https://github.com/bramvanroy/astred
An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.
alignment linguistics nlp parallel-corpus parsing spacy stanza translation
Last synced: 14 Oct 2024
https://github.com/hamishivi/lec2flash
Generating flashcards from lecture notes
flask neo4j python3 question-generator spacy
Last synced: 28 Nov 2024