Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/anasaito/skillner

A (smart) rule based NLP module to extract job skills from text

ner nlp python rule-based skillner skills spacy

Last synced: 22 Dec 2024

https://github.com/alisonmitchell/stock-prediction

Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.

beautifulsoup bert gensim huggingface keras-tensorflow machine-learning matplotlib mplfinance nlp nltk numpy pandas plotly python scikit-learn scipy seaborn spacy textblob yfinance

Last synced: 19 Dec 2024

https://github.com/lucaterre/spacyfishing

A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata

entity-disambiguation entity-linking natural-language-processing nlp python3 spacy spacy-extension spacy-extensions wikidata

Last synced: 24 Dec 2024

https://github.com/dipanjans/nlp_workshop_odsc_europe20

Extensive tutorials for the Advanced NLP Workshop at the Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular NLP tasks, including NER, Classification, Recommendation / Information Retrieval, Summarization, Language Translation, Q&A and Topic Models.

deep-learning gensim jupyter-notebook machine-learning natural-language-processing nltk python pytorch scikit-learn spacy tensorflow transfer-learning transformers

Last synced: 14 Oct 2024

https://github.com/explosion/spacy-dev-resources

💫 Scripts, tools and resources for developing spaCy

natural-language-processing nlp python spacy

Last synced: 25 Sep 2024

https://github.com/eellak/nlpbuddy

A text analysis application for performing common NLP tasks through a web dashboard interface and an API

fasttext gensim natural-language-processing spacy text-analysis text-classification

Last synced: 14 Oct 2024

https://github.com/norskregnesentral/weak-supervision-for-ner

Framework to learn Named Entity Recognition models without labelled data using weak supervision.

domain-adaptation hidden-markov-models named-entity-recognition natural-language-processing nlp python spacy weak-supervision

Last synced: 25 Sep 2024

https://github.com/tiesdekok/python_nlp_tutorial

This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)

computational-linguistics natural-language-processing nlp nltk python research spacy text-mining textblob textual-analysis

Last synced: 14 Oct 2024

https://github.com/aphp/edsnlp

Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.

clinical-data-warehouse deep-learning fast french medical multi-task nlp pytorch rule-based spacy text-mining

Last synced: 21 Dec 2024

https://github.com/brucewlee/lftk

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.

bea-workshop feature-extraction handcrafted-features linguistic-features natural-language-processing python readability-scores reading-time spacy text-analysis word-difficulty

Last synced: 24 Dec 2024

https://github.com/kennethenevoldsen/asent

Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.

interpretability natural-language-processing nlp python3 sentiment-analysis spacy spacy-extensions

Last synced: 23 Dec 2024

https://github.com/koichiyasuoka/deplacy

CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis

dependency-visualizer nlp-cube spacy stanza

Last synced: 25 Dec 2024

https://github.com/martinomensio/spacy-sentence-bert

Sentence transformers models for SpaCy

bert models nlp sentence-bert sentence-transformers spacy

Last synced: 19 Dec 2024

https://github.com/kororo/excelcy

Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.

entity excel nlp python python3 spacy spacy-extensions spacy-nlp spacy-pipeline training xlsx

Last synced: 19 Dec 2024

https://github.com/davidberenstein1957/crosslingual-coreference

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

coreference coreference-resolution hacktoberfest natural-language-processing nlp python spacy

Last synced: 13 Dec 2024

https://github.com/baderlab/saber

Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress. Many things are broken, and the codebase is not stable.

bioinformatics biomedical-named-entity-recognition biomedical-text-mining deep-learning information-extraction machine-learning spacy

Last synced: 14 Oct 2024

https://github.com/d99kris/spacy-cpp

C++ wrapper library for the NLP library spaCy

c-plus-plus linux nlp nlp-libraries spacy

Last synced: 14 Oct 2024

https://github.com/explosion/spacy-lookups-data

📂 Additional lookup tables and data resources for spaCy

lemmatization machine-learning natural-language-processing nlp spacy

Last synced: 21 Dec 2024

https://github.com/eellak/gsoc2018-spacy

[GSOC] Greek language support for spacy.io python NLP software

greek greek-language gsoc-2018 lemmatization natural-language-processing python spacy

Last synced: 08 Nov 2024

https://github.com/explosion/talks

💥 Browser-based slides or PDFs of our talks and presentations

presentations slides spacy talks

Last synced: 25 Sep 2024

https://github.com/o19s/hello-nlp

A natural language search microservice

elasticsearch nlp solr spacy

Last synced: 20 Nov 2024

https://github.com/abhijit-2592/spacy-langdetect

A fully customisable language detection pipeline for spaCy

googletrans langdetect language-detection pycld2 spacy spacy-extension

Last synced: 19 Dec 2024

https://github.com/ub-mannheim/spacyopentapioca

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

entity-linking named-entity-linking spacy spacy-extensions spacy-pipeline wikidata

Last synced: 19 Dec 2024

https://github.com/bjascob/pyinflect

A python module for word inflections designed for use with spaCy.

inflection nlp python spacy spacy-extension

Last synced: 19 Dec 2024

https://github.com/explosion/thinc-apple-ops

🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library

apple spacy thinc

Last synced: 20 Dec 2024

https://github.com/centre-for-humanities-computing/dacy

DaCy: The State of the Art Danish NLP pipeline using SpaCy

danish-language natural-language-processing reproducible-workflows spacy

Last synced: 24 Dec 2024

https://github.com/julesbelveze/concepcy

💫 SpaCy wrapper for ConceptNet 💫

conceptnet nlp spacy

Last synced: 14 Oct 2024

https://github.com/ines/spacy-graphql

🤹‍♀️ Query spaCy's linguistic annotations using GraphQL

flask flask-api graphql natural-language-processing nlp python spacy

Last synced: 10 Dec 2024

https://github.com/explosion/healthsea

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.

healthcare machine-learning named-entity-recognition natural-language-processing pipeline spacy text-classification

Last synced: 07 Oct 2024

https://github.com/bramvanroy/spacy_conll

Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.

conll conll-u data-science machine-learning natural-language-processing nlp pandas parser python spacy spacy-extension spacy-pipeline stanford-machine-learning stanford-nlp stanza udpipe

Last synced: 20 Dec 2024

https://github.com/sorenlind/lemmy

🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪

danish lemma lemmatizer nlp spacy swedish

Last synced: 12 Oct 2024

https://github.com/rokbenko/ai-playground

Code from tutorials presented on the "Code AI with Rok" YouTube channel

fetchai gemini-api llama-index milvus openai-api spacy

Last synced: 12 Nov 2024

https://github.com/ahalterman/mordecai3

Full text geoparsing/toponym resolution with event geolocation

geolocation geoparsing spacy toponym-resolution

Last synced: 22 Dec 2024

https://github.com/ahmedbesbes/anonymization-api

How to build and deploy an anonymization API with FastAPI and SpaCy

aws docker docker-compose ec2 fastapi github-actions named-entity-recognition ner nlp python spacy

Last synced: 23 Nov 2024

https://github.com/opennyai/opennyai

Opennyai: An efficient NLP Pipeline for Indian Legal documents

indian-laws indian-legal-judgements legalnlp machine-learning natural-language-processing nlp python spacy

Last synced: 19 Dec 2024

https://github.com/deneutoy/spacy-vis

A visualisation tool for Spacy using Hierplane.

dependency-parsing nlp spacy visualization

Last synced: 01 Nov 2024

https://github.com/zaibacu/rita-dsl

A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format

dsl language natural-language-processing nlp parsing python regex rule-based spacy

Last synced: 12 Oct 2024

https://github.com/yohasebe/ruby-spacy

A wrapper module for using spaCy natural language processing library from the Ruby programming language via PyCall

gpt natural-language nlp openai parsing ruby spacy word-embeddings

Last synced: 21 Dec 2024

https://github.com/liuzl/ling

Natural Language Processing Toolkit in Golang

corenlp lemmatization nlp normalization opencc spacy tokenization

Last synced: 12 Oct 2024

https://github.com/wjbmattingly/spacyex

SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.

nlp spacy

Last synced: 31 Oct 2024

https://github.com/samedwardes/spacytextblob

A TextBlob sentiment analysis pipeline component for spaCy.

natural-language-processing nlp python spacy

Last synced: 21 Dec 2024

https://github.com/argilla-io/adept-augmentations

A Python library aimed at dissecting and augmenting NER training data.

dataset datasets few-shot-learning machine-learning natural-language-processing nlp spacy

Last synced: 18 Oct 2024

https://github.com/thomasthiebaud/spacy-fastlang

Language detection using Spacy and Fasttext

fasttext fasttext-python language-detection spacy spacy-extensions

Last synced: 19 Dec 2024

https://github.com/explosion/spacy-ray

☄️ Parallel and distributed training with spaCy and Ray

distributed-computing machine-learning natural-language-processing parallel-training ray spacy training

Last synced: 07 Oct 2024

https://github.com/ahalterman/multiuser_prodigy

Running Prodigy for a team of annotators

prodigy spacy

Last synced: 08 Nov 2024

https://github.com/paulrinckens/timexy

A spaCy custom component that extracts and normalizes temporal expressions

date-parser datetime natural-language-processing nlp python spacy spacy-extension timeml timex3

Last synced: 14 Oct 2024

https://github.com/jenojp/extractacy

Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)

entity-extraction entity-linking ner nlp pattern-matching spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/kootenpv/spacy_api

Server/Client around Spacy to load spacy only once

api machine-learning nlp spacy

Last synced: 14 Oct 2024

https://github.com/kennethenevoldsen/spacy-wrap

spaCy-wrap is a wrapper library for including fine-tuned transformers from Hugging Face in your spaCy pipeline, so that existing fine-tuned models can be used within your spaCy workflow.

deep-learning huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch spacy spacy-extension spacy-extensions spacy-models spacy-nlp spacy-pipeline spacy-transformers text-classification transformers

Last synced: 12 Oct 2024

https://github.com/explosion/ml-datasets

🌊 Machine learning dataset loaders for testing and example scripts

datasets machine-learning machine-learning-datasets spacy testing thinc

Last synced: 07 Oct 2024

https://github.com/machinelearningzh/simply-simplify-language

Use machine learning to make your institutional communication more understandable and inclusive.

anthropic einfachesprache leichtesprache llm llms mistral mistralai natural-language-processing nlp openai plainlanguage python spacy streamlit

Last synced: 19 Dec 2024

https://github.com/explosion/spacy-huggingface-hub

🤗 Push your spaCy pipelines to the Hugging Face Hub

huggingface machine-learning ml-models models natural-language-processing nlp spacy

Last synced: 07 Oct 2024

https://github.com/explosion/assets

💥 Explosion Assets

machine-learning nlp spacy spacy-nlp

Last synced: 22 Dec 2024

https://github.com/tokestermw/spacy_grammar

✒️ Language Tool style grammar handling with spaCy 2.0

grammar nlp spacy spacy-nlp

Last synced: 07 Nov 2024

https://github.com/ecohealthalliance/epitator

EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and EIDR Connect.

disease-surveillance epidemiology geonames nlp spacy toponym-resolution

Last synced: 14 Oct 2024

https://github.com/aajanki/spacy-fi

Experimental Finnish language model for SpaCy

finnish-language-analysis spacy spacy-models

Last synced: 25 Dec 2024

https://github.com/liebeck/spacy-sentiws

German sentiment scores with SentiWS as extension for spaCy

nlp spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/adirthaborgohain/ner-re

A Named Entity Recognition + Entity Linker + Relation Extraction pipeline built using spaCy v3. Given a text, the pipeline extracts the entities it was trained to recognize, disambiguates them to their normalized form through an Entity Linker connected to a Knowledge Base, and assigns a relation between the entities, if any.

named-entity-recognition nlp relation-extraction spacy transformers

Last synced: 09 Nov 2024

https://github.com/cyclecycle/spacy-pattern-builder

Reverse engineer patterns for use with SpaCy's DependencyMatcher

nlp python spacy

Last synced: 10 Oct 2024

https://github.com/writer/replacy

spaCy match and replace, maintaining conjugation

nlp spacy

Last synced: 01 Nov 2024

https://github.com/dadosabertosdefeira/tomba

Identify Brazilian addresses, neighborhoods, and other locations in a text 🏘

brasil hacktoberfest nlp spacy

Last synced: 12 Oct 2024

https://github.com/ucrel/pymusas

Python Multilingual Ucrel Semantic Analysis System

natural-language-processing nlp python spacy spacy-pipeline

Last synced: 24 Dec 2024

https://github.com/sarthakjshetty/pyresearchinsights

End-to-end NLP tool to analyze research publications. Published in Ecology & Evolution 2021.

gensim natural-language-processing nlp python scientific-analysis spacy text-mining

Last synced: 18 Dec 2024

https://github.com/fractalego/pynsett

A programmable relation extraction tool

extract-relationships nlp relation-extraction spacy wikidata-knowledge

Last synced: 12 Oct 2024

https://github.com/fredriko/bert-tensorflow-pytorch-spacy-conversion

Instructions for converting a BERT TensorFlow model to work with HuggingFace's pytorch-transformers and spaCy. This walk-through uses DeepPavlov's RuBERT as an example.

bert bert-model how-to keras nlp pytorch-transformers spacy spacy-models spacy-nlp spacy-package spacy-pytorch-transformers tensorflow

Last synced: 27 Nov 2024

https://github.com/vasisouv/tweets-preprocessor

Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team

nltk preprocessing python spacy spacy-nlp twitter

Last synced: 15 Dec 2024

https://github.com/autonomio/autonomio

Core functionality for the Autonomio augmented intelligence workbench.

artificial-intelligence datascience deep-learning hyperscan keras lstm machine-learning mlp neural-network spacy tensorflow wrangling

Last synced: 06 Nov 2024

https://github.com/seannaren/cord-19-ann

ANN Search through the COVID CORD-19 Dataset using SBERT.

cord-19 covid-19 machine-learning pytorch scispacy spacy transformer

Last synced: 29 Oct 2024

https://github.com/tlack/hairytext

A data labeling and NLP tool for Elixir (uses Spacy)

elixir entity-recognition nlp nlp-machine-learning phoenix-live-view spacy text-classification

Last synced: 28 Oct 2024

https://github.com/ahmedbesbes/anonymizer

Text Anonymization app with Streamlit and Spacy

heroku nlp nlproc spacy st-annotated-text streamlit streamlit-component

Last synced: 23 Nov 2024

https://github.com/liebeck/spacy-iwnlp

German lemmatization with IWNLP as extension for spaCy

nlp spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/iclrandd/case2vec

A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the Incorporated Council of Law Reporting for England & Wales (https://www.iclr.co.uk).

caselaw gensim-word2vec natural-language-processing sense2vec spacy word2vec

Last synced: 29 Oct 2024

https://github.com/doccano/spacy-partial-tagger

A simple library for training named entity recognition models from partially annotated data

named-entity-recognition natural-language-processing nlp spacy weak weak-supervision weakly-supervised-learning

Last synced: 10 Oct 2024

https://github.com/explosion/spacy-benchmarks

💫 Runtime performance comparison of spaCy against other NLP libraries

benchmarking benchmarks natural-language-processing nlp spacy

Last synced: 25 Sep 2024

https://github.com/bramvanroy/astred

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.

alignment linguistics nlp parallel-corpus parsing spacy stanza translation

Last synced: 14 Oct 2024

https://github.com/hamishivi/lec2flash

Generating flashcards from lecture notes

flask neo4j python3 question-generator spacy

Last synced: 28 Nov 2024
