Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/alisonmitchell/stock-prediction

Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.

beautifulsoup bert gensim huggingface keras-tensorflow machine-learning matplotlib mplfinance nlp nltk numpy pandas plotly python scikit-learn scipy seaborn spacy textblob yfinance

Last synced: 14 Oct 2024

https://github.com/dipanjans/nlp_workshop_odsc_europe20

Extensive tutorials for the Advanced NLP Workshop at the Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular NLP tasks, including NER, Classification, Recommendation / Information Retrieval, Summarization, Language Translation, Q&A and Topic Models.

deep-learning gensim jupyter-notebook machine-learning natural-language-processing nltk python pytorch scikit-learn spacy tensorflow transfer-learning transformers

Last synced: 14 Oct 2024

https://github.com/explosion/spacy-dev-resources

💫 Scripts, tools and resources for developing spaCy

natural-language-processing nlp python spacy

Last synced: 25 Sep 2024

https://github.com/eellak/nlpbuddy

A text analysis application for performing common NLP tasks through a web dashboard interface and an API

fasttext gensim natural-language-processing spacy text-analysis text-classification

Last synced: 14 Oct 2024

https://github.com/norskregnesentral/weak-supervision-for-ner

Framework to learn Named Entity Recognition models without labelled data using weak supervision.

domain-adaptation hidden-markov-models named-entity-recognition natural-language-processing nlp python spacy weak-supervision

Last synced: 25 Sep 2024

https://github.com/tiesdekok/python_nlp_tutorial

This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)

computational-linguistics natural-language-processing nlp nltk python research spacy text-mining textblob textual-analysis

Last synced: 14 Oct 2024

https://github.com/kennethenevoldsen/asent

Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.

interpretability natural-language-processing nlp python3 sentiment-analysis spacy spacy-extensions

Last synced: 31 Oct 2024

https://github.com/brucewlee/lftk

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.

bea-workshop feature-extraction handcrafted-features linguistic-features natural-language-processing python readability-scores reading-time spacy text-analysis word-difficulty

Last synced: 31 Oct 2024

https://github.com/aphp/edsnlp

Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.

clinical-data-warehouse deep-learning fast french medical multi-task nlp pytorch rule-based spacy text-mining

Last synced: 14 Oct 2024

https://github.com/koichiyasuoka/deplacy

CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis

dependency-visualizer nlp-cube spacy stanza

Last synced: 14 Oct 2024

https://github.com/kororo/excelcy

Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.

entity excel nlp python python3 spacy spacy-extensions spacy-nlp spacy-pipeline training xlsx

Last synced: 14 Oct 2024

https://github.com/martinomensio/spacy-sentence-bert

Sentence transformers models for SpaCy

bert models nlp sentence-bert sentence-transformers spacy

Last synced: 14 Oct 2024

https://github.com/davidberenstein1957/crosslingual-coreference

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

coreference coreference-resolution hacktoberfest natural-language-processing nlp python spacy

Last synced: 01 Nov 2024

https://github.com/baderlab/saber

Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress. Many things are broken, and the codebase is not stable.

bioinformatics biomedical-named-entity-recognition biomedical-text-mining deep-learning information-extraction machine-learning spacy

Last synced: 14 Oct 2024

https://github.com/d99kris/spacy-cpp

C++ wrapper library for the NLP library spaCy

c-plus-plus linux nlp nlp-libraries spacy

Last synced: 14 Oct 2024

https://github.com/explosion/spacy-experimental

🧪 Cutting-edge experimental spaCy components and features

lemmatizer machine-learning natural-language-processing nlp spacy spacy-extension spacy-pipeline tokenizer

Last synced: 07 Oct 2024

https://github.com/explosion/talks

💥 Browser-based slides or PDFs of our talks and presentations

presentations slides spacy talks

Last synced: 25 Sep 2024

https://github.com/abhijit-2592/spacy-langdetect

A fully customisable language detection pipeline for spaCy

googletrans langdetect language-detection pycld2 spacy spacy-extension

Last synced: 14 Oct 2024

https://github.com/explosion/spacy-lookups-data

📂 Additional lookup tables and data resources for spaCy

lemmatization machine-learning natural-language-processing nlp spacy

Last synced: 07 Oct 2024

https://github.com/explosion/thinc-apple-ops

🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library

apple spacy thinc

Last synced: 07 Oct 2024

https://github.com/ub-mannheim/spacyopentapioca

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

entity-linking named-entity-linking spacy spacy-extensions spacy-pipeline wikidata

Last synced: 14 Oct 2024

https://github.com/bjascob/pyinflect

A python module for word inflections designed for use with spaCy.

inflection nlp python spacy spacy-extension

Last synced: 14 Oct 2024

https://github.com/centre-for-humanities-computing/dacy

DaCy: The State of the Art Danish NLP pipeline using SpaCy

danish-language natural-language-processing reproducible-workflows spacy

Last synced: 31 Oct 2024

https://github.com/julesbelveze/concepcy

💫 SpaCy wrapper for ConceptNet 💫

conceptnet nlp spacy

Last synced: 14 Oct 2024

https://github.com/explosion/healthsea

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.

healthcare machine-learning named-entity-recognition natural-language-processing pipeline spacy text-classification

Last synced: 07 Oct 2024

https://github.com/ines/spacy-graphql

🤹‍♀️ Query spaCy's linguistic annotations using GraphQL

flask flask-api graphql natural-language-processing nlp python spacy

Last synced: 19 Oct 2024

https://github.com/sorenlind/lemmy

🤘 Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪

danish lemma lemmatizer nlp spacy swedish

Last synced: 12 Oct 2024

https://github.com/bramvanroy/spacy_conll

Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.

conll conll-u data-science machine-learning natural-language-processing nlp pandas parser python spacy spacy-extension spacy-pipeline stanford-machine-learning stanford-nlp stanza udpipe

Last synced: 30 Oct 2024

https://github.com/ahalterman/mordecai3

Full text geoparsing/toponym resolution with event geolocation

geolocation geoparsing spacy toponym-resolution

Last synced: 12 Oct 2024

https://github.com/opennyai/opennyai

Opennyai: An efficient NLP pipeline for Indian legal documents

indian-laws indian-legal-judgements legalnlp machine-learning natural-language-processing nlp python spacy

Last synced: 30 Oct 2024

https://github.com/zaibacu/rita-dsl

A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format

dsl language natural-language-processing nlp parsing python regex rule-based spacy

Last synced: 12 Oct 2024

https://github.com/deneutoy/spacy-vis

A visualisation tool for Spacy using Hierplane.

dependency-parsing nlp spacy visualization

Last synced: 01 Nov 2024

https://github.com/liuzl/ling

Natural Language Processing Toolkit in Golang

corenlp lemmatization nlp normalization opencc spacy tokenization

Last synced: 12 Oct 2024

https://github.com/yohasebe/ruby-spacy

A wrapper module for using spaCy natural language processing library from the Ruby programming language via PyCall

gpt natural-language nlp openai parsing ruby spacy word-embeddings

Last synced: 14 Oct 2024

https://github.com/wjbmattingly/spacyex

SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.

nlp spacy

Last synced: 31 Oct 2024

https://github.com/argilla-io/adept-augmentations

A Python library aimed at dissecting and augmenting NER training data.

dataset datasets few-shot-learning machine-learning natural-language-processing nlp spacy

Last synced: 18 Oct 2024

https://github.com/SamEdwardes/spacytextblob

A TextBlob sentiment analysis pipeline component for spaCy.

natural-language-processing nlp python spacy

Last synced: 04 Aug 2024

https://github.com/explosion/spacy-ray

☄️ Parallel and distributed training with spaCy and Ray

distributed-computing machine-learning natural-language-processing parallel-training ray spacy training

Last synced: 07 Oct 2024

https://github.com/paulrinckens/timexy

A spaCy custom component that extracts and normalizes temporal expressions

date-parser datetime natural-language-processing nlp python spacy spacy-extension timeml timex3

Last synced: 14 Oct 2024

https://github.com/thomasthiebaud/spacy-fastlang

Language detection using Spacy and Fasttext

fasttext fasttext-python language-detection spacy spacy-extensions

Last synced: 12 Oct 2024

https://github.com/ahalterman/multiuser_prodigy

Running Prodigy for a team of annotators

prodigy spacy

Last synced: 19 Oct 2024

https://github.com/jenojp/extractacy

Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)

entity-extraction entity-linking ner nlp pattern-matching spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/kennethenevoldsen/spacy-wrap

spaCy-wrap is a wrapper library for spaCy that lets you include existing fine-tuned transformer models from Huggingface in your spaCy pipeline.

deep-learning huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch spacy spacy-extension spacy-extensions spacy-models spacy-nlp spacy-pipeline spacy-transformers text-classification transformers

Last synced: 12 Oct 2024

https://github.com/kootenpv/spacy_api

Server/Client around Spacy to load spacy only once

api machine-learning nlp spacy

Last synced: 14 Oct 2024

https://github.com/explosion/ml-datasets

🌊 Machine learning dataset loaders for testing and example scripts

datasets machine-learning machine-learning-datasets spacy testing thinc

Last synced: 07 Oct 2024

https://github.com/explosion/assets

💥 Explosion Assets

machine-learning nlp spacy spacy-nlp

Last synced: 07 Oct 2024

https://github.com/explosion/spacy-huggingface-hub

🤗 Push your spaCy pipelines to the Hugging Face Hub

huggingface machine-learning ml-models models natural-language-processing nlp spacy

Last synced: 07 Oct 2024

https://github.com/ecohealthalliance/epitator

EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and EIDR Connect.

disease-surveillance epidemiology geonames nlp spacy toponym-resolution

Last synced: 14 Oct 2024

https://github.com/machinelearningzh/simply-simplify-language

Use machine learning to make your institutional communication more understandable and inclusive.

anthropic einfachesprache leichtesprache llm llms mistral mistralai natural-language-processing nlp openai plainlanguage python spacy streamlit

Last synced: 14 Oct 2024

https://github.com/aajanki/spacy-fi

Experimental Finnish language model for SpaCy

finnish-language-analysis spacy spacy-models

Last synced: 31 Oct 2024

https://github.com/liebeck/spacy-sentiws

German sentiment scores with SentiWS as extension for spaCy

nlp spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/cyclecycle/spacy-pattern-builder

Reverse engineer patterns for use with SpaCy's DependencyMatcher

nlp python spacy

Last synced: 10 Oct 2024

https://github.com/writer/replacy

spaCy match and replace, maintaining conjugation

nlp spacy

Last synced: 01 Nov 2024

https://github.com/dadosabertosdefeira/tomba

Identify addresses, neighborhoods and other Brazilian locations in a text 🏘

brasil hacktoberfest nlp spacy

Last synced: 12 Oct 2024

https://github.com/sarthakjshetty/pyresearchinsights

End-to-end NLP tool to analyze research publications. Published in Ecology & Evolution 2021.

gensim natural-language-processing nlp python scientific-analysis spacy text-mining

Last synced: 12 Oct 2024

https://github.com/fractalego/pynsett

A programmable relation extraction tool

extract-relationships nlp relation-extraction spacy wikidata-knowledge

Last synced: 12 Oct 2024

https://github.com/ucrel/pymusas

Python Multilingual Ucrel Semantic Analysis System

natural-language-processing nlp python spacy spacy-pipeline

Last synced: 12 Oct 2024

https://github.com/fredriko/bert-tensorflow-pytorch-spacy-conversion

Instructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through uses DeepPavlov's RuBERT as example.

bert bert-model how-to keras nlp pytorch-transformers spacy spacy-models spacy-nlp spacy-package spacy-pytorch-transformers tensorflow

Last synced: 07 Aug 2024

https://github.com/seannaren/cord-19-ann

ANN Search through the COVID CORD-19 Dataset using SBERT.

cord-19 covid-19 machine-learning pytorch scispacy spacy transformer

Last synced: 29 Oct 2024

https://github.com/tlack/hairytext

A data labeling and NLP tool for Elixir (uses Spacy)

elixir entity-recognition nlp nlp-machine-learning phoenix-live-view spacy text-classification

Last synced: 28 Oct 2024

https://github.com/adirthaborgohain/ner-re

A Named Entity Recognition + Entity Linker + Relation Extraction pipeline built using spaCy v3. Given a text, the pipeline extracts the entities it was trained on, disambiguates each entity to its normalized form through an Entity Linker connected to a Knowledge Base, and assigns a relation between the entities, if any.

named-entity-recognition nlp relation-extraction spacy transformers

Last synced: 23 Oct 2024

https://github.com/liebeck/spacy-iwnlp

German lemmatization with IWNLP as extension for spaCy

nlp spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/iclrandd/case2vec

A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated Council of Law Reporting for England & Wales (https://www.iclr.co.uk).

caselaw gensim-word2vec natural-language-processing sense2vec spacy word2vec

Last synced: 29 Oct 2024

https://github.com/doccano/spacy-partial-tagger

A simple library for training named entity recognition model from partially annotated data

named-entity-recognition natural-language-processing nlp spacy weak weak-supervision weakly-supervised-learning

Last synced: 10 Oct 2024

https://github.com/explosion/spacy-benchmarks

💫 Runtime performance comparison of spaCy against other NLP libraries

benchmarking benchmarks natural-language-processing nlp spacy

Last synced: 25 Sep 2024

https://github.com/bramvanroy/astred

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.

alignment linguistics nlp parallel-corpus parsing spacy stanza translation

Last synced: 14 Oct 2024

https://github.com/yashdew/assessor

An open-source Resume Analyzer and Ranking tool for recruiters and candidates.

flask hacktoberfest hacktoberfest2021 nextjs nlp python spacy

Last synced: 27 Oct 2024

https://github.com/mmxgn/sprl-spacy

Implementation of Spatial Role Labeling using the Spacy NLP framework.

nlp problog spacy spatial-role-labeling sprl

Last synced: 10 Oct 2024

https://gitlab.com/tangibleai/community/qary-cli

Command Line Interface for developers of qary -- the open source, teachable AI assistant that truly assists, rather than manipulating you.

BERT ELIZA GPT3 HACKTOBERFEST2022 NLP Virtual Assistant chatbot datasets hacktoberfest machine learning pytorch sklearn spacy torch

Last synced: 01 Nov 2024

https://github.com/spacyturk/spacyturk

spaCyTurk - trained models & pipelines for Turkish

floret nlp nlp-library spacy turkish-nlp

Last synced: 12 Oct 2024

https://github.com/izuna385/wikia-and-wikipedia-el-dataset-creator

You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wiki are available!

deep-learning entity-linking named-entity-recognition natural-language-processing natural-language-understanding spacy wikipedia

Last synced: 18 Oct 2024

https://github.com/tokestermw/spacy_kenlm

🎲 KenLM extension for spaCy 2.0.

kenlm language-model nlp spacy spacy-extension spacy-nlp

Last synced: 14 Oct 2024

https://github.com/ccoreilly/spacy-catala

Spacy NLP Model for the Catalan language

catalan catalan-language nlp nlp-model nlu nlu-model spacy

Last synced: 23 Oct 2024

https://github.com/maxbot-ai/maxbot

Maxbot is an open source library and framework for creating conversational apps

bot botkit chatbot chatbot-framework conversational-ai conversational-apps maxbot nlp rasa spacy text-bot voice-bot

Last synced: 18 Oct 2024

https://github.com/wadaboa/ner-annotator

GUI useful to manually annotate text for Named Entity Recognition purposes

named-entity-recognition ner nlp pyqt5 spacy

Last synced: 12 Oct 2024

https://github.com/ghosthamlet/CHN

Hacker News on the console, with an automatic classifier and recommender, in ReactJS-style code

console-ui deeplearning hacker-news hackernews machinelearning reactjs sklearn spacy word2vec

Last synced: 03 Aug 2024
