Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/weihanchen/google-colab-python-learn

📚 Learn Google Colab, Python, ML, OpenAI, Whisper, spaCy, NLP, HuggingFace

colab-notebook huggingface matplotlib natural-language-processing nlp openai pandas python spacy whisper

Last synced: 11 Nov 2024

https://github.com/nluninja/text-mining-dataviz

Data Visualization and Text Mining course - UNICATT

embeddings lstm nlp spacy text-mining transformers

Last synced: 09 Nov 2024

https://github.com/nickcrews/spacy-address

Parse one-line US addresses using a spaCy NER model trained on OSM data

address address-parsing osm osm-data spacy spacy-nlp usaddress

Last synced: 20 Oct 2024

https://github.com/amrrs/intro_to_nlp_with_spacy

Introduction to NLP with Spacy - Bangpypers October Talk

nlp python spacy

Last synced: 15 Nov 2024

https://github.com/herambvd/spoken2written

Source for a Python package that converts spoken-language styles to their equivalent written form.

artificial-intelligence entity machine-learning named-entity-recognition natural-language-processing spacy speech-recognition token-matcher

Last synced: 14 Oct 2024

https://github.com/martinjack/uaddresspacy

🇺🇦 UAddresspacy | spaCy parsing of Ukrainian addresses into component types

address nlp parsing spacy spacy-nlp ukraine

Last synced: 26 Sep 2024

https://github.com/gtoffoli/spacy-ar_core_news_md

Unofficial Arabic language model for spaCy

arabic-language camel nlp python spacy spacy-pipeline tokenizer

Last synced: 14 Oct 2024

https://github.com/direct-phonology/spacy-och

The Old Chinese language for spaCy

chinese nlp spacy

Last synced: 18 Dec 2024

https://github.com/marmg/moviener

Code for the NER demo. Prepare data, train and extract entities from movie reviews.

extract-entities movie-reviews ner spacy

Last synced: 19 Nov 2024

https://github.com/jamnicki/metin2_vision_bot

Automatic MMORPG bot for mass dungeon passing, based on the Windows API, YOLOv8 object detection, statistical methods from OpenCV, Tesseract-OCR, and spaCy, virtualised on Hyper-V (Win11 + CUDA)

computer-vision object-detection opencv spacy tesseract-ocr torchvision ultralytics win32

Last synced: 03 Nov 2024

https://github.com/riyajha2305/healthcare-diagnosis-chatbot-ms-hackathon

Built a chatbot capable of diagnosing common medical conditions based on user-reported symptoms. Utilized pre-trained machine learning models, such as NLP and NER models from Hugging Face and spaCy, trained on medical data to provide accurate suggestions and recommendations for further action.

hackathon healthcare-chatbot huggingface machine-learning ner nlp python spacy tkinter

Last synced: 09 Oct 2024

https://github.com/tomfran/caselaw-temporal-analysis

Information retrieval techniques applied to legal texts to study term relevance over the years and with respect to similar types of cases.

caselaw illinois-courts information-retrieval latent-dirichlet-allocation semantic-shifts spacy wordembeddings

Last synced: 10 Nov 2024

https://github.com/adriaanbd/kamtutecs-api

A Dockerized API for OCR and NLP using Tesseract, OpenCV, and spaCy.

docker fastapi nlp ocr spacy tesseract translate

Last synced: 21 Dec 2024

https://github.com/populated/compare

Simple Python code to compare texts for similarity.

comparsion nlp numpy spacy text

Last synced: 14 Oct 2024

https://github.com/neurotech-hq/swahili-ner-spacy

Swahili NER model trained using spacy

ner spacy swahili-ner

Last synced: 08 Nov 2024

https://github.com/inanyan/spacy_pat_match_dsl

A simple DSL for creating spaCy pattern matchers

dsl nlp python spacy

Last synced: 28 Nov 2024

https://github.com/teakulo/eventime-app

Eventime App is an event management platform using Angular, Spring Boot, Flask, and PostgreSQL. It offers AI-powered event recommendations, social features, and secure authentication. Users can manage events, chat with a chatbot, and view their calendar.

ai angular authentication calendar chatbot event flask lemmatization nlp nltk postgresql spacy springboot

Last synced: 14 Oct 2024

https://github.com/surajiyer/spacybert

BERT inference (with similar function to hanxiao/bert-as-service) for spaCy with custom extension attributes

bert huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch pytorch-model spacy spacy-extension spacy-pipeline

Last synced: 28 Nov 2024

https://github.com/pyladiesams/nlp-beginner-nov2020

Intro to NLP with NLTK, spaCy, and gensim

gensim nlp nlp-machine-learning nltk python spacy

Last synced: 09 Nov 2024

https://github.com/5hirish/django_adam_qas

ADAM - QA -- Front-end using Django and Material Design.

django natural-language-processing python3 question-answering spacy

Last synced: 11 Nov 2024

https://github.com/turbolent/spacyclient

A Swift client for spaCy

client nlp spacy swift

Last synced: 08 Dec 2024

https://github.com/ayushsubedi/choto

CLI tool to generate a summary of news/articles right on your terminal. Also a pip package.

articles bert choto cli click gensim news pip python spacy summary

Last synced: 14 Oct 2024

https://github.com/yash22222/terrorist-activity-forecasting-and-risk-assessment-system

In an era marked by global security challenges, the "TAFRAS" emerges as a cutting-edge solution to tackle the ever-evolving threat of terrorism. The project is grounded in the urgent need for predictive systems that can anticipate, assess, and mitigate potential terrorist activities.

corpora data-vizualisation folium-maps gensim global-terrorism-database lda machine-learning matplotlib networkx nltk nmf numpy pandas python random-forest-classifier seaborn sklearn spacy textblob vader-sentiment-analysis

Last synced: 09 Nov 2024

https://github.com/ljvmiranda921/ud-tagalog-spacy

Training a POS Tagger and Dependency Parser for a Low-Resource Language (Tagalog)

low-resource-languages machine-learning nlp spacy tagalog

Last synced: 08 Dec 2024

https://github.com/gtoffoli/commons-textanalysis

Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.

django nlp python spacy text-analysis

Last synced: 30 Nov 2024

https://github.com/timuroeztuerk/data-science-lecture-s24

This is the webpage of the Data Science course offered by VWL 7 for the summer semester 2024.

economics natural-language-processing nltk spacy text-classification

Last synced: 14 Oct 2024

https://github.com/nanxstats/pdf-word-extraction

Extract meaningful words from a collection of PDF documents and count their frequencies

ftfy natural-language-processing pypdf research-paper spacy wordcloud

Last synced: 16 Nov 2024

https://github.com/acdh-oeaw/acdh-prodigy-utils

custom loaders for spaCy's prodigy

prodigy spacy

Last synced: 22 Nov 2024

https://github.com/umactually/papanatas

Papanatas Autómata Multiparadigma IV. The official bot of my Discord server, Sociedad de Patanes.

discord discord-bot discord-py ffmpeg pillow pycord python spacy

Last synced: 01 Dec 2024

https://github.com/doug1043/sistembot

Telegram bot for automated customer service at pizzerias, using natural language processing tools.

chatbot gaussian-naive-bayes nlp processamento-de-linguagem-natural python3 scikit-learn sklearn spacy spacy-nlp telegram telegram-bot telegram-bot-api

Last synced: 17 Nov 2024

https://github.com/bemxio/julia-robotczyk

A Facebook Messenger chatbot based on my classmate's messages

facebook markov-chain markovify messenger nlp python spacy

Last synced: 15 Nov 2024

https://github.com/yarosj/prestige-of-districts

🔎 This application parses sites and retrieves data associated with failures of public services to display districts' prestige

amqp apollo-client apollo-server docker-compose graphql mapbox-gl ner neural-network nlp nodejs parsing pika python3 rabbitmq react scraping semantic-ui-react spacy taskscheduler webpack

Last synced: 17 Nov 2024

https://github.com/jash271/youglance

Package for analyzing YouTube videos, from searching by relevant entities to analyzing sentiment and clustering different parts of the video to your liking

cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling

Last synced: 28 Nov 2024

https://github.com/keshabkjha/weatherapp

WeatherApp is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot called Weatha, built using Python (Streamlit & SpaCy), that responds to weather-related queries.

html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app

Last synced: 25 Oct 2024

https://github.com/sukanyadutta52/sentiment-analysis

An Analysis of How Machines Perceive Women and How Women Feel about Themselves as a Result of This Perception: Sentiment Analysis

flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard

Last synced: 07 Dec 2024

https://github.com/codebasics/ner-resume-parser

A tutorial for NER Resume Parser to get the keywords out of a resume.

mlflow mlflow-tracking nlp python spacy spacy-models spacy-nlp

Last synced: 16 Nov 2024

https://github.com/oroszgy/spacy-tokenizer-benchmark

Quick and dirty scripts to measure the performance of spaCy

benchmark natural-language-processing nlp python spacy tokenizer

Last synced: 08 Dec 2024

https://github.com/bghorvath/TextMiningTheBechdelTest

Text mining movie scripts to explore the long-term trend of female representation in movies according to the Bechdel test

bechdel bechdel-test coreference-resolution neuralcoref spacy

Last synced: 16 Nov 2024

https://github.com/tomhalloin/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 16 Nov 2024

https://github.com/gtoffoli/spacy-cameltokenizer

Tokenizer extension for the Arabic language (MSA), integrating the Morphological Tokenizer of the camel_tools project (CAMeL Lab).

arabic nlp spacy spacy-pipeline tokenizer tools

Last synced: 30 Nov 2024

https://github.com/sudip-13/nlp

Repo for an NLP tutorial: a configured back end for a Dialogflow chatbot

dialogflow fastapi fasttext mogodb ner regex spacy tf-idf

Last synced: 14 Oct 2024

https://github.com/charlesyuan02/named_entity_recognition

Utilizing spaCy and TensorFlow to train custom Named Entity Recognizers.

conll-2003 named-entity-recognition ner nlp spacy transformer

Last synced: 19 Dec 2024

https://github.com/navaneethelite/ner_streamlit

A general-purpose Named Entity Recognition model using spaCy v3. This web app was built using Streamlit and deployed to Heroku.

heroku-app nlp spacy

Last synced: 29 Nov 2024

https://github.com/whatevery1says/preprocessing

WE1S Preprocessing -- workflow preparing documents for import as WE1S data

digital-humanities humanities news nltk preprocessing spacy topic-modeling

Last synced: 14 Nov 2024

https://github.com/vidhi1290/chatbot-with-rasa-nlu-model-and-python

This project builds an intelligent chatbot using Rasa NLU for an e-commerce business 🛍️. The chatbot can handle user queries about product information, pricing, and order management 💬. With spaCy and TensorFlow pipelines 🧠 for training, and MongoDB for storing data 📦, it offers seamless, context-aware conversations.

aichatbot artificial-intelligence chatbot jupyter-notebook matplotlib nlu nlu-chatbot pandas pymongo python rasa-chatbot rasa-nlu spacy spacy-nlp tensorflow

Last synced: 22 Dec 2024

https://github.com/florensadimer/nlp_ner_soccer_pt-br

Manual Annotation and Comparison with Trained Models

annotation llm machine-learning ner nlp spacy

Last synced: 09 Dec 2024

https://github.com/aadityasivas/spacy-text-summarization

A simple text summarizer built with spaCy

jupyter-notebook nlp python spacy

Last synced: 22 Dec 2024

https://github.com/samestrin/llm-services-api

A FastAPI-powered REST API offering a comprehensive suite of natural language processing services using machine learning models with PyTorch and Transformers, packaged in a Docker container to run efficiently.

api docker fastapi hugging-face hugging-face-transformers huggingface-transformers keybert llm openai-compatible-api python python3 pytorch rest rest-api spacy torch transformers uvicorn

Last synced: 18 Dec 2024

https://github.com/gugarosa/brainy

🧠 An intelligent Python-inspired Machine Learning API for training NLP-based models.

api machine-learning nlp python spacy

Last synced: 07 Dec 2024

https://github.com/fferegrino/zeldakg

A TLOZ inspired knowledge graph

infobox knowledge-graph nltk pandas python spacy wikidata

Last synced: 15 Dec 2024

https://github.com/izuna385/arxiv-checker

Single Page Application and its deployment for GCE.

docker docker-compose fastapi nginx react react-bootstrap spacy tdd

Last synced: 07 Dec 2024

https://github.com/karimosman89/legal-document-nlp

Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements. Use NLP techniques for named entity recognition and text classification. Streamline the review process for legal teams by automating information extraction.

nltk python scikit-learn spacy

Last synced: 07 Nov 2024

https://github.com/izuna385/pubtator-multiprocess-parser

Specifically for Entity Linking. Quick demo with MedMentions and NCBI datasets is also included.

allennlp bioinformatics entity-disambiguation entity-linking natural-language-processing pubtator spacy

Last synced: 07 Dec 2024

https://github.com/shaadclt/businesscard-dataextraction-ocr-ner

This project aims to extract structured data from business cards using a combination of OpenCV, PyTesseract, and spaCy.

ner ocr opencv pytesseract spacy

Last synced: 07 Dec 2024

https://github.com/inshh04/codealpha_chatbotforfaqs_inshanadeem

The FAQ Chatbot is a Python-based conversational agent designed to interact with users and respond to frequently asked questions. It offers a simple and engaging way to provide automated responses, handle polite interactions like thanking the user, and end conversations gracefully. This project serves as a basic template for building more advanced.

chatbot faqbot faqchatbot faqs keyword-extraction nlp nlp-machine-learning progressive-web-app project python python3 pythonprojects spacy spacy-nlp

Last synced: 18 Dec 2024

https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system

The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....

cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer

Last synced: 14 Oct 2024

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 14 Oct 2024

https://github.com/jtlicardo/process-visualizer-web

Web interface for the process-visualizer project

bert bpmn nlp openai spacy

Last synced: 15 Nov 2024

https://github.com/henx117/chatbot

My chatbot python project

chatbot python python3 spacy

Last synced: 14 Oct 2024

https://github.com/f1uctus/p4a-recipes

📱 🐍 A collection of recipes for p4a (Python for Android).

android blis docker numpy p4a python python-for-android spacy

Last synced: 15 Nov 2024

https://github.com/jblake1965/elucidoc

Screens legal text and extracts sentences containing user input party name-predicate phrases

excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word

Last synced: 12 Oct 2024

https://github.com/laurenzv/covbot

A small chatbot written as part of my bachelor thesis.

chatbot corenlp covid-19 docker python spacy sqlite vuejs

Last synced: 18 Dec 2024

https://github.com/srstevenson/keyword-extractor

Extract keywords from plain text documents

nlp spacy tf-idf

Last synced: 20 Nov 2024

https://github.com/lilivalgo/analisis_reportes_onu_cambio_climatico

Web scraping, PDF file manipulation, and NLP with spaCy

beautifulsoup4 pandas pypdf2 python requests spacy wordcloud

Last synced: 07 Dec 2024

https://github.com/kailejie/ner

This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.

bert ner nltk spacy

Last synced: 18 Dec 2024

https://github.com/surajiyer/topic-analysis

Python library to perform topic detection on textual data that are generated over time.

agglomerative-clustering gaussian-mixture-models nlp spacy spectral-clustering textual-data topic-analysis topic-modeling

Last synced: 10 Dec 2024

https://github.com/miteshgupta07/ats-scoring-system

An ATS (Applicant Tracking System) scoring system that evaluates and ranks resumes based on keyword matching and relevance.

ats ats-system nlp python resume-parser spacy

Last synced: 18 Dec 2024

https://github.com/thyripian/core

This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.

api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite

Last synced: 14 Dec 2024

https://github.com/datarohit/nlp-course-files

Files for the online NLP course from Udemy.com, which I completed.

nlp nlp-machine-learning nltk numpy panda python sklearn spacy

Last synced: 23 Dec 2024

https://github.com/jonathanfox5/lemon_tizer

LemonTizer is a class that wraps the spacy library to build a lemmatizer for language learning applications.

lemmatization lemmatizer spacy wrapper

Last synced: 14 Nov 2024

https://github.com/ccoreilly/spacy-catala-generator

Training and dataset used for the Catalan spaCy model

catala catalan catalan-language spacy spacy-models

Last synced: 17 Dec 2024

https://github.com/aiatyourservice/deeplearningforcoders

This repo contains code from the Deep Learning Specialization by Andrew Ng

deep-learning nltk python pytorch spacy

Last synced: 14 Oct 2024

https://github.com/innerdoc/spacy-for-datashare

Let spaCy do the parsing of Named Entities for documents in the Datashare platform

datashare elasticsearch named-entity-recognition natural-language-processing spacy

Last synced: 20 Nov 2024

https://github.com/mbfakourii/nlp-ner

Implement NER in NLP

ner nlp python spacy spacy-nlp

Last synced: 09 Dec 2024

https://github.com/debugger404/multilanguage-pos

Named Entity Recognition with SpaCy - 🌐📝 Repository for NER using SpaCy's MultiLanguage module. Supports multiple languages.

multilanguage named-entity-recognition ner python3 spacy

Last synced: 22 Dec 2024

https://github.com/lucasspinola/monitorbot-api

API built with FastAPI and spaCy to assist an educational bot in its activities during class.

fastapi nlp pln spacy

Last synced: 21 Dec 2024

https://github.com/medspacy/nlp_preprocessor

SpaCy component for modifying the string of a doc before tokenizing.

clinical-nlp medspacy nlp nlp-library pipeline spacy

Last synced: 11 Nov 2024
