Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2024-12-24 00:24:28 UTC
- JSON Representation
https://github.com/nluninja/text-mining-dataviz
Data Visualization and Text Mining course - UNICATT
embeddings lstm nlp spacy text-mining transformers
Last synced: 09 Nov 2024
https://github.com/herambvd/spoken2written
A source of python package which converts language styles in speech to its equivalent written form.
artificial-intelligence entity machine-learning named-entity-recognition natural-language-processing spacy speech-recognition token-matcher
Last synced: 14 Oct 2024
https://github.com/shubhamjai9/emotion-based-counsellor-bot
An Artificial Intelligence based Chat Bot using python tools like Numpy, Pandas, Spacy etc. Counsellor Bot will mimic the characteristics and emotion interpretation skills of human and generate response on basis of emotion of engager.
chatbot gradient-boosting-classifier machine-learning naive-bayes-classifier nlp numpy pandas python-2 spacy
Last synced: 22 Dec 2024
https://github.com/cloudera/cml_amp_spacy_entity_extraction
A Jupyter notebook demonstrating entity extraction on headlines with SpaCy.
entity-extraction named-entity-recognition nlp spacy
Last synced: 07 Nov 2024
https://github.com/senisioi/rolegal
A Spacy Package for Romanian Legal Document Processing
floret legal-documents ner romanian-language spacy
Last synced: 12 Oct 2024
https://github.com/gaving/zorya
:grapes: Build NER graphs from YouTube transcripts
neo4j ner spacy youtube-transcripts
Last synced: 21 Dec 2024
https://github.com/woctezuma/steam-descriptions
Retrieve semantically similar Steam games.
discovery game games gensim glove glove-embeddings glove-vectors spacy steam steam-api steam-descriptions steam-game steam-games steam-store-descriptions word2vec
Last synced: 06 Dec 2024
https://github.com/teakulo/eventime-app
Eventime App is an event management platform using Angular, Spring Boot, Flask, and PostgreSQL. It offers AI-powered event recommendations, social features, and secure authentication. Users can manage events, chat with a chatbot, and view their calendar.
ai angular authentication calendar chatbot event flask lemmatization nlp nltk postgresql spacy springboot
Last synced: 14 Oct 2024
https://github.com/acdh-oeaw/acdh-prodigy-utils
custom loaders for spaCy's prodigy
Last synced: 22 Nov 2024
https://github.com/nanxstats/pdf-word-extraction
Extract meaningful words from a collection of PDF documents and count their frequencies
ftfy natural-language-processing pypdf research-paper spacy wordcloud
Last synced: 16 Nov 2024
https://github.com/5hirish/django_adam_qas
ADAM - QA -- Front-end using Django and Material Design.
django natural-language-processing python3 question-answering spacy
Last synced: 11 Nov 2024
https://github.com/surajiyer/spacybert
BERT inference (with similar function to hanxiao/bert-as-service) for spaCy with custom extension attributes
bert huggingface huggingface-transformers language-model machine-learning natural-language-processing nlp pytorch pytorch-model spacy spacy-extension spacy-pipeline
Last synced: 28 Nov 2024
https://github.com/pyladiesams/nlp-beginner-nov2020
Intro to NLP with NLTK, spaCy, and gensim
gensim nlp nlp-machine-learning nltk python spacy
Last synced: 09 Nov 2024
https://github.com/populated/compare
A simple Python-based code to compare texts for similarities.
comparsion nlp numpy spacy text
Last synced: 14 Oct 2024
https://github.com/inanyan/spacy_pat_match_dsl
A simple DSL for creating spaCy pattern matchers
Last synced: 28 Nov 2024
https://github.com/andrehaguiar/jones_granatyr
Cursos IA Expert Academy - Udemy
nlp nlp-machine-learning nltk nltk-python opencv python spacy tensorflow udemy
Last synced: 07 Nov 2024
https://github.com/izuna385/scispacy-candidate-generator
Generating Candidate Entities with ScispaCy
allennlp entity-linking natural-language-processing scispacy spacy
Last synced: 07 Dec 2024
https://github.com/gtoffoli/commons-textanalysis
Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.
django nlp python spacy text-analysis
Last synced: 30 Nov 2024
https://github.com/riyajha2305/healthcare-diagnosis-chatbot-ms-hackathon
Built a chatbot capable of diagnosing common medical conditions based on user symptoms input. Utilized pre-trained machine learning models such NLP and NER from Huggingface and Spacy, trained on medical data to provide accurate suggestions and recommendations for further action.
hackathon healthcare-chatbot huggingface machine-learning ner nlp python spacy tkinter
Last synced: 09 Oct 2024
https://github.com/tomfran/caselaw-temporal-analysis
Information retrieval techniques applied to legal texts to study terms relevance during years and with respect to similar type of cases.
caselaw illinois-courts information-retrieval latent-dirichlet-allocation semantic-shifts spacy wordembeddings
Last synced: 10 Nov 2024
https://github.com/timuroeztuerk/data-science-lecture-s24
This is the webpage of the Data Science course offered by VWL 7 for the summer semester 2024.
economics natural-language-processing nltk spacy text-classification
Last synced: 14 Oct 2024
https://github.com/ucrel/pymusas-models
PyMUSAS Models
models natural-language-processing nlp spacy spacy-models
Last synced: 22 Nov 2024
https://github.com/direct-phonology/spacy-och
the old chinese language for spaCy
Last synced: 18 Dec 2024
https://github.com/neurotech-hq/swahili-ner-spacy
Swahili NER model trained using spacy
Last synced: 08 Nov 2024
https://github.com/umactually/papanatas
Papanatas Autómata Multiparadigma IV. El bot oficial de mi server de discord, Sociedad de Patanes.
discord discord-bot discord-py ffmpeg pillow pycord python spacy
Last synced: 01 Dec 2024
https://github.com/ljvmiranda921/ud-tagalog-spacy
Training a POS Tagger and Dependency Parser for a Low-Resource Language (Tagalog)
low-resource-languages machine-learning nlp spacy tagalog
Last synced: 08 Dec 2024
https://github.com/bemxio/julia-robotczyk
A Facebook Messenger chatbot based on my classmate's messages
facebook markov-chain markovify messenger nlp python spacy
Last synced: 15 Nov 2024
https://github.com/yash22222/terrorist-activity-forecasting-and-risk-assessment-system
In an era marked by global security challenges, the "TAFRAS" emerges as a cutting-edge solution to tackle the ever-evolving threat of terrorism. The project is grounded in the urgent need for predictive systems that can anticipate, assess, and mitigate potential terrorist activities.
corpora data-vizualisation folium-maps gensim global-terrorism-database lda machine-learning matplotlib networkx nltk nmf numpy pandas python random-forest-classifier seaborn sklearn spacy textblob vader-sentiment-analysis
Last synced: 09 Nov 2024
https://github.com/marmg/moviener
Code for the NER demo. Prepare data, train and extract entities from movie reviews.
extract-entities movie-reviews ner spacy
Last synced: 19 Nov 2024
https://github.com/yarosj/prestige-of-districts
:mag_right: This application parses sites and retrieves data associated with failures of public services to display districts' prestige
amqp apollo-client apollo-server docker-compose graphql mapbox-gl ner neural-network nlp nodejs parsing pika python3 rabbitmq react scraping semantic-ui-react spacy taskscheduler webpack
Last synced: 17 Nov 2024
https://github.com/doug1043/sistembot
BOT Telegram para atendimento automático em pizzarias, utilizando ferramentas de processamento natural de linguagem.
chatbot gaussian-naive-bayes nlp processamento-de-linguagem-natural python3 scikit-learn sklearn spacy spacy-nlp telegram telegram-bot telegram-bot-api
Last synced: 17 Nov 2024
https://github.com/jamnicki/metin2_vision_bot
Automatic MMORPG Bot for Dungeons Massive Passing based on Windows API, YoloV8 object detection, statistical methods from OpenCV, Tesseract-OCR and spaCy virtualised on Hyper-V, Win11+CUDA
computer-vision object-detection opencv spacy tesseract-ocr torchvision ultralytics win32
Last synced: 03 Nov 2024
https://github.com/oroszgy/spacy-tokenizer-benchmark
Quick and dirty scripts to measure the performance of spaCy
benchmark natural-language-processing nlp python spacy tokenizer
Last synced: 08 Dec 2024
https://github.com/lucasspinola/monitorbot-api
API feita com FastApi e Spacy para auxiliar Bot Educacional em suas atividades durante a aula.
Last synced: 21 Dec 2024
https://github.com/keshabkjha/weatherapp
WeatherApp is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot called Weatha, built using Python (Streamlit & SpaCy), that responds to weather-related queries.
html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app
Last synced: 25 Oct 2024
https://github.com/vidhi1290/chatbot-with-rasa-nlu-model-and-python
This project builds an intelligent chatbot using Rasa NLU for an E-Commerce business 🛍️. The chatbot can handle user queries like product information, pricing, and order management 💬. With spacy and TensorFlow pipelines 🧠 for training, and MongoDB for storing data 📦, it offers seamless, context-aware conversations
aichatbot artificial-intelligence chatbot jupyter-notebook matplotlib nlu nlu-chatbot pandas pymongo python rasa-chatbot rasa-nlu spacy spacy-nlp tensorflow
Last synced: 22 Dec 2024
https://github.com/aadityasivas/spacy-text-summarization
A simple text summarizer built with spaCy
jupyter-notebook nlp python spacy
Last synced: 22 Dec 2024
https://github.com/lykmapipo/us-inaugural-addresses
Python scripts to download, process, and analyze US Inaugural Addresses
beautifulsoup4 gensim joblib lykmapipo natural-language-processing nlp nltk python python-scripts requests spacy text-analysis text-analytics text-extraction text-processing web-scraping
Last synced: 21 Dec 2024
https://github.com/ggordonhall/measurement_tagger
Spacy Measurement Tagger
dependency-parser measurement-tagger nlp python spacy tagger
Last synced: 07 Nov 2024
https://github.com/surajiyer/topic-analysis
Python library to perform topic detection on textual data that are generated over time.
agglomerative-clustering gaussian-mixture-models nlp spacy spectral-clustering textual-data topic-analysis topic-modeling
Last synced: 10 Dec 2024
https://github.com/sloev/spacy_onnx_sentiment_english
english sentiment model for spacy
onnx-models sentiment-analysis spacy spacy-pipeline
Last synced: 08 Dec 2024
https://github.com/aiatyourservice/deeplearningforcoders
Hey, this repo contains code from deep learning specialization by Andrew NG
deep-learning nltk python pytorch spacy
Last synced: 14 Oct 2024
https://github.com/izuna385/arxiv-checker
Single Page Application and its deployment for GCE.
docker docker-compose fastapi nginx react react-bootstrap spacy tdd
Last synced: 07 Dec 2024
https://github.com/public-health-scotland/dose_instruction_parser
Parsing prescription dose instructions using Named Entity Recognition and rules
dose-instructions drug machine-learning named-entity-recognition natural-language-processing nhs prescribing prescriptions public-health scotland spacy
Last synced: 14 Oct 2024
https://github.com/jonasrenault/cprex
Chemical Properties Relation Extraction
chemistry crawler deep-learning information-extraction machine-learning named-entity-recognition nlp pubchem relation-extraction scientific-articles spacy transformers
Last synced: 14 Oct 2024
https://github.com/izuna385/pubtator-multiprocess-parser
Specifically for Entity Linking. Quick demo with MedMentions and NCBI datasets is also included.
allennlp bioinformatics entity-disambiguation entity-linking natural-language-processing pubtator spacy
Last synced: 07 Dec 2024
https://github.com/karimosman89/legal-document-nlp
Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements.Use NLP techniques for named entity recognition and text classification.Streamline the review process for legal teams by automating information extraction.
nltk python scikit-learn spacy
Last synced: 07 Nov 2024
https://github.com/metalcorebear/spacy-affect-model
A Spacy model for measuring emotional affect.
affect affect-analysis model nltk-python nrclex sentiment-analysis sentiment-classification spacy spacy-nlp vader-sentiment-analysis
Last synced: 07 Nov 2024
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 14 Oct 2024
https://github.com/codebasics/ner-resume-parser
A tutorial for NER Resume Parser to get the keywords out of a resume.
mlflow mlflow-tracking nlp python spacy spacy-models spacy-nlp
Last synced: 16 Nov 2024
https://github.com/isabelleysseric/sentiment-analysis
Sentiment analysis with dependency tree.
bag-of-words-model corpus dependency-analysis dependency-parsing dependency-tree dependency-trees displacy embedding multigram nlp nltk parsing pos-tagging scope-of-negation sentiment-analysis sentimental-analysis sentiwordnet spacy text-classification
Last synced: 08 Nov 2024
https://github.com/lilivalgo/analisis_reportes_onu_cambio_climatico
Web Scraping, manipulación de files.PDF, NPL con SpaCy
beautifulsoup4 pandas pypdf2 python requests spacy wordcloud
Last synced: 07 Dec 2024
https://github.com/zevio/pcu
Plateforme de Connaissances Unifiées (PCU) project (i.e Unified Knowledge Platform)
extraction json keyphrase-extraction kleis knowledge knowledge-extraction langdetect pcu pcu-io pcu-json pcu-keyphrase pcu-language pcu-nlp pcu-pdf pcu-relation pdf python spacy text workflow
Last synced: 10 Nov 2024
https://github.com/turbolent/telescope
Go explore
compiler nlp parser question-answering scala spacy sparql
Last synced: 08 Dec 2024
https://github.com/ajla-brdarevic/pdf_question_generator
Project - Artificial intelligence
ai flask machine-learning mt5 pypdf2 python spacy transformers
Last synced: 08 Dec 2024
https://github.com/charlesyuan02/named_entity_recognition
Utilizing Spacy and Tensorflow to train custom Named Entity Recognizers.
conll-2003 named-entity-recognition ner nlp spacy transformer
Last synced: 19 Dec 2024
https://github.com/sudip-13/nlp
This repo for tutorial NLP dialog flow chat bot back end configured
dialogflow fastapi fasttext mogodb ner regex spacy tf-idf
Last synced: 14 Oct 2024
https://github.com/jonathanfox5/lemon_tizer
LemonTizer is a class that wraps the spacy library to build a lemmatizer for language learning applications.
lemmatization lemmatizer spacy wrapper
Last synced: 14 Nov 2024
https://github.com/5hraddha/sentiment-analysis
An innovative system for filtering and categorizing movie reviews
countvectorizer dummyclassifier lgbmclassifier logisticregression matplotlib minmaxscaler nltk nltk-stopwords nltk-tokenizer numpy pandas seaborn spacy tfidfvectorizer torch tqdm transformers
Last synced: 18 Dec 2024
https://github.com/somenath203/named-entity-recognizer
Click below to checkout the website
huggingface huggingface-spaces named-entity-recognition ner spacy streamlit torch transformers
Last synced: 14 Oct 2024
https://github.com/jblake1965/elucidoc
Screens legal text and extracts sentences containing user input party name-predicate phrases
excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word
Last synced: 12 Oct 2024
https://github.com/medspacy/nlp_preprocessor
SpaCy component for modifying the string of a doc before tokenizing.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 11 Nov 2024
https://github.com/kailejie/ner
This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.
Last synced: 18 Dec 2024
https://github.com/aitechhero/nonullsense-nlp
Natural Language Processing (NLP) with libraries like spaCy, Transformers, and NLTK.
ai artificial-intelligence huggingface natural-language-processing nlp nltk python spacy text-analysis transformers
Last synced: 11 Nov 2024
https://github.com/miteshgupta07/ats-scoring-system
An ATS (Applicant Tracking System) scoring system that evaluates and ranks resumes based on keyword matching and relevance.
ats ats-system nlp python resume-parser spacy
Last synced: 18 Dec 2024
https://github.com/plandes/mimic
MIMIC III Corpus Parsing
mimic-iii natural-language-processing parsing-library spacy
Last synced: 17 Nov 2024
https://github.com/omar7tech/text-summarization
This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.
natural-language-processing python spacy text-summarization tokenization
Last synced: 18 Dec 2024
https://github.com/pyladiesams/nlp-projects-with-spacy-may2024
NLP projects with spaCy
Last synced: 18 Dec 2024
https://github.com/navaneethelite/ner_streamlit
A genreal purpose Named Entity Recognition model using Spacy v3. This web app was built using streamlit and deployed to Heroku.
Last synced: 29 Nov 2024
https://github.com/samestrin/llm-services-api
A FastAPI-powered REST API offering a comprehensive suite of natural language processing services using machine learning models with PyTorch and Transformers, packaged in a Docker container to run efficiently.
api docker fastapi hugging-face hugging-face-transformers huggingface-transformers keybert llm openai-compatible-api python python3 pytorch rest rest-api spacy torch transformers uvicorn
Last synced: 18 Dec 2024
https://github.com/shaadclt/businesscard-dataextraction-ocr-ner
This project aims to extract structured data from business cards using a combination of OpenCV, PyTesseract, and spaCy.
ner ocr opencv pytesseract spacy
Last synced: 07 Dec 2024
https://github.com/gtoffoli/spacy-cameltokenizer
Tokenizer extension for the Arabic language (MSA), integrating the Morphological Tokenizer of the camel_tools project (CAMeL Lab).
arabic nlp spacy spacy-pipeline tokenizer tools
Last synced: 30 Nov 2024
https://github.com/etdds/redditquotebot
A Reddit comment bot for detecting and replying to famous quotes.
bot chatbot natural-language-processing nlp praw python reddit spacy
Last synced: 23 Nov 2024
https://github.com/jash271/youglance
Package for analyzing Youtube Videos from searching by relevant entities to analyzing sentiments and clustering different parts of the video according to your liking
cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling
Last synced: 28 Nov 2024
https://github.com/whatevery1says/preprocessing
WE1S Preprocessing -- workflow preparing documents for import as WE1S data
digital-humanities humanities news nltk preprocessing spacy topic-modeling
Last synced: 14 Nov 2024
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 14 Oct 2024
https://github.com/inshh04/codealpha_chatbotforfaqs_inshanadeem
The FAQ Chatbot is a Python-based conversational agent designed to interact with users and respond to frequently asked questions. It offers a simple and engaging way to provide automated responses, handle polite interactions like thanking the user, and end conversations gracefully. This project serves as a basic template for building more advanced.
chatbot faqbot faqchatbot faqs keyword-extraction nlp nlp-machine-learning progressive-web-app project python python3 pythonprojects spacy spacy-nlp
Last synced: 18 Dec 2024
https://github.com/sukanyadutta52/sentiment-analysis
An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis
flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard
Last synced: 07 Dec 2024
https://github.com/f1uctus/p4a-recipes
📱 🐍 A collection of recipes for p4a (Python for Android).
android blis docker numpy p4a python python-for-android spacy
Last synced: 15 Nov 2024
https://github.com/moindalvs/text_mining_nlp
Natural Language Processing
bag-of-words classifier data-science fake-news lemmatization nlp pipeline sentiment-analysis sentiment-classification spacy spacy-pipeline stemming text-classification text-mining tfidf tokenization vectorizer
Last synced: 17 Nov 2024
https://github.com/moindalvs/sentiment_analysis_on_-elon_musk_tweets
Perform sentimental analysis on the Elon-musk tweets (Elon-musk.csv)
bag-of-words cleaning-data elon-musk feature-engineering nlp nltk polarity sentiment-analysis sentiment-intensity sentiment-polarity spacy subjectivity text-mining text-processing textblob-sentiment-analysis tfidf tfidf-vectorizer tokenizer tweet-analysis twitter-sentiment-analysis
Last synced: 17 Nov 2024
https://github.com/srstevenson/keyword-extractor
Extract keywords from plain text documents
Last synced: 20 Nov 2024
https://github.com/thyripian/core
This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.
api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite
Last synced: 14 Dec 2024
https://github.com/ccoreilly/spacy-catala-generator
Training and dataset used for the catalan spacy model
catala catalan catalan-language spacy spacy-models
Last synced: 17 Dec 2024
https://github.com/bghorvath/TextMiningTheBechdelTest
Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test
bechdel bechdel-test coreference-resolution neuralcoref spacy
Last synced: 16 Nov 2024
https://github.com/florensadimer/nlp_ner_soccer_pt-br
Anotação Manual e Comparação com Modelos Treinados
annotation llm machine-learning ner nlp spacy
Last synced: 09 Dec 2024
https://github.com/innerdoc/spacy-for-datashare
Let spaCy do the parsing of Named Entities for documents in the Datashare platform
datashare elasticsearch named-entity-recognition natural-language-processing spacy
Last synced: 20 Nov 2024
https://github.com/debugger404/multilanguage-pos
Named Entity Recognition with SpaCy - 🌐📝 Repository for NER using SpaCy's MultiLanguage module. Supports multiple languages.
multilanguage named-entity-recognition ner python3 spacy
Last synced: 22 Dec 2024
https://github.com/aditya172926/text_summarization
Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.
glove machine-learning nlp python scikit-learn spacy
Last synced: 24 Nov 2024
https://github.com/den1ksk/nlp-with-disastertweets
Kaggle competition
bert data-science deeplearning kaggle machine-learning nlp nltk pytorch spacy transformers xgboost
Last synced: 20 Nov 2024
https://github.com/tomhalloin/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 16 Nov 2024