Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
![](https://explore-feed.github.com/topics/spacy/spacy.png)
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2025-02-09 00:28:04 UTC
- JSON Representation
https://github.com/ajla-brdarevic/pdf_question_generator
Project - Artificial intelligence
ai flask machine-learning mt5 pypdf2 python spacy transformers
Last synced: 03 Feb 2025
https://github.com/yash22222/terrorist-activity-forecasting-and-risk-assessment-system
In an era marked by global security challenges, the "TAFRAS" emerges as a cutting-edge solution to tackle the ever-evolving threat of terrorism. The project is grounded in the urgent need for predictive systems that can anticipate, assess, and mitigate potential terrorist activities.
corpora data-vizualisation folium-maps gensim global-terrorism-database lda machine-learning matplotlib networkx nltk nmf numpy pandas python random-forest-classifier seaborn sklearn spacy textblob vader-sentiment-analysis
Last synced: 05 Jan 2025
https://github.com/etdds/redditquotebot
A Reddit comment bot for detecting and replying to famous quotes.
bot chatbot natural-language-processing nlp praw python reddit spacy
Last synced: 23 Jan 2025
https://github.com/gugarosa/brainy
🧠 An intelligent Python-inspired Machine Learning API for training NLP-based models.
api machine-learning nlp python spacy
Last synced: 02 Feb 2025
https://github.com/keshabkjha/climasense
ClimaSense is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot , built using Python (Streamlit & SpaCy), that responds to weather-related queries.
html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app
Last synced: 06 Feb 2025
https://github.com/florensadimer/nlp_ner_soccer_pt-br
Anotação Manual e Comparação com Modelos Treinados
annotation llm machine-learning ner nlp spacy
Last synced: 09 Dec 2024
https://github.com/aitechhero/nonullsense-nlp
Natural Language Processing (NLP) with libraries like spaCy, Transformers, and NLTK.
ai artificial-intelligence huggingface natural-language-processing nlp nltk python spacy text-analysis transformers
Last synced: 09 Jan 2025
https://github.com/devbm7/qgen
Question Generator System
json ml nlp nltk pandas pymupdf-fitz python3 pytorch regular-expressions smtp spacy streamlit t5-large transformers wikipedia-api
Last synced: 10 Dec 2024
https://github.com/datarohit/nlp-course-files
The files in this Repo are files for the online NLP-Course from Udemy.com which I completed.
nlp nlp-machine-learning nltk numpy panda python sklearn spacy
Last synced: 23 Dec 2024
https://github.com/bjam24/agh-natural-language-processing
This respository contains projects made for the NLP course at the AGH UST in 2024.
agh agh-wi elasticsearch language-modeling language-modelling levenshtein llm ner nlp prompt-engineering regex spacy text-classificaiton text-classification
Last synced: 23 Jan 2025
https://github.com/pyladiesams/nlp-projects-with-spacy-may2024
NLP projects with spaCy
Last synced: 10 Feb 2025
https://github.com/bonysmoke/speliuk
A more accurate spelling correction for the Ukrainian language.
correction kenlm spacy spelling symspell ukrainian
Last synced: 09 Feb 2025
https://github.com/charlesyuan02/named_entity_recognition
Utilizing Spacy and Tensorflow to train custom Named Entity Recognizers.
conll-2003 named-entity-recognition ner nlp spacy transformer
Last synced: 19 Dec 2024
https://github.com/sukanyadutta52/sentiment-analysis
An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis
flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard
Last synced: 02 Feb 2025
https://github.com/ggordonhall/measurement_tagger
Spacy Measurement Tagger
dependency-parser measurement-tagger nlp python spacy tagger
Last synced: 26 Dec 2024
https://github.com/jblake1965/elucidoc
Screens legal text and extracts sentences containing user input party name-predicate phrases
excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word
Last synced: 23 Jan 2025
https://github.com/moindalvs/text_mining_nlp
Natural Language Processing
bag-of-words classifier data-science fake-news lemmatization nlp pipeline sentiment-analysis sentiment-classification spacy spacy-pipeline stemming text-classification text-mining tfidf tokenization vectorizer
Last synced: 18 Jan 2025
https://github.com/moindalvs/sentiment_analysis_on_-elon_musk_tweets
Perform sentimental analysis on the Elon-musk tweets (Elon-musk.csv)
bag-of-words cleaning-data elon-musk feature-engineering nlp nltk polarity sentiment-analysis sentiment-intensity sentiment-polarity spacy subjectivity text-mining text-processing textblob-sentiment-analysis tfidf tfidf-vectorizer tokenizer tweet-analysis twitter-sentiment-analysis
Last synced: 18 Jan 2025
https://github.com/rkirlew/custom-resume-ner-model-development-with-spacy
I developed a custom Named Entity Recognition (NER) model using spaCy. The process involved manually annotating data, training the model, and evaluating its performance on unseen text. This project provided hands-on experience in working with NLP models, data annotation, and model training pipelines.
machine-learning named-entity-recognition ner spacy spacy-nlp
Last synced: 11 Jan 2025
https://github.com/csfelix/nlp-0-spacy-course
💬 Advanced NLP with Spacy Course
natural-language-processing nlp python spacy
Last synced: 30 Jan 2025
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 14 Oct 2024
https://github.com/aditya172926/text_summarization
Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.
glove machine-learning nlp python scikit-learn spacy
Last synced: 24 Jan 2025
https://github.com/tomhalloin/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 16 Nov 2024
https://github.com/bghorvath/TextMiningTheBechdelTest
Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test
bechdel bechdel-test coreference-resolution neuralcoref spacy
Last synced: 16 Nov 2024
https://github.com/veldhub/veld_chain__mara_load_and_publish_models
Chain velds for publishing self-trained MARA models to huggingface.
Last synced: 21 Jan 2025
https://github.com/itsdaiton/named-entity-visualizer
NEV short for Named Entity Visualizer is a tool to visualize entities found in unstructured text built in Python.
named-entity-linking named-entity-recognition natural-language-processing nlp-pipeline python spacy wikidata
Last synced: 11 Feb 2025
https://github.com/tbarlow12/wiki-answer
I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions
nlp python question-answering spacy wikipedia
Last synced: 02 Feb 2025
https://github.com/jamnicki/bachelor_thesis_project
System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning
active-learning active-learning-in-nlp annotation-tool argilla kpwr named-entity-recognition nlp optimization sampling-methods sequence-labeling sequential-data spacy
Last synced: 21 Dec 2024
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spacy and Sklearn
Last synced: 02 Jan 2025
https://github.com/chewzzz1014/fyp
backend fastapi final-year-project flair fyp machine-learning ner project spacy transformer
Last synced: 26 Jan 2025
https://github.com/e3oroush/music_sorting
A simple project for categorizing your local musics. Find and delete the duplicate music files in your local machine
duplication-detection mediainfo music-duplication-detection music-information-retrieval python spacy
Last synced: 29 Jan 2025
https://github.com/yashaswini-lankalapalli/text-summarization
Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.
Last synced: 12 Oct 2024
https://github.com/ssciwr/argumentation-management
Annotator combining different NLP pipelines.
corpus-linguistics cwb hacktoberfest natural-language-processing nlp part-of-speech python sentencizer spacy tokenization
Last synced: 18 Jan 2025
https://github.com/shiv010hbtu/sentiment-analysis
Sentiment Analysis
django pandas python spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/vanheemstrasystems/spacy
SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Last synced: 17 Nov 2024
https://github.com/lilivalgo/nlp-for-ipcc-climate-reports
This project combines web scraping, PDF processing, and Natural Language Processing (NLP) to extract and analyze IPCC climate reports. It automates downloading PDFs, processes file validation, and applies NLP for data insights.
beautifulsoup4 matplotlib nlp pandas pypdf2 python requests seaborn spacy text-analysis text-processing webscraping
Last synced: 17 Nov 2024
https://github.com/woranov/spacy-lazy-docbin
Lazy-loadable and indexable spaCy DocBins
Last synced: 18 Jan 2025
https://github.com/pabvald/bachelor-thesis
Bachelor's thesis overview
chatbots dialogflow fasttext glove nlp spacy university-of-valladolid user-evaluation virtual-assistants word-embeddings word2vec
Last synced: 29 Jan 2025
https://github.com/dagmawi-22/hotel-ai
Hotel Customer Support Chatbot Rest API
django nltk pyspellchecker python spacy
Last synced: 09 Feb 2025
https://github.com/manik2000/radiohead-lyrics
NLP analysis of Radiohead's songs lyrics.
embeddings huggingface-transformers nlp spacy
Last synced: 09 Feb 2025
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/samarthhchinivar/nlp-codebasics-playlist
This is a GitHub repository containing Jupyter notebooks and Python scripts related to natural language processing (NLP) concepts and techniques covered in the "NLP with Python" playlist by Codebasics YouTube channel. The notebooks cover topics such as text preprocessing, feature extraction using Python libraries NLTK, SpaCy
nlp-machine-learning nltk python3 spacy
Last synced: 06 Jan 2025
https://github.com/luis54929/oscarbot
OscarBot: Chatbot de IA personalizado para el área de tecnología del Banco de Occidente. Asistente inteligente para procesos internos y consultas hacia tecnología..
ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy
Last synced: 21 Dec 2024
https://github.com/blacksujit/quantumlens
QuantumLens is a cutting-edge, AI-powered information assistant designed to revolutionize how you interact with and process information. By leveraging advanced machine learning algorithms and natural language processing techniques.
ai bert bert-embeddings dataanalysis information integration-flow intellij-idea ml model models nlp-machine-learning processing project research spacy spacy-models spacy-nlp spacy-pipeline summeriza summerization
Last synced: 08 Feb 2025
https://github.com/saifinohwal/sentiment-analysis
Sentiment analysis of Steve Jobs speech
lemmetization nlp spacy summarization tokenization wordcloud-visualization
Last synced: 10 Feb 2025
https://github.com/kr1shnasomani/summarai
Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)
natural-language-processing pytextrank pytorch spacy transformers
Last synced: 21 Dec 2024
https://github.com/centrefordigitalhumanities/textminer
A script to detect named entities and store them in an Elasticsearch annotated_text field
annotation elasticsearch ner spacy
Last synced: 25 Dec 2024
https://github.com/tristan-mcinnis/spacy-models-setup-and-testing
A Python utility for downloading, storing, and testing Spacy language models for English and Chinese NLP tasks.
chinese english nlp python simple-project spacy testing
Last synced: 10 Feb 2025
https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch
Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch
elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec
Last synced: 22 Dec 2024
https://github.com/thekartikeyamishra/documentsummarizer
The Document Summarizer is a Python-based application that extracts summaries from uploaded text and PDF documents using Natural Language Processing (NLP) techniques. This project includes a basic GUI to interact with the application, upload documents, and view the summarized content.
machine-learning nlp nlp-machine-learning pdfplumber python spacy tkinter tkinter-gui
Last synced: 02 Feb 2025
https://github.com/arkadiuszkaros/nlp-book-pos-extractor
This project focuses on extracting sentences from the text of two popular book series: Harry Potter and Game of Thrones. Using Natural Language Processing (NLP) techniques powered by spaCy, the project aims to identify and analyze the parts of speech (POS) for each word in a sentence.
extractor nlp part-of-speech-tagging python spacy
Last synced: 02 Feb 2025
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 06 Feb 2025
https://github.com/simeonhristov99/ati
Ati is a web-based application for predicting which famous classic Bulgarian novelist wrote a piece of text (short or long).
authorship-attribution embeddings jupyter-notebook multiclass-classification nlp optuna pycaret python3 scraping-websites spacy transformer
Last synced: 13 Jan 2025
https://github.com/gopireddy99/named_entity_recognition
NLP Concept on Simple NER(Named Entity Recognition) using Spacy and pandas
Last synced: 01 Feb 2025
https://github.com/aubainmbk/analyse-des-avis-clients-amazon
Utiliser l’analyse des sentiments et le clustering sur des avis Amazon à des fins de marketing et de satisfaction des clients.
clustering marketing-analytics nlp-machine-learning nltk pca spacy vader-sentiment-analysis
Last synced: 05 Feb 2025
https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries
A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.
clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization
Last synced: 21 Dec 2024
https://github.com/zevio/pcu_nlp
NLP pipeline (spacy.io) for PCU project
component natural-language-processing nlp nlp-pipeline pcu pcu-nlp pipeline python spacy
Last synced: 02 Feb 2025
https://github.com/zofiaqlt/nlp_libraries_tweets_analysis
🎯 Exploration of NLP libraries (nltk, spacy) and tweets analysis - use of Python and JupyterLab (Data collection, Cleaning, EDA, Classification, and Data Visualization)
Last synced: 12 Jan 2025
https://github.com/jpedrou/spotify-nlp-analysis
Repository created with the aim of analyzing song lyrics with the help of Spotify API and Natural Language Processing algorithms.
genius-api matplotlib natural-language-processing nltk python3 spacy spotify-api
Last synced: 01 Feb 2025
https://github.com/benevanio/nasa-api-astro
Projeto utilizando a API da nasa.
apdo api api-client api-rest api-server astronomy css frond-end-development html5 javascipt javascipt-ai javascript nasa-api nasa-data react-router reactjs space spaceship spacy
Last synced: 28 Jan 2025
https://github.com/lucas54neves/dependency-parsing
Repository of the project for the Introduction to Natural Language Processing discipline of the Computer Science course at the University of Lavras, whose task objective is to explore the parsing of dependencies, using the SpaCy tool.
dependency-parsing nlp python spacy spacy-nlp
Last synced: 13 Jan 2025
https://github.com/izuna385/arxiv-checker-backend
This is an API and backend modules to return accepted papers related to natural language processing from arxiv.
docker fastapi natural-language-processing pytest spacy tdd tdd-python
Last synced: 02 Feb 2025
https://github.com/pavithra-hn/text-summarizer
The Text Summarizer is a web-based application that allows users to input a piece of text and receive a summarized version of that text. The summarization is performed using NLP techniques to extract key information and provide a concise summary.
flask html-css-javascript nlp-library nltk python spacy
Last synced: 11 Feb 2025
https://github.com/hackerajofficial/chatbot
ChatBot capable of answering user queries while also integrating a conversational form to collect user information such as Name, Email, Phone Number, and Address using Python with Django
chat-application chatbot chatbots chatterbot django hackeraj hackerajofficial spacy spacy-nlp
Last synced: 10 Feb 2025
https://github.com/cllspy/nlp-playground
application to understand key concepts of nlp
Last synced: 07 Feb 2025
https://github.com/nxgeo/id-svo-extractor
id-svo-extractor: Extract SVO triples from Indonesian text.
artificial-intelligence indonesian-language indonesian-linguistics indonesian-nlp information-extraction knowledge-extraction knowledge-representation natural-language-processing nlp python rdf-triples spacy spacy-stanza stanza text-analysis triple-extraction
Last synced: 07 Dec 2024
https://github.com/lilivalgo/nlp-analysis-of-un-climate-change-reports
This project uses Natural Language Processing (NLP) techniques to analyze large amounts of textual data from UN reports on climate change. By applying NLP, the project aims to extract valuable information that can shed light on critical aspects of climate change
beautifulsoup4 matplotlib pandas pypdf2 seaborn spacy text-analysis text-processing webscraping
Last synced: 12 Oct 2024
https://github.com/medspacy/nlp_postprocessor
A spaCy component for executing custom logic at the end of a pipeline.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 09 Jan 2025
https://github.com/blue-codes-yep/AI.AT
AI-Powered Text-To-Speech Script Generator This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.
ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp
Last synced: 06 Jan 2025
https://github.com/surbhi242singh/text_summarizer
machine-learning nlp spacy tokenization
Last synced: 08 Feb 2025
https://github.com/ajaykumar095/natural_language_processing
Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.
ann nltk-python python rnn spacy tensorflow text-preprocessing textblob
Last synced: 22 Dec 2024
https://github.com/ayaz-amin/speechpos
A simple Python script that tags speech to parts-of-speech
deep-learning machine-learning python3 spacy
Last synced: 29 Jan 2025
https://github.com/rfdzan/summarize-search-result
extractive text summarization with a handful of different libraries
natural-language-processing python spacy
Last synced: 28 Dec 2024
https://github.com/xwiz/spacy_symspell
Spacy symspell extension
spacy spelling-correction spelling-suggestions symspell
Last synced: 02 Feb 2025
https://github.com/miweru/vrt_spacy
corpora linguistic-corpora linguistics nlp spacy vrt wrapper
Last synced: 02 Feb 2025
https://github.com/ianhaggerty/final-capstone
This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.
amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud
Last synced: 10 Feb 2025
https://github.com/satoru-shibata-jpn/nlp
自然言語処理
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 02 Dec 2024
https://github.com/leosimoes/coursera-usp-pln-i
Atividades do curso "Processamento Neural de Linguagem Natural em Português I" oferecido pela USP através do Coursera.
Last synced: 30 Jan 2025
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests, and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 22 Dec 2024
https://github.com/touradbaba/nlp-notebooks
This repository contains Jupyter notebooks on various NLP techniques, including text processing, classification, sentiment analysis, and topic modeling.
machine-learning nlp nltk sentiment-analysis spacy text-classification text-processing topic-modeling
Last synced: 08 Feb 2025
https://github.com/thjbdvlt/solipcysme
spaCy pipeline for french focused on personal pronouns, fictions and first person point of view texts.
french french-nlp lemmatization morphological-analysis natural-language-processing nlp nlp-french normalization part-of-speech-tagging pos-tagging spacy spacy-extensions tokenization word-embeddings
Last synced: 17 Jan 2025
https://github.com/bglid/job-application-helper
Project to incorporate web scraping of job applications and then analyze them using NLP methods.
nlp spacy streamlit text-processing webscraping
Last synced: 07 Dec 2024
https://github.com/cmucheru/chatbot
A conversational chatbot for embedding in a site.
Last synced: 08 Feb 2025
https://github.com/hansalemaos/spacy2df
converts a spaCy object into a pandas DataFrame
Last synced: 10 Feb 2025
https://github.com/lexxai/goit_python_ds_hw_12
Модуль 12. Основи NLP.
nlp nlp-machine-learning nlp-spacy nltk nltk-tokenizer spacy spacy-nlp
Last synced: 10 Feb 2025
https://github.com/sydney-informatics-hub/clause-segmenter
A clause segmenting tool utilising Python's SpaCy
Last synced: 08 Feb 2025
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 10 Feb 2025
https://github.com/naveen3830/splashtop_analysis
This repository contains the code for my webapp splashtop website analysis.
nlp-keywords-extraction python spacy streamlit
Last synced: 07 Dec 2024
https://github.com/asaficontact/stack_classifier_project
We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using Regular Expressions, we removed HTML tags and punctuation. We also utilized spaCy to tokenize, lemmatize and remove stop words. Using Keras, we built a 4 layered artificial neural network with a 20% dropout rate using relu and softmax activation functions. We also utilized the adam optimizer and categorical cross-entropy loss function which classified 11 tags 88% successfully.
cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization
Last synced: 22 Dec 2024
https://github.com/nanditha-prabhu/qa-system-via-srl
Question Answering System via Semantic Role Labeling Using Token Classification and Parsing Techniques
Last synced: 10 Feb 2025