Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
![](https://explore-feed.github.com/topics/spacy/spacy.png)
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2025-02-12 00:28:10 UTC
- JSON Representation
https://github.com/ajla-brdarevic/pdf_question_generator
Project - Artificial intelligence
ai flask machine-learning mt5 pypdf2 python spacy transformers
Last synced: 03 Feb 2025
https://github.com/yash22222/terrorist-activity-forecasting-and-risk-assessment-system
In an era marked by global security challenges, the "TAFRAS" emerges as a cutting-edge solution to tackle the ever-evolving threat of terrorism. The project is grounded in the urgent need for predictive systems that can anticipate, assess, and mitigate potential terrorist activities.
corpora data-vizualisation folium-maps gensim global-terrorism-database lda machine-learning matplotlib networkx nltk nmf numpy pandas python random-forest-classifier seaborn sklearn spacy textblob vader-sentiment-analysis
Last synced: 05 Jan 2025
https://github.com/etdds/redditquotebot
A Reddit comment bot for detecting and replying to famous quotes.
bot chatbot natural-language-processing nlp praw python reddit spacy
Last synced: 23 Jan 2025
https://github.com/gugarosa/brainy
🧠 An intelligent Python-inspired Machine Learning API for training NLP-based models.
api machine-learning nlp python spacy
Last synced: 02 Feb 2025
https://github.com/keshabkjha/climasense
ClimaSense is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot , built using Python (Streamlit & SpaCy), that responds to weather-related queries.
html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app
Last synced: 06 Feb 2025
https://github.com/florensadimer/nlp_ner_soccer_pt-br
Anotação Manual e Comparação com Modelos Treinados
annotation llm machine-learning ner nlp spacy
Last synced: 09 Dec 2024
https://github.com/aitechhero/nonullsense-nlp
Natural Language Processing (NLP) with libraries like spaCy, Transformers, and NLTK.
ai artificial-intelligence huggingface natural-language-processing nlp nltk python spacy text-analysis transformers
Last synced: 09 Jan 2025
https://github.com/kr1shnasomani/webscrub
Python code which extracts the html content, converts it to clean text and pre-processes the text
beautifulsoup html2text natural-language-processing pypi scikit-learn selenium spacy
Last synced: 13 Feb 2025
https://github.com/devbm7/qgen
Question Generator System
json ml nlp nltk pandas pymupdf-fitz python3 pytorch regular-expressions smtp spacy streamlit t5-large transformers wikipedia-api
Last synced: 10 Dec 2024
https://github.com/datarohit/nlp-course-files
The files in this Repo are files for the online NLP-Course from Udemy.com which I completed.
nlp nlp-machine-learning nltk numpy panda python sklearn spacy
Last synced: 23 Dec 2024
https://github.com/bjam24/agh-natural-language-processing
This respository contains projects made for the NLP course at the AGH UST in 2024.
agh agh-wi elasticsearch language-modeling language-modelling levenshtein llm ner nlp prompt-engineering regex spacy text-classificaiton text-classification
Last synced: 23 Jan 2025
https://github.com/lykmapipo/us-inaugural-addresses
Python scripts to download, process, and analyze US Inaugural Addresses
beautifulsoup4 gensim joblib lykmapipo natural-language-processing nlp nltk python python-scripts requests spacy text-analysis text-analytics text-extraction text-processing web-scraping
Last synced: 14 Feb 2025
https://github.com/pyladiesams/nlp-projects-with-spacy-may2024
NLP projects with spaCy
Last synced: 10 Feb 2025
https://github.com/bonysmoke/speliuk
A more accurate spelling correction for the Ukrainian language.
correction kenlm spacy spelling symspell ukrainian
Last synced: 09 Feb 2025
https://github.com/sukanyadutta52/sentiment-analysis
An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis
flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard
Last synced: 02 Feb 2025
https://github.com/ggordonhall/measurement_tagger
Spacy Measurement Tagger
dependency-parser measurement-tagger nlp python spacy tagger
Last synced: 26 Dec 2024
https://github.com/jblake1965/elucidoc
Screens legal text and extracts sentences containing user input party name-predicate phrases
excel law legal-documents legal-text-analytics natural-language-processing python-script python3 spacy textacy word
Last synced: 23 Jan 2025
https://github.com/moindalvs/text_mining_nlp
Natural Language Processing
bag-of-words classifier data-science fake-news lemmatization nlp pipeline sentiment-analysis sentiment-classification spacy spacy-pipeline stemming text-classification text-mining tfidf tokenization vectorizer
Last synced: 18 Jan 2025
https://github.com/moindalvs/sentiment_analysis_on_-elon_musk_tweets
Perform sentimental analysis on the Elon-musk tweets (Elon-musk.csv)
bag-of-words cleaning-data elon-musk feature-engineering nlp nltk polarity sentiment-analysis sentiment-intensity sentiment-polarity spacy subjectivity text-mining text-processing textblob-sentiment-analysis tfidf tfidf-vectorizer tokenizer tweet-analysis twitter-sentiment-analysis
Last synced: 18 Jan 2025
https://github.com/rkirlew/custom-resume-ner-model-development-with-spacy
I developed a custom Named Entity Recognition (NER) model using spaCy. The process involved manually annotating data, training the model, and evaluating its performance on unseen text. This project provided hands-on experience in working with NLP models, data annotation, and model training pipelines.
machine-learning named-entity-recognition ner spacy spacy-nlp
Last synced: 11 Jan 2025
https://github.com/csfelix/nlp-0-spacy-course
💬 Advanced NLP with Spacy Course
natural-language-processing nlp python spacy
Last synced: 30 Jan 2025
https://github.com/thyripian/core
This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.
api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite
Last synced: 07 Feb 2025
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 14 Oct 2024
https://github.com/aditya172926/text_summarization
Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.
glove machine-learning nlp python scikit-learn spacy
Last synced: 24 Jan 2025
https://github.com/tomhalloin/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 16 Nov 2024
https://github.com/bghorvath/TextMiningTheBechdelTest
Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test
bechdel bechdel-test coreference-resolution neuralcoref spacy
Last synced: 16 Nov 2024
https://github.com/tbarlow12/wiki-answer
I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions
nlp python question-answering spacy wikipedia
Last synced: 02 Feb 2025
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spacy and Sklearn
Last synced: 02 Jan 2025
https://github.com/chewzzz1014/fyp
backend fastapi final-year-project flair fyp machine-learning ner project spacy transformer
Last synced: 26 Jan 2025
https://github.com/dagmawi-22/hotel-ai
Hotel Customer Support Chatbot Rest API
django nltk pyspellchecker python spacy
Last synced: 09 Feb 2025
https://github.com/blacksujit/quantumlens
QuantumLens is a cutting-edge, AI-powered information assistant designed to revolutionize how you interact with and process information. By leveraging advanced machine learning algorithms and natural language processing techniques.
ai bert bert-embeddings dataanalysis information integration-flow intellij-idea ml model models nlp-machine-learning processing project research spacy spacy-models spacy-nlp spacy-pipeline summeriza summerization
Last synced: 08 Feb 2025
https://github.com/yashaswini-lankalapalli/text-summarization
Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.
Last synced: 12 Oct 2024
https://github.com/aydan-moon/news_headlines_ner
Named Entity Recognition (NER) model for analyzing entities in news headlines using spaCy and trained on the CoNLL-2003 dataset.
conll-2003 ner nlp python spacy
Last synced: 21 Jan 2025
https://github.com/emmy-bradfield/hilly_xmas
A simple ChatBot built using openAI's davinci 003 as a gift for a dear friend of ours
machine-learning natural-language-processing openai python spacy
Last synced: 21 Jan 2025
https://github.com/aranzadata/moviereviewclassifier
Modelo de análisis de sentimientos basado en BERT para 45,000 reseñas de películas, logrando una puntuación F1 de 0.88 al aprovechar técnicas avanzadas de preprocesamiento de texto con NLTK y SpaCy
Last synced: 04 Feb 2025
https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch
Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch
elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec
Last synced: 22 Dec 2024
https://github.com/samarthhchinivar/nlp-codebasics-playlist
This is a GitHub repository containing Jupyter notebooks and Python scripts related to natural language processing (NLP) concepts and techniques covered in the "NLP with Python" playlist by Codebasics YouTube channel. The notebooks cover topics such as text preprocessing, feature extraction using Python libraries NLTK, SpaCy
nlp-machine-learning nltk python3 spacy
Last synced: 06 Jan 2025
https://github.com/parthapray/pii_scrubbing_llm
This repo contains codes about PII scrubbing heuristics search before calling to LLM (local and remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 12 Feb 2025
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 12 Feb 2025
https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods
Text-Summarizer-Using-NLP-and-TF-IDF-Methods
Last synced: 04 Feb 2025
https://github.com/lfoppiano/docker-image-spacy
Docker image for shipping spacy
Last synced: 10 Feb 2025
https://github.com/d5555/textcat_dataset_imdb
Movie Review Dataset for binary sentiment classification
categories dataset spacy textcat textcategorizer
Last synced: 02 Jan 2025
https://github.com/ahmedabdalkreem/grammer-auto-correct
In this project work to make classification between the phase is correct or wrong if phase is right print the correct phase if phase is wrong be input of Transfer Learning and print the phase begore correct.
decision-trees logistic-regression machine-learning matplotlib-pyplot naive-bayes-classifier nlp nltk-library pandas-library python random-forest sklearn spacy svm-model transfer-learning
Last synced: 16 Jan 2025
https://github.com/aubainmbk/analyse-des-avis-clients-amazon
Utiliser l’analyse des sentiments et le clustering sur des avis Amazon à des fins de marketing et de satisfaction des clients.
clustering marketing-analytics nlp-machine-learning nltk pca spacy vader-sentiment-analysis
Last synced: 05 Feb 2025
https://github.com/zevio/pcu_nlp
NLP pipeline (spacy.io) for PCU project
component natural-language-processing nlp nlp-pipeline pcu pcu-nlp pipeline python spacy
Last synced: 02 Feb 2025
https://github.com/isabelleysseric/question-answering
Building a Natural Language Question & Answer Search Engine with corpus in Python language.
corpus deep-learning nlp qa question-answering spacy whoosh
Last synced: 30 Dec 2024
https://github.com/manik2000/radiohead-lyrics
NLP analysis of Radiohead's songs lyrics.
embeddings huggingface-transformers nlp spacy
Last synced: 09 Feb 2025
https://github.com/itsdaiton/named-entity-visualizer
NEV short for Named Entity Visualizer is a tool to visualize entities found in unstructured text built in Python.
named-entity-linking named-entity-recognition natural-language-processing nlp-pipeline python spacy wikidata
Last synced: 11 Feb 2025
https://github.com/tristan-mcinnis/spacy-models-setup-and-testing
A Python utility for downloading, storing, and testing Spacy language models for English and Chinese NLP tasks.
chinese english nlp python simple-project spacy testing
Last synced: 10 Feb 2025
https://github.com/thekartikeyamishra/documentsummarizer
The Document Summarizer is a Python-based application that extracts summaries from uploaded text and PDF documents using Natural Language Processing (NLP) techniques. This project includes a basic GUI to interact with the application, upload documents, and view the summarized content.
machine-learning nlp nlp-machine-learning pdfplumber python spacy tkinter tkinter-gui
Last synced: 02 Feb 2025
https://github.com/wesslen/spacy-ecfr-ner
spaCy-Prodigy workflow for NER Citation model on eCFR Banking Regulation
Last synced: 13 Feb 2025
https://github.com/woranov/spacy-lazy-docbin
Lazy-loadable and indexable spaCy DocBins
Last synced: 18 Jan 2025
https://github.com/gopireddy99/named_entity_recognition
NLP Concept on Simple NER(Named Entity Recognition) using Spacy and pandas
Last synced: 01 Feb 2025
https://github.com/nxgeo/id-svo-extractor
id-svo-extractor: Extract SVO triples from Indonesian text.
artificial-intelligence indonesian-language indonesian-linguistics indonesian-nlp information-extraction knowledge-extraction knowledge-representation natural-language-processing nlp python rdf-triples spacy spacy-stanza stanza text-analysis triple-extraction
Last synced: 07 Dec 2024
https://github.com/turbolent/spacy-http
spaCy as a HTTP service
api named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy
Last synced: 27 Jan 2025
https://github.com/medspacy/nlp_postprocessor
A spaCy component for executing custom logic at the end of a pipeline.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 09 Jan 2025
https://github.com/arkadiuszkaros/nlp-book-pos-extractor
This project focuses on extracting sentences from the text of two popular book series: Harry Potter and Game of Thrones. Using Natural Language Processing (NLP) techniques powered by spaCy, the project aims to identify and analyze the parts of speech (POS) for each word in a sentence.
extractor nlp part-of-speech-tagging python spacy
Last synced: 02 Feb 2025
https://github.com/jpedrou/spotify-nlp-analysis
Repository created with the aim of analyzing song lyrics with the help of Spotify API and Natural Language Processing algorithms.
genius-api matplotlib natural-language-processing nltk python3 spacy spotify-api
Last synced: 01 Feb 2025
https://github.com/imvladikon/quora-question-pair
duplicates detection experiments on Quora Question Pairs (QQP)
Last synced: 02 Jan 2025
https://github.com/ajaykumar095/natural_language_processing
Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.
ann nltk-python python rnn spacy tensorflow text-preprocessing textblob
Last synced: 22 Dec 2024
https://github.com/ayaz-amin/speechpos
A simple Python script that tags speech to parts-of-speech
deep-learning machine-learning python3 spacy
Last synced: 29 Jan 2025
https://github.com/cllspy/nlp-playground
application to understand key concepts of nlp
Last synced: 07 Feb 2025
https://github.com/izuna385/arxiv-checker-backend
This is an API and backend modules to return accepted papers related to natural language processing from arxiv.
docker fastapi natural-language-processing pytest spacy tdd tdd-python
Last synced: 02 Feb 2025
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/blue-codes-yep/AI.AT
AI-Powered Text-To-Speech Script Generator This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.
ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp
Last synced: 06 Jan 2025
https://github.com/rfdzan/summarize-search-result
extractive text summarization with a handful of different libraries
natural-language-processing python spacy
Last synced: 28 Dec 2024
https://github.com/satoru-shibata-jpn/nlp
自然言語処理
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 02 Dec 2024
https://github.com/leosimoes/coursera-usp-pln-i
Atividades do curso "Processamento Neural de Linguagem Natural em Português I" oferecido pela USP através do Coursera.
Last synced: 30 Jan 2025
https://github.com/saifinohwal/sentiment-analysis
Sentiment analysis of Steve Jobs speech
lemmetization nlp spacy summarization tokenization wordcloud-visualization
Last synced: 10 Feb 2025
https://github.com/lucas54neves/dependency-parsing
Repository of the project for the Introduction to Natural Language Processing discipline of the Computer Science course at the University of Lavras, whose task objective is to explore the parsing of dependencies, using the SpaCy tool.
dependency-parsing nlp python spacy spacy-nlp
Last synced: 13 Jan 2025
https://github.com/surbhi242singh/text_summarizer
machine-learning nlp spacy tokenization
Last synced: 08 Feb 2025
https://github.com/xwiz/spacy_symspell
Spacy symspell extension
spacy spelling-correction spelling-suggestions symspell
Last synced: 02 Feb 2025
https://github.com/miweru/vrt_spacy
corpora linguistic-corpora linguistics nlp spacy vrt wrapper
Last synced: 02 Feb 2025
https://github.com/thjbdvlt/solipcysme
spaCy pipeline for french focused on personal pronouns, fictions and first person point of view texts.
french french-nlp lemmatization morphological-analysis natural-language-processing nlp nlp-french normalization part-of-speech-tagging pos-tagging spacy spacy-extensions tokenization word-embeddings
Last synced: 17 Jan 2025
https://github.com/cmucheru/chatbot
A conversational chatbot for embedding in a site.
Last synced: 08 Feb 2025
https://github.com/centrefordigitalhumanities/textminer
A script to detect named entities and store them in an Elasticsearch annotated_text field
annotation elasticsearch ner spacy
Last synced: 25 Dec 2024
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 06 Feb 2025
https://github.com/lilivalgo/nlp-analysis-of-un-climate-change-reports
This project uses Natural Language Processing (NLP) techniques to analyze large amounts of textual data from UN reports on climate change. By applying NLP, the project aims to extract valuable information that can shed light on critical aspects of climate change
beautifulsoup4 matplotlib pandas pypdf2 seaborn spacy text-analysis text-processing webscraping
Last synced: 12 Oct 2024
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 13 Feb 2025
https://github.com/simeonhristov99/ati
Ati is a web-based application for predicting which famous classic Bulgarian novelist wrote a piece of text (short or long).
authorship-attribution embeddings jupyter-notebook multiclass-classification nlp optuna pycaret python3 scraping-websites spacy transformer
Last synced: 13 Jan 2025
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests, and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 22 Dec 2024
https://github.com/zofiaqlt/nlp_libraries_tweets_analysis
🎯 Exploration of NLP libraries (nltk, spacy) and tweets analysis - use of Python and JupyterLab (Data collection, Cleaning, EDA, Classification, and Data Visualization)
Last synced: 12 Jan 2025
https://github.com/benevanio/nasa-api-astro
Projeto utilizando a API da nasa.
apdo api api-client api-rest api-server astronomy css frond-end-development html5 javascipt javascipt-ai javascript nasa-api nasa-data react-router reactjs space spaceship spacy
Last synced: 28 Jan 2025
https://github.com/francislauriano/chatsoftex
Plataforma desenvolvida em Python que visa automatizar e agilizar o processo de avaliação de projetos de inovação tecnológica, utilizando inteligência artificial e critérios padronizados com base na Lei do Bem.
cryptography fernet firebase flask flask-jwt-extended hugging-face-transformers numpy openai pdfplumber postgresql pyjwt pymupdf-fitz pypdf2 python pytorch scikit-learn scipy spacy sqlalchemy tensorflow
Last synced: 03 Feb 2025
https://github.com/bglid/job-application-helper
Project to incorporate web scraping of job applications and then analyze them using NLP methods.
nlp spacy streamlit text-processing webscraping
Last synced: 07 Dec 2024
https://github.com/sydney-informatics-hub/clause-segmenter
A clause segmenting tool utilising Python's SpaCy
Last synced: 08 Feb 2025
https://github.com/pavithra-hn/text-summarizer
The Text Summarizer is a web-based application that allows users to input a piece of text and receive a summarized version of that text. The summarization is performed using NLP techniques to extract key information and provide a concise summary.
flask html-css-javascript nlp-library nltk python spacy
Last synced: 11 Feb 2025
https://github.com/hackerajofficial/chatbot
ChatBot capable of answering user queries while also integrating a conversational form to collect user information such as Name, Email, Phone Number, and Address using Python with Django
chat-application chatbot chatbots chatterbot django hackeraj hackerajofficial spacy spacy-nlp
Last synced: 10 Feb 2025
https://github.com/naveen3830/splashtop_analysis
This repository contains the code for my webapp splashtop website analysis.
nlp-keywords-extraction python spacy streamlit
Last synced: 07 Dec 2024
https://github.com/rafelafrance/angiospermtraiter
Using rule-based parsers to extract information from plant treatments
Last synced: 09 Dec 2024