Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
![](https://explore-feed.github.com/topics/spacy/spacy.png)
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2025-02-15 00:25:36 UTC
- JSON Representation
https://github.com/e3oroush/music_sorting
A simple project for categorizing your local musics. Find and delete the duplicate music files in your local machine
duplication-detection mediainfo music-duplication-detection music-information-retrieval python spacy
Last synced: 29 Jan 2025
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 10 Feb 2025
https://github.com/ssciwr/argumentation-management
Annotator combining different NLP pipelines.
corpus-linguistics cwb hacktoberfest natural-language-processing nlp part-of-speech python sentencizer spacy tokenization
Last synced: 18 Jan 2025
https://github.com/shiv010hbtu/sentiment-analysis
Sentiment Analysis
django pandas python spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/vanheemstrasystems/spacy
SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Last synced: 17 Nov 2024
https://github.com/lilivalgo/nlp-for-ipcc-climate-reports
This project combines web scraping, PDF processing, and Natural Language Processing (NLP) to extract and analyze IPCC climate reports. It automates downloading PDFs, processes file validation, and applies NLP for data insights.
beautifulsoup4 matplotlib nlp pandas pypdf2 python requests seaborn spacy text-analysis text-processing webscraping
Last synced: 17 Nov 2024
https://github.com/rrayhka/indonesian-ner-spacy
Fine-tuning SpaCy for Indonesian Named Entity Recognition (NER) with custom dataset.
indonesian named-entity-recognition ner nlp spacy
Last synced: 08 Feb 2025
https://github.com/pabvald/bachelor-thesis
Bachelor's thesis overview
chatbots dialogflow fasttext glove nlp spacy university-of-valladolid user-evaluation virtual-assistants word-embeddings word2vec
Last synced: 29 Jan 2025
https://github.com/prashver/nlp-driven-video-summarizer-and-insight-tool
An NLP-powered tool for transcribing, summarizing, and indexing podcast content, with video-to-audio conversion and multilingual support.
flask-application huggingface-transformers keyword-extraction named-entity-recognition natural-language-processing ntlk spacy speech-to-text speech-translation text-summarization topic-modeling
Last synced: 10 Feb 2025
https://github.com/kivanc57/nlp_data_visualization
This project provides Python scripts for analyzing and visualizing text data using efficient NLP methods. It includes tools for creating bar plots, histograms, pie charts, treemaps, violin plots, and word clouds, using libraries such as matplotlib, seaborn, wordcloud, spacy, and textblob.
data-science matplotlib nlp parsing plotting python spacy visualization
Last synced: 08 Feb 2025
https://github.com/bglid/job-application-helper
Project to incorporate web scraping of job applications and then analyze them using NLP methods.
nlp spacy streamlit text-processing webscraping
Last synced: 07 Dec 2024
https://github.com/naveen3830/splashtop_analysis
This repository contains the code for my webapp splashtop website analysis.
nlp-keywords-extraction python spacy streamlit
Last synced: 07 Dec 2024
https://github.com/thekartikeyamishra/resumeevaluatorapp
The Automated Resume Evaluator is a Python-based application that helps evaluate resumes against job descriptions. It calculates an Applicant Tracking System (ATS) score, which is the percentage of keywords from the job description found in the resume.
flask machine-learning matplotlib nlp nltk pypdf python scikit-learn spacy textblob
Last synced: 03 Feb 2025
https://github.com/gopireddy99/named_entity_recognition
NLP Concept on Simple NER(Named Entity Recognition) using Spacy and pandas
Last synced: 01 Feb 2025
https://github.com/blacksujit/quantumlens
QuantumLens is a cutting-edge, AI-powered information assistant designed to revolutionize how you interact with and process information. By leveraging advanced machine learning algorithms and natural language processing techniques.
ai bert bert-embeddings dataanalysis information integration-flow intellij-idea ml model models nlp-machine-learning processing project research spacy spacy-models spacy-nlp spacy-pipeline summeriza summerization
Last synced: 08 Feb 2025
https://github.com/ntinouldinho/machine-learning-classification-and-speech-generation
Explored Greek Parliament Proceedings and tried to classify each speech to a corresponding parliamentary political party.
artificial-intelligence classification-machine-learning machine-learning neural-networks pandas python sklearn spacy
Last synced: 03 Feb 2025
https://github.com/dmytrovoytko/mlops-spacy-sentiment-analysis
MLOps project Training and Deployment of Spacy model for Sentiment analysis
amazon ml-engineering mlflow mlops nlp prefect sentiment-analysis spacy text-classification
Last synced: 10 Feb 2025
https://github.com/atharvapathak/customer_sentiment_analysis
Customer sentiment analysis is the process of using natural language processing (NLP) and machine learning techniques to analyze and understand the feelings, opinions, and attitudes expressed by customers in textual data, such as reviews, feedback, and social media posts.
cnn naive-bayes nlp nltk spacy stemming text-mining tokenization
Last synced: 10 Feb 2025
https://github.com/simeonhristov99/ati
Ati is a web-based application for predicting which famous classic Bulgarian novelist wrote a piece of text (short or long).
authorship-attribution embeddings jupyter-notebook multiclass-classification nlp optuna pycaret python3 scraping-websites spacy transformer
Last synced: 13 Jan 2025
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 06 Feb 2025
https://github.com/parthapray/pii_scrubbing_llm
This repo contains codes about PII scrubbing heuristics search before calling to LLM (local and remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 12 Feb 2025
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 12 Feb 2025
https://github.com/centrefordigitalhumanities/textminer
A script to detect named entities and store them in an Elasticsearch annotated_text field
annotation elasticsearch ner spacy
Last synced: 25 Dec 2024
https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques
using a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text
healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp
Last synced: 10 Feb 2025
https://github.com/yathartharora/twitter_bot
A twitter bot using tweepy API and phrasematching
nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot
Last synced: 07 Jan 2025
https://github.com/nanditha-prabhu/qa-system-via-srl
Question Answering System via Semantic Role Labeling Using Token Classification and Parsing Techniques
Last synced: 10 Feb 2025
https://github.com/aubainmbk/analyse-des-avis-clients-amazon
Utiliser l’analyse des sentiments et le clustering sur des avis Amazon à des fins de marketing et de satisfaction des clients.
clustering marketing-analytics nlp-machine-learning nltk pca spacy vader-sentiment-analysis
Last synced: 05 Feb 2025