Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
![](https://explore-feed.github.com/topics/spacy/spacy.png)
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization
- Last updated: 2025-02-09 00:28:04 UTC
https://github.com/izuna385/arxiv-checker-backend
An API and backend modules that return accepted papers related to natural language processing from arXiv.
docker fastapi natural-language-processing pytest spacy tdd tdd-python
Last synced: 02 Feb 2025
https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods
Text-Summarizer-Using-NLP-and-TF-IDF-Methods
Last synced: 04 Feb 2025
https://github.com/aranzadata/moviereviewclassifier
BERT-based sentiment analysis model for 45,000 movie reviews, achieving an F1 score of 0.88 by leveraging advanced text preprocessing techniques with NLTK and SpaCy
Last synced: 04 Feb 2025
https://github.com/emmy-bradfield/hilly_xmas
A simple chatbot built using OpenAI's davinci 003 as a gift for a dear friend of ours
machine-learning natural-language-processing openai python spacy
Last synced: 21 Jan 2025
https://github.com/aydan-moon/news_headlines_ner
Named Entity Recognition (NER) model for analyzing entities in news headlines using spaCy and trained on the CoNLL-2003 dataset.
conll-2003 ner nlp python spacy
Last synced: 21 Jan 2025
https://github.com/yathartharora/twitter_bot
A Twitter bot using the Tweepy API and phrase matching
nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot
Last synced: 07 Jan 2025
https://github.com/praadnya/govt-circular-analysis
Uses OCR and NER techniques for parsing Government Circulars
annotations graphdb ner ocr spacy
Last synced: 07 Jan 2025
https://github.com/blue-codes-yep/AI.AT
AI-Powered Text-To-Speech Script Generator. This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.
ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp
Last synced: 06 Jan 2025
https://github.com/rfdzan/summarize-search-result
extractive text summarization with a handful of different libraries
natural-language-processing python spacy
Last synced: 28 Dec 2024
https://github.com/xwiz/spacy_symspell
Spacy symspell extension
spacy spelling-correction spelling-suggestions symspell
Last synced: 02 Feb 2025
https://github.com/caterinatasinato/machine-learning-nlp-projects
Projects I worked on as a Trainee in Data Analytics at ProfessionAI
gensim matplotlib nltk pandas sklearn spacy
Last synced: 11 Feb 2025
https://github.com/miweru/vrt_spacy
corpora linguistic-corpora linguistics nlp spacy vrt wrapper
Last synced: 02 Feb 2025
https://github.com/michabirklbauer/hgb_dse_text_mining
Contents for the practical part of the lecture Text Mining
deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining
Last synced: 09 Nov 2024
https://github.com/raul23/nlp
Performing various NLP tasks with different Python libraries
cld2 compact-language-detector langdetect langid language-classification nameparser natural-language-processing nlp nltk python spacy textcat
Last synced: 13 Jan 2025
https://github.com/stephenombuya/ai-powered-writing-assistant
An advanced writing assistant that helps users improve their writing through grammar checking, style analysis, and intelligent suggestions.
flask-application pytest python3 spacy sqlalchemy sqlite3 textblob-sentiment-analysis writing-assistant
Last synced: 09 Jan 2025
https://github.com/free-analytics/nlp
Natural language processing
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 09 Jan 2025
https://github.com/philippeitis/nlp_specifier
Formal verification for natural language software documentation
natural-language-processing nlp spacy
Last synced: 21 Jan 2025
https://github.com/ashenoooone/semantic-book-analyzer
Web service for extracting keywords from the introductions of discrete mathematics books in PDF format. Frontend: React.js, Webpack, FSD, RTK, TypeScript. Backend: FastAPI, FastAPI Users, SQLAlchemy, Pydantic, Pymorphy3, Spacy. Includes authentication, registration, and request history. 📚🔍
fastapi fastapi-users nlp pymorphy2 pymorphy3 python3 reactjs rtk rtkquery spacy spacy-nlp sqlalchemy typescript
Last synced: 15 Jan 2025
https://github.com/mugambi645/spacy-text-classification
Text classification with spacy
Last synced: 11 Nov 2024
https://github.com/zackakil/nlp-using-word-vectors
Code resources for Central London Data Science Project Nights meetup on word vectors
machine-learning natural-language-processing nlp python spacy word-embeddings word-vectors
Last synced: 13 Nov 2024
https://github.com/salma-4/nlp-task
Preprocessing using NLTK and spaCy
nltk-library python spacy svm-model
Last synced: 22 Jan 2025
https://github.com/asaficontact/stack_classifier_project
We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using regular expressions, we removed HTML tags and punctuation. We also used spaCy to tokenize, lemmatize, and remove stop words. Using Keras, we built a 4-layer artificial neural network with a 20% dropout rate and ReLU and softmax activation functions. With the Adam optimizer and a categorical cross-entropy loss function, it classified 11 tags with an 88% success rate.
cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization
Last synced: 22 Dec 2024
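The regex cleanup step described in the entry above (stripping HTML tags and punctuation before tokenization and stop-word removal) might look roughly like the sketch below. This is not the project's code: the function name `clean_question`, the tiny `STOP_WORDS` set, and the whitespace tokenizer are assumptions made for illustration, and the listed project used spaCy for tokenization, lemmatization, and stop words.

```python
import html
import re

# Hypothetical, deliberately tiny stop-word list for illustration only;
# the listed project relied on spaCy's stop words instead.
STOP_WORDS = {"a", "an", "the", "in", "to", "do", "i", "how", "is", "of"}

TAG_RE = re.compile(r"<[^>]+>")    # any HTML tag, e.g. <p> or </code>
PUNCT_RE = re.compile(r"[^\w\s]")  # anything that is not a word character or whitespace


def clean_question(raw: str) -> list[str]:
    """Strip HTML tags and punctuation, lowercase, and drop stop words."""
    text = TAG_RE.sub(" ", raw)      # replace tags with spaces
    text = html.unescape(text)       # decode entities such as &amp;
    text = PUNCT_RE.sub(" ", text)   # replace punctuation with spaces
    tokens = text.lower().split()    # naive whitespace tokenization (assumption)
    return [t for t in tokens if t not in STOP_WORDS]


if __name__ == "__main__":
    sample = "<p>How do I sort a <code>dict</code> in Python?</p>"
    print(clean_question(sample))
```

Replacing tags and punctuation with spaces rather than deleting them keeps adjacent words from fusing together, which matters when tags sit directly between words.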
https://github.com/iv4n-ga6l/nlp-chatbot-api
An NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 25 Jan 2025
https://github.com/viniciusds2020/nlp_classificacao_texto_spacy
Machine learning project: text classification with NLTK, SpaCy, and Sklearn
logistic-regression machine-learning nlp nlp-machine-learning nltk-python pt-br random-forest-classifier spacy
Last synced: 05 Feb 2025
https://github.com/chewzzz1014/fyp-ner-archive
final-year-project flair fyp machine-learning ner nlp project spacy transformer
Last synced: 18 Jan 2025
https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect
NER model for detecting company names, GOST standards, and product names with 97% accuracy
named-entity-recognition ner python spacy spacy-models
Last synced: 10 Feb 2025
https://github.com/pythonicforge/e.c.h.o-mini
A miniature model of ECHO intended for my portfolio
ai express javascript nltk python spacy
Last synced: 22 Jan 2025
https://github.com/imvladikon/spacy-trankit
💥 Trankit models directly in spaCy💥
nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit
Last synced: 28 Jan 2025
https://github.com/parthapray/pii_scrubbing_llm
This repo contains code for PII scrubbing via heuristic search before calling an LLM (local and remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 20 Dec 2024
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 20 Dec 2024
https://github.com/f1uctus/webanno2spacy
Convert WebAnno TSVs to spaCy Doc objects.
spacy spacy-extension webanno webanno-tsv
Last synced: 08 Feb 2025
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 10 Feb 2025
https://github.com/e3oroush/music_sorting
A simple project for categorizing your local music. Find and delete duplicate music files on your local machine
duplication-detection mediainfo music-duplication-detection music-information-retrieval python spacy
Last synced: 29 Jan 2025
https://github.com/prashver/nlp-driven-video-summarizer-and-insight-tool
An NLP-powered tool for transcribing, summarizing, and indexing podcast content, with video-to-audio conversion and multilingual support.
flask-application huggingface-transformers keyword-extraction named-entity-recognition natural-language-processing ntlk spacy speech-to-text speech-translation text-summarization topic-modeling
Last synced: 10 Feb 2025
https://github.com/ssciwr/argumentation-management
Annotator combining different NLP pipelines.
corpus-linguistics cwb hacktoberfest natural-language-processing nlp part-of-speech python sentencizer spacy tokenization
Last synced: 18 Jan 2025
https://github.com/shiv010hbtu/sentiment-analysis
Sentiment Analysis
django pandas python spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/vanheemstrasystems/spacy
SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Last synced: 17 Nov 2024
https://github.com/lilivalgo/nlp-for-ipcc-climate-reports
This project combines web scraping, PDF processing, and Natural Language Processing (NLP) to extract and analyze IPCC climate reports. It automates downloading PDFs, processes file validation, and applies NLP for data insights.
beautifulsoup4 matplotlib nlp pandas pypdf2 python requests seaborn spacy text-analysis text-processing webscraping
Last synced: 17 Nov 2024
https://github.com/pabvald/bachelor-thesis
Bachelor's thesis overview
chatbots dialogflow fasttext glove nlp spacy university-of-valladolid user-evaluation virtual-assistants word-embeddings word2vec
Last synced: 29 Jan 2025
https://github.com/kivanc57/nlp_data_visualization
This project provides Python scripts for analyzing and visualizing text data using efficient NLP methods. It includes tools for creating bar plots, histograms, pie charts, treemaps, violin plots, and word clouds, using libraries such as matplotlib, seaborn, wordcloud, spacy, and textblob.
data-science matplotlib nlp parsing plotting python spacy visualization
Last synced: 08 Feb 2025
https://github.com/blacksujit/quantumlens
QuantumLens is a cutting-edge, AI-powered information assistant designed to revolutionize how you interact with and process information by leveraging advanced machine learning algorithms and natural language processing techniques.
ai bert bert-embeddings dataanalysis information integration-flow intellij-idea ml model models nlp-machine-learning processing project research spacy spacy-models spacy-nlp spacy-pipeline summeriza summerization
Last synced: 08 Feb 2025
https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review
"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"
matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis
Last synced: 02 Feb 2025
https://github.com/shwetajanwekar/capstone-project_1
Capstone project_1 includes Python code for a best-fit regression model, a SQL feature store, and a Tableau dashboard
accuracy create engine lasso-regression-model linear-regression mysql-database pandas ridge-regression seaborn skewness sklearn-library spacy sqlalchemy standard-deviation transformation variense xtrain ytrain
Last synced: 21 Dec 2024
https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch
Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch
elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec
Last synced: 22 Dec 2024
https://github.com/aubainmbk/analyse-des-avis-clients-amazon
Use sentiment analysis and clustering on Amazon reviews for marketing and customer satisfaction purposes.
clustering marketing-analytics nlp-machine-learning nltk pca spacy vader-sentiment-analysis
Last synced: 05 Feb 2025
https://github.com/zevio/pcu_nlp
NLP pipeline (spacy.io) for PCU project
component natural-language-processing nlp nlp-pipeline pcu pcu-nlp pipeline python spacy
Last synced: 02 Feb 2025
https://github.com/nxgeo/id-svo-extractor
id-svo-extractor: Extract SVO triples from Indonesian text.
artificial-intelligence indonesian-language indonesian-linguistics indonesian-nlp information-extraction knowledge-extraction knowledge-representation natural-language-processing nlp python rdf-triples spacy spacy-stanza stanza text-analysis triple-extraction
Last synced: 07 Dec 2024
https://github.com/medspacy/nlp_postprocessor
A spaCy component for executing custom logic at the end of a pipeline.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 09 Jan 2025
https://github.com/ajaykumar095/natural_language_processing
Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.
ann nltk-python python rnn spacy tensorflow text-preprocessing textblob
Last synced: 22 Dec 2024
https://github.com/ayaz-amin/speechpos
A simple Python script that tags speech to parts-of-speech
deep-learning machine-learning python3 spacy
Last synced: 29 Jan 2025
https://github.com/satoru-shibata-jpn/nlp
Natural language processing
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 02 Dec 2024
https://github.com/leosimoes/coursera-usp-pln-i
Activities from the course "Processamento Neural de Linguagem Natural em Português I" offered by USP through Coursera.
Last synced: 30 Jan 2025
https://github.com/thjbdvlt/solipcysme
spaCy pipeline for French focused on personal pronouns, fiction, and first-person point-of-view texts.
french french-nlp lemmatization morphological-analysis natural-language-processing nlp nlp-french normalization part-of-speech-tagging pos-tagging spacy spacy-extensions tokenization word-embeddings
Last synced: 17 Jan 2025
https://github.com/cmucheru/chatbot
A conversational chatbot for embedding in a site.
Last synced: 08 Feb 2025
https://github.com/arnabd64/spacy-ner-hf-space
A webapp built using Gradio for demonstrating the capabilities of the Spacy NER pipeline.
gradio huggingface-spaces named-entity-recognition nlp spacy spacy-pipeline token-classification
Last synced: 08 Feb 2025
https://github.com/richackashyap/using-bart-model-and-named-entity-recognition-to-summarize-text-and-create-a-mind-map-
Generation of mind maps based on any given paragraph
Last synced: 10 Feb 2025
https://github.com/fyt3rp4til/tfidf-emotiondetection
multinomial-naive-bayes n-grams random-forest spacy tfidf-vectorizer
Last synced: 08 Feb 2025
https://github.com/francislauriano/chatsoftex
Platform developed in Python that aims to automate and speed up the evaluation of technological innovation projects, using artificial intelligence and standardized criteria based on the Lei do Bem.
cryptography fernet firebase flask flask-jwt-extended hugging-face-transformers numpy openai pdfplumber postgresql pyjwt pymupdf-fitz pypdf2 python pytorch scikit-learn scipy spacy sqlalchemy tensorflow
Last synced: 03 Feb 2025
https://github.com/rafelafrance/angiospermtraiter
Using rule-based parsers to extract information from plant treatments
Last synced: 09 Dec 2024
https://github.com/rggh/api-4
Using FastAPI with spaCy to identify entities
Last synced: 02 Feb 2025
https://github.com/crodriguez1a/kaggle-la-jobs
Helping the City of Los Angeles to structure and analyze its job descriptions
kaggle linguistic-analysis ml nlu python spacy
Last synced: 09 Feb 2025
https://github.com/galal-pic/gd-project
annotations data fine-tuning ner nlp python spacy
Last synced: 23 Dec 2024
https://github.com/camara94/spacy
nlp nlp-machine-learning space-invaders spacy
Last synced: 23 Dec 2024
https://github.com/camara94/nlp-basique
In this tutorial, we discover together the basics of NLP in AI
gensim nlp nlp-keywords-extraction nlp-machine-learning pytorch sklearn spacy spacy-nlp tensorflow
Last synced: 23 Dec 2024
https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon
An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.
matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud
Last synced: 23 Jan 2025
https://github.com/rahul1582/named-entity-recognition
A keras implementation of Bidirectional-LSTM for Named Entity Recognition.
bidirectional-lstm keras named-entity-recognition spacy tensorflow
Last synced: 06 Feb 2025
https://github.com/brianj-4/ai-race-engineer
AI Race Engineer for the F1 Games
ai f1-22 intent-classification named-entity-recognition natural-language-processing nlp spacy
Last synced: 13 Dec 2024
https://github.com/wanjage/charles-burney-digital
Digital preparation, enrichment, and geovisualization of a travelogue by the music historian Charles Burney, using Transkribus, Spacy-NER, and Nodegoat
geovisualisierung ner nlp nodegoat reisebericht spacy
Last synced: 10 Feb 2025
https://github.com/atharvapathak/customer_service_chatbot
The Customer Service Chatbot repository includes a range of features for building custom chatbots that can handle customer service queries and support requests. These features include NLP capabilities and pre-built dialog flows that can help chatbots understand and respond to customers.
chatbot database dialogflow nlp nltk reinforcement-learning restful-api spacy tensorflow
Last synced: 10 Feb 2025
https://github.com/anquetos/nasa-apod-database
etl-pipeline galaxy image json nasa-apod object-oriented-programming pandas pillow space spacy
Last synced: 10 Feb 2025
https://github.com/ahmedkhaled404/ner-with-spacy
Named entity recognition using traditional NLP methods
machine-learning matplotlib ner nlp nlp-machine-learning python spacy
Last synced: 10 Feb 2025
https://github.com/tanyakuznetsova/amazon-handmade-reviews-23-sentiment-and-ner
Comparison of AWS Comprehend and SpaCy on a subset of the Amazon Handmade reviews for sentiment analysis and NER
amazon-api amazon-reviews amazon-reviews-sentiment-analysis aws-boto3 aws-comprehend aws-comprehend-nlp named-entity-recognition natural-language-processing ner sentiment-analysis spacy spacy-nlp spacy-nlp-ner
Last synced: 18 Dec 2024
https://github.com/sudeatesoglu/nlp-document-processor
An NLP tool for processing documents in different formats, with similarity score detection, highlighting of a given pattern and of similar words between PDFs, and NER extraction.
Last synced: 10 Feb 2025
https://github.com/muhammadshavaiz/ai_learning
Google Colab notebooks showcasing PyTorch implementations and experiments. Covers deep learning techniques, including neural networks and NLP concepts.
deep-learning nlp python pytorch spacy
Last synced: 10 Feb 2025
https://github.com/michaelkinfu/hknews-headline-analysis
The Hong Kong News headline analysis project was conducted by the Chinese University of Hong Kong Library.
beautifulsoup deep-learning digital-scholarship folium historical-newspapers machine-learning spacy yolov5
Last synced: 10 Feb 2025
https://github.com/foxbenjaminfox/simil
CLI for semantic string similarity
glove machine-learning python spacy string-similarity
Last synced: 10 Feb 2025
https://github.com/karimosman89/resume-screening
Screen resumes to identify the best candidates. Build a machine learning model that screens resumes and ranks candidates based on job descriptions. Streamline the hiring process for HR departments by automating candidate screening.
machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing
Last synced: 25 Dec 2024
https://github.com/etienne-bobo/information-retreival_project
This project designs a model capable of recognizing terms related to our field, which is: INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES.
information-retrieval nlp prodigy spacy
Last synced: 10 Jan 2025
https://github.com/sadegh15khedry/comments-sentiment-analysis
text classification on comments using an ANN model.
collections deep-learning keras nlp numpy pandas python sentiment-analysis sklearn spacy unicodedata
Last synced: 10 Jan 2025
https://github.com/malcolmgreaves/py_ml_img
A Python 3 image for NLP & ML. Includes spaCy & NLTK model data.
docker-image machine-learning nlp nltk python3 spacy
Last synced: 07 Feb 2025
https://github.com/sukanyadutta52/topic_modeling
What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?
climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy
Last synced: 26 Dec 2024
https://github.com/parthapray/nlp_pipeline_openai
This repo contains an NLP pipeline and OpenAI API integration
gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud
Last synced: 26 Dec 2024
https://github.com/tony-stone-code/codealpha_simple_chatbot
This is a simple chatbot, built with python.
ai bot-development chatbot css flask flask-application flask-web htlm5 javascript python python3 spacy spacy-nlp web-development
Last synced: 23 Jan 2025
https://github.com/michabirklbauer/hgb_dse_text_mining_solutions
Solutions for the practical part of the lecture Text Mining
deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining
Last synced: 04 Jan 2025
https://github.com/arya-io/ner-entitylinker
A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.
ai disambiguation entityextraction entitylinking machinelearning namedentityrecognition naturallanguageprocessing nlp python spacy streamlit textprocessing wikipediaapi
Last synced: 11 Jan 2025
https://github.com/maxzirps/lyrics-sentiment-analysis
Analyse lyrics for their sentiment score
nlp pandas sentiment-analysis spacy spacy-nlp
Last synced: 12 Jan 2025
https://github.com/dagmawi-22/hotel-ai
Hotel Customer Support Chatbot Rest API
django nltk pyspellchecker python spacy
Last synced: 09 Feb 2025
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homer's Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 12 Jan 2025
https://github.com/ledsouza/nlp-article-classification
This project aims to develop a machine learning model capable of classifying news articles into different categories based on their titles. Two different word embedding models (CBOW and Skip-gram) are trained and used to vectorize the article titles. These vectorized representations are then used to train a Logistic Regression classifier.
gensim-word2vec natural-language-processing nlp nlp-machine-learning pandas python scikit-learn spacy spacy-nlp
Last synced: 30 Jan 2025
https://github.com/manik2000/radiohead-lyrics
NLP analysis of Radiohead's song lyrics.
embeddings huggingface-transformers nlp spacy
Last synced: 09 Feb 2025
https://github.com/christram/tln-miage-ia2-2
NLP Project #2
deep-learning nlp python spacy
Last synced: 19 Jan 2025