Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization
- Last updated: 2024-12-29 00:22:24 UTC
https://github.com/debugger404/multilanguage-pos
Named Entity Recognition with SpaCy - 🌐📝 Repository for NER using SpaCy's MultiLanguage module. Supports multiple languages.
multilanguage named-entity-recognition ner python3 spacy
Last synced: 22 Dec 2024
https://github.com/miteshgupta07/ats-scoring-system
An ATS (Applicant Tracking System) scoring system that evaluates and ranks resumes based on keyword matching and relevance.
ats ats-system nlp python resume-parser spacy
Last synced: 18 Dec 2024
https://github.com/turbolent/spacy-thrift-docker
spacy-thrift as a Docker container
docker named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy thrift
Last synced: 08 Dec 2024
https://github.com/kailejie/ner
This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.
Last synced: 18 Dec 2024
https://github.com/aditya172926/text_summarization
Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.
glove machine-learning nlp python scikit-learn spacy
Last synced: 24 Nov 2024
https://github.com/turbolent/telescope
Go explore
compiler nlp parser question-answering scala spacy sparql
Last synced: 08 Dec 2024
https://github.com/ajla-brdarevic/pdf_question_generator
Project - Artificial intelligence
ai flask machine-learning mt5 pypdf2 python spacy transformers
Last synced: 08 Dec 2024
https://github.com/sudip-13/nlp
Tutorial repository for an NLP Dialogflow chatbot with a configured back end
dialogflow fastapi fasttext mogodb ner regex spacy tf-idf
Last synced: 14 Oct 2024
https://github.com/aranzadata/moviereviewclassifier
BERT-based sentiment analysis model for 45,000 movie reviews, achieving an F1 score of 0.88 by leveraging advanced text preprocessing techniques with NLTK and SpaCy
Last synced: 09 Dec 2024
https://github.com/kavyachouhan/manasvi
An AI-powered chatbot built with Django and spaCy that provides real-time emotional support. Manasvi uses natural language processing (NLP) and sentiment analysis to engage users in meaningful conversations about mental health, offering personalized responses based on emotional tone.
chatbot django machine-learning mental-health mental-health-chatbot nlp python sentiment-analysis spacy text-processing web-app
Last synced: 09 Dec 2024
https://github.com/rafelafrance/angiospermtraiter
Using rule-based parsers to extract information from plant treatments
Last synced: 09 Dec 2024
https://github.com/galal-pic/gd-project
annotations data fine-tuning ner nlp python spacy
Last synced: 23 Dec 2024
https://github.com/crodriguez1a/kaggle-la-jobs
Helping the City of Los Angeles to structure and analyze its job descriptions
kaggle linguistic-analysis ml nlu python spacy
Last synced: 16 Dec 2024
https://github.com/viniciusds2020/nlp_classificacao_texto_spacy
Machine learning project - Text classification with NLTK, SpaCy and Sklearn
logistic-regression machine-learning nlp nlp-machine-learning nltk-python pt-br random-forest-classifier spacy
Last synced: 10 Dec 2024
https://github.com/camara94/spacy
nlp nlp-machine-learning space-invaders spacy
Last synced: 23 Dec 2024
https://github.com/camara94/nlp-basique
In this tutorial, we discover together the basics of NLP in AI
gensim nlp nlp-keywords-extraction nlp-machine-learning pytorch sklearn spacy spacy-nlp tensorflow
Last synced: 23 Dec 2024
https://github.com/giuliosmall/twitter-trending-topics-pipeline
This project demonstrates trending topic detection using Apache Spark and MinIO. It processes Twitter JSON data with PySpark, leveraging distributed data processing and cloud storage. The entire project is containerized with Docker for easy deployment across architectures.
docker minio nlp pyspark pytest spacy spark streamlit
Last synced: 11 Dec 2024
https://github.com/lexxai/goit_python_ds_hw_12
Module 12. Fundamentals of NLP.
nlp nlp-machine-learning nlp-spacy nltk nltk-tokenizer spacy spacy-nlp
Last synced: 18 Dec 2024
https://github.com/dagmawi-22/hotel-ai
Hotel Customer Support Chatbot Rest API
django nltk pyspellchecker python spacy
Last synced: 17 Dec 2024
https://github.com/manik2000/radiohead-lyrics
NLP analysis of Radiohead's song lyrics.
embeddings huggingface-transformers nlp spacy
Last synced: 17 Dec 2024
https://github.com/rahul1582/named-entity-recognition
A Keras implementation of Bidirectional-LSTM for Named Entity Recognition.
bidirectional-lstm keras named-entity-recognition spacy tensorflow
Last synced: 13 Dec 2024
https://github.com/brianj-4/ai-race-engineer
AI Race Engineer for the F1 Games
ai f1-22 intent-classification named-entity-recognition natural-language-processing nlp spacy
Last synced: 13 Dec 2024
https://github.com/saifinohwal/sentiment-analysis
Sentiment analysis of Steve Jobs speech
lemmetization nlp spacy summarization tokenization wordcloud-visualization
Last synced: 18 Dec 2024
https://github.com/wanjage/charles-burney-digital
Digital preparation, enrichment, and geovisualization of a travelogue by the music historian Charles Burney, using Transkribus, spaCy NER, and Nodegoat
geovisualisierung ner nlp nodegoat reisebericht spacy
Last synced: 18 Dec 2024
https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques
Using a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text
healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp
Last synced: 18 Dec 2024
https://github.com/atharvapathak/customer_service_chatbot
The Customer Service Chatbot repository includes a range of features for building custom chatbots that can handle customer service queries and support requests. These features include NLP capabilities and pre-built dialog flows that help chatbots understand and respond to customers.
chatbot database dialogflow nlp nltk reinforcement-learning restful-api spacy tensorflow
Last synced: 18 Dec 2024
https://github.com/anquetos/nasa-apod-database
etl-pipeline galaxy image json nasa-apod object-oriented-programming pandas pillow space spacy
Last synced: 18 Dec 2024
https://github.com/ahmedkhaled404/ner-with-spacy
Named entity recognition using traditional NLP methods
machine-learning matplotlib ner nlp nlp-machine-learning python spacy
Last synced: 18 Dec 2024
https://github.com/tanyakuznetsova/amazon-handmade-reviews-23-sentiment-and-ner
Comparison of AWS Comprehend and SpaCy on a subset of the Amazon Handmade reviews for sentiment analysis and NER
amazon-api amazon-reviews amazon-reviews-sentiment-analysis aws-boto3 aws-comprehend aws-comprehend-nlp named-entity-recognition natural-language-processing ner sentiment-analysis spacy spacy-nlp spacy-nlp-ner
Last synced: 18 Dec 2024
https://github.com/meefs/entseeker
entseeker is a command-line tool for Named Entity Recognition (NER) and web entity searches in text files. It uses spaCy's NLP capabilities for standard named entities and custom rules for web-related entities.
ai named-entity-recognition spacy spacy-nlp text-classification text-processing
Last synced: 18 Dec 2024
https://github.com/cmilamaya/flight-dashboard-app
This project is an application that processes attached PDF documents containing flight information and extracts relevant data. The data is stored in a PostgreSQL database and visualized on a dynamic dashboard using Streamlit.
pandas pdfplumber python spacy
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/spacy-presque
Word normalization (French) for spaCy
french nlp normalization spacy spacy-extensions
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/spacy-french-parser
Syntactic dependency parser for French using spaCy
french nlp nlp-french spacy spacy-parser syntactic-dependency-parsing universal-dependencies
Last synced: 18 Dec 2024
https://github.com/sudeatesoglu/nlp-document-processor
An NLP tool for processing documents in different formats, with similarity score detection, highlighting of a given pattern and of similar words between PDFs, and NER extraction.
Last synced: 18 Dec 2024
https://github.com/dmytrovoytko/mlops-spacy-sentiment-analysis
MLOps project: training and deployment of a spaCy model for sentiment analysis
amazon ml-engineering mlflow mlops nlp prefect sentiment-analysis spacy text-classification
Last synced: 18 Dec 2024
https://github.com/coueghlani/nlp
Natural Language Processing and Data Analysis project
mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/spacy-viceverser
French lemmatization with hunspell and spaCy
french hunspell lemmatization nlp nlp-french spacy
Last synced: 18 Dec 2024
https://github.com/trikztr/gptscrape
GPTScrape: A tool for web scraping that uses spaCy for NLP and GPT4All for converting scraped text into structured JSON.
ai data-extraction data-scraping gpt gpt4all llm npl python scraping spacy spacy-nlp web-scraping
Last synced: 18 Dec 2024
https://github.com/cano1998/sentiment-analysis-report-for-amazon-product-reviews
Sentiment analysis of Amazon product reviews. The analysis provides insights into customer sentiment and opinions regarding specific products sold on Amazon.
pdf pdf-generation sentiment-analysis spacy text-blob
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/litteralement
Hybrid EAV PostgreSQL database schema for analyzing French texts
eav french nlp nlp-french postgresql spacy sql
Last synced: 18 Dec 2024
https://github.com/atharvapathak/customer_sentiment_analysis
Customer sentiment analysis is the process of using natural language processing (NLP) and machine learning techniques to analyze and understand the feelings, opinions, and attitudes expressed by customers in textual data, such as reviews, feedback, and social media posts.
cnn naive-bayes nlp nltk spacy stemming text-mining tokenization
Last synced: 18 Dec 2024
https://github.com/i-am-jiwoo-seo/vocabhub
Python Flask based web application
bootstrap flask googletrans gtts pandas python spacy website webview-app
Last synced: 18 Dec 2024
https://github.com/presizhai/rmp-ai-assistant
This project implements a RAG system for a Rate My Professor service, leveraging Pinecone for vector storage and OpenAI for text embeddings. It preprocesses professor reviews using SpaCy for cleaning and sentiment analysis, enabling the AI assistant to provide more nuanced recommendations and insights based on student queries.
generative-ai large-language-model natural-language-processing openai software-development software-engineering spacy
Last synced: 18 Dec 2024
https://github.com/nanditha-prabhu/qa-system-via-srl
Question Answering System via Semantic Role Labeling Using Token Classification and Parsing Techniques
Last synced: 18 Dec 2024
https://github.com/muhammadshavaiz/ai_learning
Google Colab notebooks showcasing PyTorch implementations and experiments. Covers deep learning techniques, including neural networks and NLP concepts.
deep-learning nlp python pytorch spacy
Last synced: 18 Dec 2024
https://github.com/michaelkinfu/hknews-headline-analysis
The Hongkong News headline analysis project was conducted by the Chinese University of Hong Kong Library.
beautifulsoup deep-learning digital-scholarship folium historical-newspapers machine-learning spacy yolov5
Last synced: 18 Dec 2024
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 18 Dec 2024
https://github.com/foxbenjaminfox/simil
CLI for semantic string similarity
glove machine-learning python spacy string-similarity
Last synced: 18 Dec 2024
https://github.com/hansalemaos/spacy2df
converts a spaCy object into a pandas DataFrame
Last synced: 18 Dec 2024
https://github.com/ianhaggerty/final-capstone
This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.
amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud
Last synced: 18 Dec 2024
https://github.com/hackerajofficial/chatbot
ChatBot capable of answering user queries while also integrating a conversational form to collect user information such as Name, Email, Phone Number, and Address using Python with Django
chat-application chatbot chatbots chatterbot django hackeraj hackerajofficial spacy spacy-nlp
Last synced: 18 Dec 2024
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 20 Dec 2024
https://github.com/direct-phonology/phony
phonology in spaCy!
linguistics nlp phonology python spacy
Last synced: 19 Nov 2024
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spacy and Sklearn
Last synced: 02 Jan 2025
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 17 Nov 2024
https://github.com/yashaswini-lankalapalli/text-summarization
Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.
Last synced: 12 Oct 2024
https://github.com/woranov/spacy-lazy-docbin
Lazy-loadable and indexable spaCy DocBins
Last synced: 17 Nov 2024
https://github.com/karimosman89/resume-screening
Screen resumes to identify the best candidates. Build a machine learning model that screens resumes and ranks candidates based on job descriptions. Streamline the hiring process for HR departments by automating candidate screening.
machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing
Last synced: 25 Dec 2024
https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review
"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"
matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis
Last synced: 12 Oct 2024
https://github.com/etienne-bobo/information-retreival_project
This project designs a model capable of recognizing terms related to our field: INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES.
information-retrieval nlp prodigy spacy
Last synced: 10 Jan 2025
https://github.com/sadegh15khedry/comments-sentiment-analysis
text classification on comments using an ANN model.
collections deep-learning keras nlp numpy pandas python sentiment-analysis sklearn spacy unicodedata
Last synced: 10 Jan 2025
https://github.com/samarthhchinivar/nlp-codebasics-playlist
This is a GitHub repository containing Jupyter notebooks and Python scripts related to natural language processing (NLP) concepts and techniques covered in the "NLP with Python" playlist by Codebasics YouTube channel. The notebooks cover topics such as text preprocessing, feature extraction using Python libraries NLTK, SpaCy
nlp-machine-learning nltk python3 spacy
Last synced: 06 Jan 2025
https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon
An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.
matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud
Last synced: 23 Nov 2024
https://github.com/parthapray/nlp_pipeline_openai
This repo contains an NLP pipeline and OpenAI API integration
gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud
Last synced: 26 Dec 2024
https://github.com/asaficontact/stack_classifier_project
We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using regular expressions, we removed HTML tags and punctuation. We also used spaCy to tokenize, lemmatize, and remove stop words. Using Keras, we built a 4-layer artificial neural network with a 20% dropout rate and ReLU and softmax activation functions. We also used the Adam optimizer and the categorical cross-entropy loss function; the network classified 11 tags with an 88% success rate.
cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization
Last synced: 22 Dec 2024
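The text-cleaning steps described in the entry above (regex removal of HTML tags and punctuation, then tokenization and stop-word removal) can be sketched roughly as follows. This is a standard-library illustration only, not the repository's code: the function name `preprocess` and the tiny `STOP_WORDS` set are hypothetical, whitespace splitting stands in for spaCy's tokenizer, and lemmatization is omitted because it needs a trained spaCy pipeline.

```python
import re
import string

# Hypothetical mini stop-word list for illustration; the project used spaCy's.
STOP_WORDS = {"a", "an", "the", "is", "to", "in", "how", "do", "i"}

# Matches anything that looks like an HTML tag, e.g. <p> or </code>.
TAG_RE = re.compile(r"<[^>]+>")

# Maps every ASCII punctuation character to a space.
PUNCT_TABLE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def preprocess(html_text):
    """Strip HTML tags and punctuation, lowercase, split, drop stop words."""
    no_tags = TAG_RE.sub(" ", html_text)
    no_punct = no_tags.translate(PUNCT_TABLE)
    tokens = no_punct.lower().split()  # stand-in for spaCy tokenization
    return [tok for tok in tokens if tok not in STOP_WORDS]


print(preprocess("<p>How do I sort a list in Python?</p>"))
```

In a real pipeline the whitespace split and the hand-written stop-word set would presumably be replaced by spaCy's tokenizer, lemmatizer, and stop-word flags.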
https://github.com/maxzirps/lyrics-sentiment-analysis
Analyse lyrics for their sentiment score
nlp pandas sentiment-analysis spacy spacy-nlp
Last synced: 12 Jan 2025
https://github.com/lilivalgo/nlp-analysis-of-un-climate-change-reports
This project uses Natural Language Processing (NLP) techniques to analyze large amounts of textual data from UN reports on climate change. By applying NLP, the project aims to extract valuable information that can shed light on critical aspects of climate change
beautifulsoup4 matplotlib pandas pypdf2 seaborn spacy text-analysis text-processing webscraping
Last synced: 12 Oct 2024
https://github.com/izuna385/arxiv-checker-backend
An API and backend modules that return accepted papers related to natural language processing from arXiv.
docker fastapi natural-language-processing pytest spacy tdd tdd-python
Last synced: 06 Jan 2025
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 22 Dec 2024
https://github.com/michabirklbauer/hgb_dse_text_mining_solutions
Solutions for the practical part of the lecture Text Mining
deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining
Last synced: 04 Jan 2025
https://github.com/philippeitis/nlp_specifier
Formal verification for natural language software documentation
natural-language-processing nlp spacy
Last synced: 12 Oct 2024
https://github.com/prashver/nlp-driven-video-summarizer-and-insight-tool
An NLP-powered tool for transcribing, summarizing, and indexing podcast content, with video-to-audio conversion and multilingual support.
flask-application huggingface-transformers keyword-extraction named-entity-recognition natural-language-processing ntlk spacy speech-to-text speech-translation text-summarization topic-modeling
Last synced: 18 Dec 2024
https://github.com/isabelleysseric/question-answering
Building a Natural Language Question & Answer Search Engine with corpus in Python language.
corpus deep-learning nlp qa question-answering spacy whoosh
Last synced: 30 Dec 2024
https://github.com/kivanc57/nlp_data_visualization
This project provides Python scripts for analyzing and visualizing text data using efficient NLP methods. It includes tools for creating bar plots, histograms, pie charts, treemaps, violin plots, and word clouds, using libraries such as matplotlib, seaborn, wordcloud, spacy, and textblob.
data-science matplotlib nlp parsing plotting python spacy visualization
Last synced: 09 Oct 2024
https://github.com/thjbdvlt/spacy-turlututu
French morphological analysis model for spaCy
french french-nlp morphological-analysis nlp part-of-speech-tagging pos-tagging spacy
Last synced: 18 Dec 2024
https://github.com/d5555/textcat_dataset_imdb
Movie Review Dataset for binary sentiment classification
categories dataset spacy textcat textcategorizer
Last synced: 02 Jan 2025
https://github.com/touradbaba/nlp-notebooks
This repository contains Jupyter notebooks on various NLP techniques, including text processing, classification, sentiment analysis, and topic modeling.
machine-learning nlp nltk sentiment-analysis spacy text-classification text-processing topic-modeling
Last synced: 09 Oct 2024
https://github.com/adishtienmetz/context-game
A context word guessing game. Try to guess the word in minimum tries!
Last synced: 09 Oct 2024
https://github.com/tony-stone-code/codealpha_simple_chatbot
This is a simple chatbot, built with Python.
ai bot-development chatbot css flask flask-application flask-web htlm5 javascript python python3 spacy spacy-nlp web-development
Last synced: 23 Nov 2024
https://github.com/lfoppiano/docker-image-spacy
Docker image for shipping spacy
Last synced: 18 Dec 2024
https://github.com/kr1shnasomani/summarai
Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)
natural-language-processing pytextrank pytorch spacy transformers
Last synced: 21 Dec 2024
https://github.com/caterinatasinato/machine-learning-nlp-projects
Projects I worked on as Trainee in Data Analytics at ProfessionAI
gensim matplotlib nltk pandas sklearn spacy
Last synced: 19 Dec 2024
https://github.com/sukanyadutta52/topic_modeling
What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?
climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy
Last synced: 26 Dec 2024
https://github.com/yathartharora/twitter_bot
A Twitter bot using the Tweepy API and phrase matching
nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot
Last synced: 07 Jan 2025
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 18 Dec 2024
https://github.com/praadnya/govt-circular-analysis
Uses OCR and NER techniques for parsing Government Circulars
annotations graphdb ner ocr spacy
Last synced: 07 Jan 2025
https://github.com/malcolmgreaves/py_ml_img
A Python 3 image for NLP & ML. Includes spaCy & NLTK model data.
docker-image machine-learning nlp nltk python3 spacy
Last synced: 13 Dec 2024
https://github.com/ivangael/nlp-chatbot-api
An NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 31 Oct 2024
https://github.com/f1uctus/webanno2spacy
Convert WebAnno TSVs to spaCy's Doc-s.
spacy spacy-extension webanno webanno-tsv
Last synced: 09 Oct 2024
https://github.com/benevanio/nasa-api-astro
Project using the NASA API.
apdo api api-client api-rest api-server astronomy css frond-end-development html5 javascipt javascipt-ai javascript nasa-api nasa-data react-router reactjs space spaceship spacy
Last synced: 29 Nov 2024
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 20 Dec 2024
https://github.com/parthapray/pii_scrubbing_llm
This repo contains code for heuristic PII scrubbing before calling an LLM (local and remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 20 Dec 2024