Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2024-11-05 00:28:59 UTC
- JSON Representation
https://github.com/srstevenson/keyword-extractor
Extract keywords from plain text documents
Last synced: 04 Aug 2024
https://github.com/inshh04/codealpha_chatbotforfaqs_inshanadeem
The FAQ Chatbot is a Python-based conversational agent designed to interact with users and respond to frequently asked questions. It offers a simple and engaging way to provide automated responses, handle polite interactions like thanking the user, and end conversations gracefully. This project serves as a basic template for building more advanced.
chatbot faqbot faqchatbot faqs keyword-extraction nlp nlp-machine-learning progressive-web-app project python python3 pythonprojects spacy spacy-nlp
Last synced: 31 Oct 2024
https://github.com/izuna385/pubtator-multiprocess-parser
Specifically for Entity Linking. Quick demo with MedMentions and NCBI datasets is also included.
allennlp bioinformatics entity-disambiguation entity-linking natural-language-processing pubtator spacy
Last synced: 18 Oct 2024
https://github.com/5hraddha/sentiment-analysis
An innovative system for filtering and categorizing movie reviews
countvectorizer dummyclassifier lgbmclassifier logisticregression matplotlib minmaxscaler nltk nltk-stopwords nltk-tokenizer numpy pandas seaborn spacy tfidfvectorizer torch tqdm transformers
Last synced: 31 Oct 2024
https://github.com/izuna385/arxiv-checker
Single Page Application and its deployment for GCE.
docker docker-compose fastapi nginx react react-bootstrap spacy tdd
Last synced: 18 Oct 2024
https://github.com/keshabkjha/weatherapp
WeatherApp is a web application that provides real-time weather information based on the user's location or any searched city. It features automatic location detection, manual search, and a chatbot called Weatha, built using Python (Streamlit & SpaCy), that responds to weather-related queries.
html-css-javascript niet-codetantra niet-training python python3 spacy spacy-nlp streamlit weather-api weather-app
Last synced: 25 Oct 2024
https://github.com/oroszgy/spacy-tokenizer-benchmark
Quick and dirty scripts to measure the performance of spaCy
benchmark natural-language-processing nlp python spacy tokenizer
Last synced: 19 Oct 2024
https://github.com/florensadimer/nlp_ner_soccer_pt-br
Anotação Manual e Comparação com Modelos Treinados
annotation llm machine-learning ner nlp spacy
Last synced: 21 Oct 2024
https://github.com/thyripian/core
This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.
api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite
Last synced: 26 Oct 2024
https://github.com/kailejie/ner
This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.
Last synced: 31 Oct 2024
https://github.com/miteshgupta07/ats-scoring-system
An ATS (Applicant Tracking System) scoring system that evaluates and ranks resumes based on keyword matching and relevance.
ats ats-system nlp python resume-parser spacy
Last synced: 31 Oct 2024
https://github.com/lykmapipo/us-inaugural-addresses
Python scripts to download, process, and analyze US Inaugural Addresses
beautifulsoup4 gensim joblib lykmapipo natural-language-processing nlp nltk python python-scripts requests spacy text-analysis text-analytics text-extraction text-processing web-scraping
Last synced: 04 Nov 2024
https://github.com/pyladiesams/nlp-projects-with-spacy-may2024
NLP projects with spaCy
Last synced: 31 Oct 2024
https://github.com/samestrin/llm-services-api
A FastAPI-powered REST API offering a comprehensive suite of natural language processing services using machine learning models with PyTorch and Transformers, packaged in a Docker container to run efficiently.
api docker fastapi hugging-face hugging-face-transformers huggingface-transformers keybert llm openai-compatible-api python python3 pytorch rest rest-api spacy torch transformers uvicorn
Last synced: 31 Oct 2024
https://github.com/charlesyuan02/named_entity_recognition
Utilizing Spacy and Tensorflow to train custom Named Entity Recognizers.
conll-2003 named-entity-recognition ner nlp spacy transformer
Last synced: 31 Oct 2024
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 19 Oct 2024
https://github.com/thjbdvlt/quelquhui
tokenizer for french
french french-nlp nlp spacy tokenizer-nlp
Last synced: 09 Oct 2024
https://github.com/muhammadshavaiz/ai_learning
Google Colab notebooks showcasing PyTorch implementations and experiments. Covers deep learning techniques, including neural networks and NLP concepts.
deep-learning nlp python pytorch spacy
Last synced: 31 Oct 2024
https://github.com/manik2000/radiohead-lyrics
NLP analysis of Radiohead's songs lyrics.
embeddings huggingface-transformers nlp spacy
Last synced: 09 Oct 2024
https://github.com/2pa4ul2/mcq-quiz-maker-nlp
Quizzable a quiz generator for short reviews with Spacy and NLTK
flask nlp nltk python question-generation quizapp spacy
Last synced: 09 Oct 2024
https://github.com/rrayhka/indonesian-ner-spacy
Fine-tuning SpaCy for Indonesian Named Entity Recognition (NER) with custom dataset.
indonesian named-entity-recognition ner nlp spacy
Last synced: 09 Oct 2024
https://github.com/arnabd64/spacy-ner-hf-space
A webapp built using Gradio for demonstrating the capabilities of the Spacy NER pipeline.
gradio huggingface-spaces named-entity-recognition nlp spacy spacy-pipeline token-classification
Last synced: 09 Oct 2024
https://github.com/surbhi242singh/text_summarizer
machine-learning nlp spacy tokenization
Last synced: 09 Oct 2024
https://github.com/pedcapa/nlpower
FastAPI-based service designed to provide real-time text analysis. It leverages some Natural Language Processing (NLP) libraries to offer functionalities such as sentiment analysis, keyword extraction, and text summarization.
Last synced: 09 Oct 2024
https://github.com/dmytrovoytko/mlops-spacy-sentiment-analysis
MLOps project Training and Deployment of Spacy model for Sentiment analysis
amazon ml-engineering mlflow mlops nlp prefect sentiment-analysis spacy text-classification
Last synced: 31 Oct 2024
https://github.com/tanyakuznetsova/amazon-handmade-reviews-23-sentiment-and-ner
Comparison of AWS Comprehend and SpaCy on a subset of the Amazon Handmade reviews for sentiment analysis and NER
amazon-api amazon-reviews amazon-reviews-sentiment-analysis aws-boto3 aws-comprehend aws-comprehend-nlp named-entity-recognition natural-language-processing ner sentiment-analysis spacy spacy-nlp spacy-nlp-ner
Last synced: 31 Oct 2024
https://github.com/coueghlani/nlp
Proyecto de Procesamiento de Lenguaje Natural y Análisis de Datos
mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy
Last synced: 31 Oct 2024
https://github.com/richackashyap/using-bart-model-and-named-entity-recognition-to-summarize-text-and-create-a-mind-map-
Generation of mind maps based on any given paragraph
Last synced: 31 Oct 2024
https://github.com/fyt3rp4til/tfidf-emotiondetection
multinomial-naive-bayes n-grams random-forest spacy tfidf-vectorizer
Last synced: 09 Oct 2024
https://github.com/kazkozdev/novelgenerator
NovelGenerator - AI-powered fiction book generator that uses Ollama's LLMs to create complete novels with coherent plot structures, developed characters and multiple writing styles.
ai-novels fiction-generator nlp novel-writing ollama python spacy text-generation
Last synced: 02 Nov 2024
https://github.com/turbolent/spacy-http
spaCy as a HTTP service
api named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy
Last synced: 14 Oct 2024
https://github.com/oroszgy/cookiecutter-ml-flask
Cookiecutter template for training and serving machine learning models with scikit-learn, spacy, Flask and Docker
docker flask flask-application machine-learning nlp rest-api scikit-learn spacy
Last synced: 19 Oct 2024
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spacy and Sklearn
Last synced: 11 Oct 2024
https://github.com/thjbdvlt/spacy-viceverser
lemmatisation du français avec hunspell et spacy
french hunspell lemmatization nlp nlp-french spacy
Last synced: 31 Oct 2024
https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries
A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.
clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization
Last synced: 03 Nov 2024
https://github.com/luis54929/oscarbot
OscarBot: Chatbot de IA personalizado para el área de tecnología del Banco de Occidente. Asistente inteligente para procesos internos y consultas hacia tecnología..
ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy
Last synced: 03 Nov 2024
https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review
"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"
matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis
Last synced: 12 Oct 2024
https://github.com/rahul1582/named-entity-recognition
A keras implementation of Bidirectional-LSTM for Named Entity Recognition.
bidirectional-lstm keras named-entity-recognition spacy tensorflow
Last synced: 24 Oct 2024
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 24 Oct 2024
https://github.com/philippeitis/nlp_specifier
Formal verification for natural language software documentation
natural-language-processing nlp spacy
Last synced: 12 Oct 2024
https://github.com/vanheemstrasystems/spacy
SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Last synced: 12 Oct 2024
https://github.com/sukanyadutta52/sentiment-analysis
An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis
flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard
Last synced: 17 Oct 2024
https://github.com/lilivalgo/nlp-analysis-of-un-climate-change-reports
This project uses Natural Language Processing (NLP) techniques to analyze large amounts of textual data from UN reports on climate change. By applying NLP, the project aims to extract valuable information that can shed light on critical aspects of climate change
beautifulsoup4 matplotlib pandas pypdf2 seaborn spacy text-analysis text-processing webscraping
Last synced: 12 Oct 2024
https://github.com/shiv010hbtu/sentiment-analysis
Sentiment Analysis
django pandas python spacy tensorflow
Last synced: 12 Oct 2024
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests, and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 10 Oct 2024
https://github.com/direct-phonology/phony
phonology in spaCy!
linguistics nlp phonology python spacy
Last synced: 12 Oct 2024
https://github.com/bonysmoke/speliuk
A more accurate spelling correction for the Ukrainian language.
correction kenlm spacy spelling symspell ukrainian
Last synced: 10 Oct 2024
https://github.com/asaficontact/stack_classifier_project
We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using Regular Expressions, we removed HTML tags and punctuation. We also utilized spaCy to tokenize, lemmatize and remove stop words. Using Keras, we built a 4 layered artificial neural network with a 20% dropout rate using relu and softmax activation functions. We also utilized the adam optimizer and categorical cross-entropy loss function which classified 11 tags 88% successfully.
cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization
Last synced: 05 Nov 2024
https://github.com/etienne-bobo/information-retreival_project
In this project, it was a question of designing a model capable of recognizing terms related to our field which is: INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES.
information-retrieval nlp prodigy spacy
Last synced: 11 Oct 2024
https://github.com/ivangael/nlp-chatbot-api
A NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 31 Oct 2024
https://github.com/thjbdvlt/spacy-french-parser
syntactic dependency parser for french using spacy
french nlp nlp-french spacy spacy-parser syntactic-dependency-parsing universal-dependencies
Last synced: 31 Oct 2024
https://github.com/rohetoric/text-vector-visualisation
Website: https://rohetoric.github.io/text-vector-visualisation/
data-science data-visualization fasttext fasttext-embeddings machine-learning python3 spacy spacy-nlp tensorflow tensorflow-examples tensorflow-experiments tensorflow-tutorials tensorflow1 tensorflow2
Last synced: 03 Nov 2024
https://github.com/giuliosmall/twitter-trending-topics-pipeline
This project demonstrates trending topic detection using Apache Spark and MinIO. It processes Twitter JSON data with PySpark, leveraging distributed data processing and cloud storage. The entire project is containerized with Docker for easy deployment across architectures.
docker minio nlp pyspark pytest spacy spark streamlit
Last synced: 24 Oct 2024
https://github.com/meefs/entseeker
entseeker is a command-line tool for Named Entity Recognition (NER) and web entity searches in text files. It uses spaCy's NLP capabilities for standard named entities and custom rules for web-related entities.
ai named-entity-recognition spacy spacy-nlp text-classification text-processing
Last synced: 31 Oct 2024
https://github.com/lfoppiano/docker-image-spacy
Docker image for shipping spacy
Last synced: 30 Oct 2024
https://github.com/hackerajofficial/chatbot
ChatBot capable of answering user queries while also integrating a conversational form to collect user information such as Name, Email, Phone Number, and Address using Python with Django
chat-application chatbot chatbots chatterbot django hackeraj hackerajofficial spacy spacy-nlp
Last synced: 31 Oct 2024
https://github.com/miweru/vrt_spacy
corpora linguistic-corpora linguistics nlp spacy vrt wrapper
Last synced: 18 Oct 2024
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 02 Nov 2024
https://github.com/ianhaggerty/final-capstone
This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.
amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud
Last synced: 31 Oct 2024
https://github.com/izuna385/arxiv-checker-backend
This is an API and backend modules to return accepted papers related to natural language processing from arxiv.
docker fastapi natural-language-processing pytest spacy tdd tdd-python
Last synced: 18 Oct 2024
https://github.com/wanjage/charles-burney-digital
Digitale Aufbereitung, Anreicherung und Geovisualisierung eines Reiseberichts des Musikhistorikers Charles Burney, mithilfe von Transkribus, Spacy-NER und Nodegoat
geovisualisierung ner nlp nodegoat reisebericht spacy
Last synced: 31 Oct 2024
https://github.com/presizhai/rmp-ai-assistant
This project implements a RAG system for a Rate My Professor service, leveraging Pinecone for vector storage and OpenAI for text embeddings. It preprocesses professor reviews using SpaCy for cleaning and sentiment analysis, enabling the AI assistant to provide more nuanced recommendations and insights based on student queries.
generative-ai large-language-model natural-language-processing openai software-development software-engineering spacy
Last synced: 31 Oct 2024
https://github.com/thjbdvlt/litteralement
schéma de base de données postgresql EAV hybride pour l'analyse de textes en français
eav french nlp nlp-french postgresql spacy sql
Last synced: 31 Oct 2024
https://github.com/naveen3830/splashtop_analysis
This repository contains the code for my webapp splashtop website analysis.
nlp-keywords-extraction python spacy streamlit
Last synced: 18 Oct 2024
https://github.com/bglid/job-application-helper
Project to incorporate web scraping of job applications and then analyze them using NLP methods.
nlp spacy streamlit text-processing webscraping
Last synced: 18 Oct 2024
https://github.com/brianj-4/ai-race-engineer
AI Race Engineer for the F1 Games
ai f1-22 intent-classification named-entity-recognition natural-language-processing nlp spacy
Last synced: 25 Oct 2024
https://github.com/saifinohwal/sentiment-analysis
Sentiment analysis of Steve Jobs speech
lemmetization nlp spacy summarization tokenization wordcloud-visualization
Last synced: 30 Oct 2024
https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect
Модель NER для определения названий компаний, стандартов госта и названий товаров с точностью 97%
named-entity-recognition ner python spacy spacy-models
Last synced: 09 Oct 2024
https://github.com/salma-4/nlp-task
Preprocessing using NLTK ,SPACY
nltk-library python spacy svm-model
Last synced: 09 Oct 2024
https://github.com/tbarlow12/wiki-answer
I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions
nlp python question-answering spacy wikipedia
Last synced: 19 Oct 2024
https://github.com/foxbenjaminfox/simil
CLI for semantic string similarity
glove machine-learning python spacy string-similarity
Last synced: 01 Nov 2024
https://github.com/oroszgy/mltools
Common utility methods and classes to ease the work with sklearn, spacy, pandas, matplotlib
data-science machine-learning nlp pandas sklearn sklearn-compatible spacy tools
Last synced: 19 Oct 2024
https://github.com/lexxai/goit_python_ds_hw_12
Модуль 12. Основи NLP.
nlp nlp-machine-learning nlp-spacy nltk nltk-tokenizer spacy spacy-nlp
Last synced: 31 Oct 2024
https://github.com/ajla-brdarevic/pdf_question_generator
Project - Artificial intelligence
ai flask machine-learning mt5 pypdf2 python spacy transformers
Last synced: 20 Oct 2024
https://github.com/randika00/ism-web-automation-y23cp-web
Web scraping refers to the extraction of data from a website. Be it a spreadsheet or an API.
2captcha-api beautifulsoup regex scrapy selenium spacy webdriver
Last synced: 20 Oct 2024
https://github.com/zevio/pcu_nlp
NLP pipeline (spacy.io) for PCU project
component natural-language-processing nlp nlp-pipeline pcu pcu-nlp pipeline python spacy
Last synced: 18 Oct 2024
https://github.com/michaelkinfu/hknews-headline-analysis
The Hongkong News headline analysis project was conducted by the Chinese University of Hong Kong Library.
beautifulsoup deep-learning digital-scholarship folium historical-newspapers machine-learning spacy yolov5
Last synced: 31 Oct 2024
https://github.com/crodriguez1a/kaggle-la-jobs
Helping the City of Los Angeles to structure and analyze its job descriptions
kaggle linguistic-analysis ml nlu python spacy
Last synced: 28 Oct 2024
https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods
Text-Summarizer-Using-NLP-and-TF-IDF-Methods
Last synced: 21 Oct 2024
https://github.com/kavyachouhan/manasvi
An AI-powered chatbot built with Django and spaCy that provides real-time emotional support. Manasvi uses natural language processing (NLP) and sentiment analysis to engage users in meaningful conversations about mental health, offering personalized responses based on emotional tone.
chatbot django machine-learning mental-health mental-health-chatbot nlp python sentiment-analysis spacy text-processing web-app
Last synced: 21 Oct 2024
https://github.com/i-am-jiwoo-seo/vocabhub
Python Flask based web application
bootstrap flask googletrans gtts pandas python spacy website webview-app
Last synced: 31 Oct 2024
https://github.com/rafelafrance/angiospermtraiter
Using rule-based parsers to extract information from plant treatments
Last synced: 21 Oct 2024
https://github.com/adishtienmetz/context-game
A context word guessing game. Try to guess the word in minimum tries!
Last synced: 09 Oct 2024
https://github.com/mydarapy/named-entity-recognition-in-clinical-texts-using-nlp-techniques
using a pretrained ML model to identify and extract named entities (drugs and dosage) from a medical corpus of clinical text
healthcare-data machine-learning medical named-entity-recognition nlp spacy spacy-nlp
Last synced: 31 Oct 2024
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 31 Oct 2024
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 12 Oct 2024
https://github.com/yashaswini-lankalapalli/text-summarization
Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.
Last synced: 12 Oct 2024
https://github.com/atharvapathak/customer_sentiment_analysis
Customer sentiment analysis is the process of using natural language processing (NLP) and machine learning techniques to analyze and understand the feelings, opinions, and attitudes expressed by customers in textual data, such as reviews, feedback, and social media posts.
cnn naive-bayes nlp nltk spacy stemming text-mining tokenization
Last synced: 31 Oct 2024
https://github.com/cano1998/sentiment-analysis-report-for-amazon-product-reviews
Sentiment analysis of Amazon product reviews. The analysis provides insights into customer sentiment and opinions regarding specific products sold on Amazon.
pdf pdf-generation sentiment-analysis spacy text-blob
Last synced: 31 Oct 2024
https://github.com/touradbaba/nlp-notebooks
This repository contains Jupyter notebooks on various NLP techniques, including text processing, classification, sentiment analysis, and topic modeling.
machine-learning nlp nltk sentiment-analysis spacy text-classification text-processing topic-modeling
Last synced: 09 Oct 2024
https://github.com/sudeatesoglu/nlp-document-processor
An NLP tool for processing documents in different formats with functionalities of similarity score detection, highlighting given pattern and similar words between PDFs, and NER extraction.
Last synced: 31 Oct 2024
https://github.com/woranov/spacy-lazy-docbin
Lazy-loadable and indexable spaCy DocBins
Last synced: 12 Oct 2024