Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2024-12-29 00:22:24 UTC
- JSON Representation
https://github.com/ahmedkhaled404/ner-with-spacy
Named entity recognition using traditional NLP methods
machine-learning matplotlib ner nlp nlp-machine-learning python spacy
Last synced: 18 Dec 2024
https://github.com/tanyakuznetsova/amazon-handmade-reviews-23-sentiment-and-ner
Comparison of AWS Comprehend and SpaCy on a subset of the Amazon Handmade reviews for sentiment analysis and NER
amazon-api amazon-reviews amazon-reviews-sentiment-analysis aws-boto3 aws-comprehend aws-comprehend-nlp named-entity-recognition natural-language-processing ner sentiment-analysis spacy spacy-nlp spacy-nlp-ner
Last synced: 18 Dec 2024
https://github.com/viniciusmecosta/cv_classifier
A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.
catboost fastapi python3 sklearn spacy
Last synced: 09 Oct 2024
https://github.com/meefs/entseeker
entseeker is a command-line tool for Named Entity Recognition (NER) and web entity searches in text files. It uses spaCy's NLP capabilities for standard named entities and custom rules for web-related entities.
ai named-entity-recognition spacy spacy-nlp text-classification text-processing
Last synced: 18 Dec 2024
https://github.com/cmilamaya/flight-dashboard-app
This project is an application that processes attached PDF documents containing flight information and extracts relevant data. The data is stored in a PostgreSQL database and visualized on a dynamic dashboard using Streamlit.
pandas pdfplumber python spacy
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/spacy-presque
normalisation de mots (français) pour spacy
french nlp normalization spacy spacy-extensions
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/spacy-french-parser
syntactic dependency parser for french using spacy
french nlp nlp-french spacy spacy-parser syntactic-dependency-parsing universal-dependencies
Last synced: 18 Dec 2024
https://github.com/sudeatesoglu/nlp-document-processor
An NLP tool for processing documents in different formats with functionalities of similarity score detection, highlighting given pattern and similar words between PDFs, and NER extraction.
Last synced: 18 Dec 2024
https://github.com/dmytrovoytko/mlops-spacy-sentiment-analysis
MLOps project Training and Deployment of Spacy model for Sentiment analysis
amazon ml-engineering mlflow mlops nlp prefect sentiment-analysis spacy text-classification
Last synced: 18 Dec 2024
https://github.com/coueghlani/nlp
Proyecto de Procesamiento de Lenguaje Natural y Análisis de Datos
mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy
Last synced: 18 Dec 2024
https://github.com/jrubengaliciab/wordtoobsidian
Converts Word documents into Markdown for Obsidian, identifying and linking keywords related to topics using spaCy's Spanish NER model.
Last synced: 13 Jan 2025
https://github.com/thjbdvlt/spacy-viceverser
lemmatisation du français avec hunspell et spacy
french hunspell lemmatization nlp nlp-french spacy
Last synced: 18 Dec 2024
https://github.com/trikztr/gptscrape
GPTScrape: A tool for web scraping that uses spaCy for NLP and GPT4All for converting scraped text into structured JSON.
ai data-extraction data-scraping gpt gpt4all llm npl python scraping spacy spacy-nlp web-scraping
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/quelquhui
tokenizer for french
french french-nlp nlp spacy tokenizer-nlp
Last synced: 09 Oct 2024
https://github.com/cano1998/sentiment-analysis-report-for-amazon-product-reviews
Sentiment analysis of Amazon product reviews. The analysis provides insights into customer sentiment and opinions regarding specific products sold on Amazon.
pdf pdf-generation sentiment-analysis spacy text-blob
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/litteralement
schéma de base de données postgresql EAV hybride pour l'analyse de textes en français
eav french nlp nlp-french postgresql spacy sql
Last synced: 18 Dec 2024
https://github.com/atharvapathak/customer_sentiment_analysis
Customer sentiment analysis is the process of using natural language processing (NLP) and machine learning techniques to analyze and understand the feelings, opinions, and attitudes expressed by customers in textual data, such as reviews, feedback, and social media posts.
cnn naive-bayes nlp nltk spacy stemming text-mining tokenization
Last synced: 18 Dec 2024
https://github.com/i-am-jiwoo-seo/vocabhub
Python Flask based web application
bootstrap flask googletrans gtts pandas python spacy website webview-app
Last synced: 18 Dec 2024
https://github.com/2pa4ul2/mcq-quiz-maker-nlp
Quizzable a quiz generator for short reviews with Spacy and NLTK
flask nlp nltk python question-generation quizapp spacy
Last synced: 09 Oct 2024
https://github.com/presizhai/rmp-ai-assistant
This project implements a RAG system for a Rate My Professor service, leveraging Pinecone for vector storage and OpenAI for text embeddings. It preprocesses professor reviews using SpaCy for cleaning and sentiment analysis, enabling the AI assistant to provide more nuanced recommendations and insights based on student queries.
generative-ai large-language-model natural-language-processing openai software-development software-engineering spacy
Last synced: 18 Dec 2024
https://github.com/nanditha-prabhu/qa-system-via-srl
Question Answering System via Semantic Role Labeling Using Token Classification and Parsing Techniques
Last synced: 18 Dec 2024
https://github.com/muhammadshavaiz/ai_learning
Google Colab notebooks showcasing PyTorch implementations and experiments. Covers deep learning techniques, including neural networks and NLP concepts.
deep-learning nlp python pytorch spacy
Last synced: 18 Dec 2024
https://github.com/michaelkinfu/hknews-headline-analysis
The Hongkong News headline analysis project was conducted by the Chinese University of Hong Kong Library.
beautifulsoup deep-learning digital-scholarship folium historical-newspapers machine-learning spacy yolov5
Last synced: 18 Dec 2024
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 18 Dec 2024
https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries
A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.
clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization
Last synced: 21 Dec 2024
https://github.com/richackashyap/using-bart-model-and-named-entity-recognition-to-summarize-text-and-create-a-mind-map-
Generation of mind maps based on any given paragraph
Last synced: 18 Dec 2024
https://github.com/bglid/job-application-helper
Project to incorporate web scraping of job applications and then analyze them using NLP methods.
nlp spacy streamlit text-processing webscraping
Last synced: 07 Dec 2024
https://github.com/mugambi645/spacy-text-classification
Text classification with spacy
Last synced: 11 Nov 2024
https://github.com/naveen3830/splashtop_analysis
This repository contains the code for my webapp splashtop website analysis.
nlp-keywords-extraction python spacy streamlit
Last synced: 07 Dec 2024
https://github.com/simeonhristov99/ati
Ati is a web-based application for predicting which famous classic Bulgarian novelist wrote a piece of text (short or long).
authorship-attribution embeddings jupyter-notebook multiclass-classification nlp optuna pycaret python3 scraping-websites spacy transformer
Last synced: 13 Jan 2025
https://github.com/parthapray/pii_scrubbing_llm
This repo contains codes about PII scrubbing heuristics search before calling to LLM (local and remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 20 Dec 2024
https://github.com/fyt3rp4til/tfidf-emotiondetection
multinomial-naive-bayes n-grams random-forest spacy tfidf-vectorizer
Last synced: 09 Oct 2024
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 13 Dec 2024
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 20 Dec 2024
https://github.com/galal-pic/gd-project
annotations data fine-tuning ner nlp python spacy
Last synced: 23 Dec 2024
https://github.com/centrefordigitalhumanities/textminer
A script to detect named entities and store them in an Elasticsearch annotated_text field
annotation elasticsearch ner spacy
Last synced: 25 Dec 2024
https://github.com/sydney-informatics-hub/clause-segmenter
A clause segmenting tool utilising Python's SpaCy
Last synced: 09 Oct 2024
https://github.com/zackakil/nlp-using-word-vectors
Code resources for Central London Data Science Project Nights meetup on word vectors
machine-learning natural-language-processing nlp python spacy word-embeddings word-vectors
Last synced: 13 Nov 2024
https://github.com/foxbenjaminfox/simil
CLI for semantic string similarity
glove machine-learning python spacy string-similarity
Last synced: 18 Dec 2024
https://github.com/hansalemaos/spacy2df
converts a spaCy object into a pandas DataFrame
Last synced: 18 Dec 2024
https://github.com/rggh/api-4
Using FastAPI with spaCy to identify entities
Last synced: 07 Dec 2024
https://github.com/arya-io/ner-entitylinker
A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.
ai disambiguation entityextraction entitylinking machinelearning namedentityrecognition naturallanguageprocessing nlp python spacy streamlit textprocessing wikipediaapi
Last synced: 11 Jan 2025
https://github.com/ahmedabdalkreem/grammer-auto-correct
In this project work to make classification between the phase is correct or wrong if phase is right print the correct phase if phase is wrong be input of Transfer Learning and print the phase begore correct.
decision-trees logistic-regression machine-learning matplotlib-pyplot naive-bayes-classifier nlp nltk-library pandas-library python random-forest sklearn spacy svm-model transfer-learning
Last synced: 16 Nov 2024
https://github.com/free-analytics/nlp
自然言語処理
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 09 Jan 2025
https://github.com/stephenombuya/ai-powered-writing-assistant
An advanced writing assistant that helps users improve their writing through grammar checking, style analysis, and intelligent suggestions.
flask-application pytest python3 spacy sqlalchemy sqlite3 textblob-sentiment-analysis writing-assistant
Last synced: 09 Jan 2025
https://github.com/araobp/bach-network
J. S. Bach's network with spaCy(NLP)
Last synced: 17 Nov 2024
https://github.com/ashenoooone/semantic-book-analyzer
Веб-сервис для извлечения ключевых слов из введения книг по дискретной математике в формате PDF. Фронтенд: React.js, Webpack, FSD, RTK, TypeScript. Бэкенд: FastAPI, FastAPI Users, SQLAlchemy, Pydantic, Pymorphy3, Spacy. Включает авторизацию, регистрацию и историю запросов. 📚🔍
fastapi fastapi-users nlp pymorphy2 pymorphy3 python3 reactjs rtk rtkquery spacy spacy-nlp sqlalchemy typescript
Last synced: 15 Nov 2024
https://github.com/raul23/nlp
Performing various NLP tasks with different Python libraries
cld2 compact-language-detector langdetect langid language-classification nameparser natural-language-processing nlp nltk python spacy textcat
Last synced: 13 Jan 2025
https://github.com/michabirklbauer/hgb_dse_text_mining
Contents for the practical part of the lecture Text Mining
deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining
Last synced: 09 Nov 2024
https://github.com/f1uctus/webanno2spacy
Convert WebAnno TSVs to spaCy's Doc-s.
spacy spacy-extension webanno webanno-tsv
Last synced: 09 Oct 2024
https://github.com/victowang/wikigame
A python script to play the Wikipedia game
nlp python spacy wikigame wikipedia-game
Last synced: 05 Jan 2025
https://github.com/ianhaggerty/final-capstone
This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.
amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud
Last synced: 18 Dec 2024
https://github.com/hackerajofficial/chatbot
ChatBot capable of answering user queries while also integrating a conversational form to collect user information such as Name, Email, Phone Number, and Address using Python with Django
chat-application chatbot chatbots chatterbot django hackeraj hackerajofficial spacy spacy-nlp
Last synced: 18 Dec 2024
https://github.com/yathartharora/twitter_bot
A twitter bot using tweepy API and phrasematching
nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot
Last synced: 07 Jan 2025
https://github.com/imvladikon/spacy-trankit
💥 Trankit models directly in spaCy💥
nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit
Last synced: 30 Nov 2024
https://github.com/luis54929/oscarbot
OscarBot: Chatbot de IA personalizado para el área de tecnología del Banco de Occidente. Asistente inteligente para procesos internos y consultas hacia tecnología..
ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy
Last synced: 21 Dec 2024
https://github.com/blue-codes-yep/AI.AT
AI-Powered Text-To-Speech Script Generator This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.
ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp
Last synced: 06 Jan 2025
https://github.com/bonysmoke/speliuk
A more accurate spelling correction for the Ukrainian language.
correction kenlm spacy spelling symspell ukrainian
Last synced: 10 Oct 2024
https://github.com/rfdzan/summarize-search-result
extractive text summarization with a handful of different libraries
natural-language-processing python spacy
Last synced: 28 Dec 2024
https://github.com/ssciwr/argumentation-management
Annotator combining different NLP pipelines.
corpus-linguistics cwb hacktoberfest natural-language-processing nlp part-of-speech python sentencizer spacy tokenization
Last synced: 17 Nov 2024
https://github.com/shiv010hbtu/sentiment-analysis
Sentiment Analysis
django pandas python spacy tensorflow
Last synced: 17 Nov 2024
https://github.com/vanheemstrasystems/spacy
SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Last synced: 17 Nov 2024
https://github.com/lilivalgo/nlp-for-ipcc-climate-reports
This project combines web scraping, PDF processing, and Natural Language Processing (NLP) to extract and analyze IPCC climate reports. It automates downloading PDFs, processes file validation, and applies NLP for data insights.
beautifulsoup4 matplotlib nlp pandas pypdf2 python requests seaborn spacy text-analysis text-processing webscraping
Last synced: 17 Nov 2024
https://github.com/jamnicki/bachelor_thesis_project
System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning
active-learning active-learning-in-nlp annotation-tool argilla kpwr named-entity-recognition nlp optimization sampling-methods sequence-labeling sequential-data spacy
Last synced: 21 Dec 2024
https://github.com/rohetoric/text-vector-visualisation
Website: https://rohetoric.github.io/text-vector-visualisation/
data-science data-visualization fasttext fasttext-embeddings machine-learning python3 spacy spacy-nlp tensorflow tensorflow-examples tensorflow-experiments tensorflow-tutorials tensorflow1 tensorflow2
Last synced: 21 Dec 2024
https://github.com/adishtienmetz/context-game
A context word guessing game. Try to guess the word in minimum tries!
Last synced: 09 Oct 2024
https://github.com/isabelleysseric/question-answering
Building a Natural Language Question & Answer Search Engine with corpus in Python language.
corpus deep-learning nlp qa question-answering spacy whoosh
Last synced: 30 Dec 2024
https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods
Text-Summarizer-Using-NLP-and-TF-IDF-Methods
Last synced: 09 Dec 2024
https://github.com/tony-stone-code/codealpha_simple_chatbot
This is a simple chatbot, built with python.
ai bot-development chatbot css flask flask-application flask-web htlm5 javascript python python3 spacy spacy-nlp web-development
Last synced: 23 Nov 2024
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 20 Dec 2024
https://github.com/direct-phonology/phony
phonology in spaCy!
linguistics nlp phonology python spacy
Last synced: 19 Nov 2024
https://github.com/christram/tln-miage-ia2-2
NLP Project #2
deep-learning nlp python spacy
Last synced: 18 Nov 2024
https://github.com/karimosman89/resume-screening
Screen resumes to identify the best candidates.Build a machine learning model that screens resumes and ranks candidates based on job descriptions.Streamline the hiring process for HR departments by automating candidate screening.
machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing
Last synced: 25 Dec 2024
https://github.com/blacksujit/quantumlens
QuantumLens is a cutting-edge, AI-powered information assistant designed to revolutionize how you interact with and process information. By leveraging advanced machine learning algorithms and natural language processing techniques.
ai bert bert-embeddings dataanalysis information integration-flow intellij-idea ml model models nlp-machine-learning processing project research spacy spacy-models spacy-nlp spacy-pipeline summeriza summerization
Last synced: 15 Dec 2024
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spacy and Sklearn
Last synced: 02 Jan 2025
https://github.com/malcolmgreaves/py_ml_img
A Python 3 image for NLP & ML. Includes spaCy & NLTK model data.
docker-image machine-learning nlp nltk python3 spacy
Last synced: 13 Dec 2024
https://github.com/caterinatasinato/machine-learning-nlp-projects
Projects I worked on as Trainee in Data Analytics at ProfessionAI
gensim matplotlib nltk pandas sklearn spacy
Last synced: 19 Dec 2024
https://github.com/serenasensini/medspacy-tutorial
Use case to show medspaCy functionalities.
medspacy nlp nlp-machine-learning spacy spacy-nlp spacy-pipeline
Last synced: 20 Nov 2024
https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch
Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch
elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec
Last synced: 22 Dec 2024