Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization
- Last updated: 2024-12-29 00:22:24 UTC
https://github.com/charlesyuan02/named_entity_recognition
Using spaCy and TensorFlow to train custom named entity recognizers.
conll-2003 named-entity-recognition ner nlp spacy transformer
Last synced: 19 Dec 2024
https://github.com/ggordonhall/measurement_tagger
spaCy Measurement Tagger
dependency-parser measurement-tagger nlp python spacy tagger
Last synced: 26 Dec 2024
https://github.com/sudip-13/nlp
Tutorial repo for an NLP Dialogflow chatbot with a configured back end
dialogflow fastapi fasttext mogodb ner regex spacy tf-idf
Last synced: 14 Oct 2024
https://github.com/somenath203/named-entity-recognizer
Click below to check out the website
huggingface huggingface-spaces named-entity-recognition ner spacy streamlit torch transformers
Last synced: 14 Oct 2024
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 14 Oct 2024
https://github.com/rkirlew/custom-resume-ner-model-development-with-spacy
I developed a custom Named Entity Recognition (NER) model using spaCy. The process involved manually annotating data, training the model, and evaluating its performance on unseen text. This project provided hands-on experience in working with NLP models, data annotation, and model training pipelines.
machine-learning named-entity-recognition ner spacy spacy-nlp
Last synced: 11 Jan 2025
https://github.com/tomhalloin/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 16 Nov 2024
https://github.com/bghorvath/TextMiningTheBechdelTest
Text mining movie scripts to explore the long-term trend of female representation in movies according to the Bechdel test
bechdel bechdel-test coreference-resolution neuralcoref spacy
Last synced: 16 Nov 2024
https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch
Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch
elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec
Last synced: 22 Dec 2024
https://github.com/luis54929/oscarbot
OscarBot: custom AI chatbot for the technology department of Banco de Occidente. An intelligent assistant for internal processes and technology queries.
ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy
Last synced: 21 Dec 2024
https://github.com/aydan-moon/news_headlines_ner
Named Entity Recognition (NER) model for analyzing entities in news headlines using spaCy and trained on the CoNLL-2003 dataset.
conll-2003 ner nlp python spacy
Last synced: 20 Nov 2024
https://github.com/thjbdvlt/quelquhui
Tokenizer for French
french french-nlp nlp spacy tokenizer-nlp
Last synced: 09 Oct 2024
https://github.com/sukanyadutta52/topic_modeling
What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?
climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy
Last synced: 26 Dec 2024
https://github.com/zevio/pcu_nlp
NLP pipeline (spacy.io) for PCU project
component natural-language-processing nlp nlp-pipeline pcu pcu-nlp pipeline python spacy
Last synced: 07 Dec 2024
https://github.com/miweru/vrt_spacy
corpora linguistic-corpora linguistics nlp spacy vrt wrapper
Last synced: 07 Dec 2024
https://github.com/xwiz/spacy_symspell
spaCy SymSpell extension
spacy spelling-correction spelling-suggestions symspell
Last synced: 07 Dec 2024
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 17 Nov 2024
https://github.com/2pa4ul2/mcq-quiz-maker-nlp
Quizzable, a quiz generator for short reviews, built with spaCy and NLTK
flask nlp nltk python question-generation quizapp spacy
Last synced: 09 Oct 2024
https://github.com/praju-1/deep_learning
This repository covers deep learning concepts. Deep learning is a subset of machine learning based on neural networks.
keras nltk pandas python sklearn spacy statistics tensorflow
Last synced: 15 Dec 2024
https://github.com/salma-4/nlp-task
Preprocessing using NLTK and spaCy
nltk-library python spacy svm-model
Last synced: 09 Oct 2024
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spaCy and scikit-learn
Last synced: 02 Jan 2025
https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect
NER model for detecting company names, GOST standards, and product names with 97% accuracy
named-entity-recognition ner python spacy spacy-models
Last synced: 09 Oct 2024
https://github.com/direct-phonology/phony
phonology in spaCy!
linguistics nlp phonology python spacy
Last synced: 19 Nov 2024
https://github.com/pythonicforge/e.c.h.o-mini
A miniature model of ECHO intended for my portfolio
ai express javascript nltk python spacy
Last synced: 22 Nov 2024
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 20 Dec 2024
https://github.com/arkadiuszkaros/nlp-book-pos-extractor
This project focuses on extracting sentences from the text of two popular book series: Harry Potter and Game of Thrones. Using Natural Language Processing (NLP) techniques powered by spaCy, the project aims to identify and analyze the parts of speech (POS) for each word in a sentence.
extractor nlp part-of-speech-tagging python spacy
Last synced: 07 Dec 2024
https://github.com/rrayhka/indonesian-ner-spacy
Fine-tuning spaCy for Indonesian Named Entity Recognition (NER) with a custom dataset.
indonesian named-entity-recognition ner nlp spacy
Last synced: 09 Oct 2024
https://github.com/nxgeo/id-svo-extractor
id-svo-extractor: Extract SVO triples from Indonesian text.
artificial-intelligence indonesian-language indonesian-linguistics indonesian-nlp information-extraction knowledge-extraction knowledge-representation natural-language-processing nlp python rdf-triples spacy spacy-stanza stanza text-analysis triple-extraction
Last synced: 07 Dec 2024
https://github.com/thekartikeyamishra/documentsummarizer
The Document Summarizer is a Python-based application that extracts summaries from uploaded text and PDF documents using Natural Language Processing (NLP) techniques. This project includes a basic GUI to interact with the application, upload documents, and view the summarized content.
machine-learning nlp nlp-machine-learning pdfplumber python spacy tkinter tkinter-gui
Last synced: 07 Dec 2024
https://github.com/arnabd64/spacy-ner-hf-space
A web app built using Gradio for demonstrating the capabilities of the spaCy NER pipeline.
gradio huggingface-spaces named-entity-recognition nlp spacy spacy-pipeline token-classification
Last synced: 09 Oct 2024
https://github.com/medspacy/nlp_postprocessor
A spaCy component for executing custom logic at the end of a pipeline.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 09 Jan 2025
https://github.com/parthapray/nlp_pipeline_openai
This repo contains an NLP pipeline and OpenAI API integration
gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud
Last synced: 26 Dec 2024
https://github.com/ajaykumar095/natural_language_processing
Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.
ann nltk-python python rnn spacy tensorflow text-preprocessing textblob
Last synced: 22 Dec 2024
https://github.com/jamnicki/bachelor_thesis_project
System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning
active-learning active-learning-in-nlp annotation-tool argilla kpwr named-entity-recognition nlp optimization sampling-methods sequence-labeling sequential-data spacy
Last synced: 21 Dec 2024
https://github.com/ayaz-amin/speechpos
A simple Python script that tags speech with parts of speech
deep-learning machine-learning python3 spacy
Last synced: 01 Dec 2024
https://github.com/iv4n-ga6l/nlp-chatbot-api
An NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 26 Nov 2024
https://github.com/praadnya/govt-circular-analysis
Uses OCR and NER techniques for parsing government circulars
annotations graphdb ner ocr spacy
Last synced: 07 Jan 2025
https://github.com/ivangael/nlp-chatbot-api
An NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 31 Oct 2024
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 08 Dec 2024
https://github.com/tbarlow12/wiki-answer
I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions
nlp python question-answering spacy wikipedia
Last synced: 08 Dec 2024
https://github.com/satoru-shibata-jpn/nlp
Natural language processing
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 02 Dec 2024
https://github.com/leosimoes/coursera-usp-pln-i
Activities from the course "Processamento Neural de Linguagem Natural em Português I", offered by USP through Coursera.
Last synced: 02 Dec 2024
https://github.com/yathartharora/twitter_bot
A Twitter bot using the Tweepy API and phrase matching
nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot
Last synced: 07 Jan 2025
https://github.com/oroszgy/cookiecutter-ml-flask
Cookiecutter template for training and serving machine learning models with scikit-learn, spaCy, Flask and Docker
docker flask flask-application machine-learning nlp rest-api scikit-learn spacy
Last synced: 08 Dec 2024
https://github.com/d5555/textcat_dataset_imdb
Movie Review Dataset for binary sentiment classification
categories dataset spacy textcat textcategorizer
Last synced: 02 Jan 2025
https://github.com/isabelleysseric/question-answering
Building a natural-language question-and-answer search engine over a corpus in Python.
corpus deep-learning nlp qa question-answering spacy whoosh
Last synced: 30 Dec 2024
https://github.com/surbhi242singh/text_summarizer
machine-learning nlp spacy tokenization
Last synced: 09 Oct 2024
https://github.com/pedcapa/nlpower
FastAPI-based service designed to provide real-time text analysis. It leverages some Natural Language Processing (NLP) libraries to offer functionalities such as sentiment analysis, keyword extraction, and text summarization.
Last synced: 09 Oct 2024
https://github.com/randika00/ism-web-automation-y23cp-web
Web scraping refers to the extraction of data from a website, be it into a spreadsheet or an API.
2captcha-api beautifulsoup regex scrapy selenium spacy webdriver
Last synced: 08 Dec 2024
https://github.com/cmucheru/chatbot
A conversational chatbot for embedding in a site.
Last synced: 15 Dec 2024
https://github.com/shwetam19/python-ai-chatbot
Pluto.ai is an intelligent chatbot built using Flask. It provides dynamic conversations with features like user authentication, sentiment analysis, NLP-powered intent matching, and API integrations.
ai chatbot flask nlp nltk python spacy sqlalchemy
Last synced: 15 Dec 2024
https://github.com/ahmedabdalkreem/grammer-auto-correct
This project classifies whether a phrase is correct or wrong. If the phrase is right, the correct phrase is printed; if it is wrong, it is passed as input to a transfer learning model, and the phrase before correction is printed.
decision-trees logistic-regression machine-learning matplotlib-pyplot naive-bayes-classifier nlp nltk-library pandas-library python random-forest sklearn spacy svm-model transfer-learning
Last synced: 16 Nov 2024
https://github.com/turbolent/spacy-http
spaCy as an HTTP service
api named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy
Last synced: 28 Nov 2024
https://github.com/raniasakrr/breakthrough-hire
The project aims to help job seekers understand the essential qualifications required for specific jobs and assess how well their skills match those positions. Additionally, it assists recruiters in improving their resume selection processes by analyzing and comprehending job advertisements.
bert cvanalysis flask ner nlp python scraping sentence-similarity spacy sqlalchemy transformer
Last synced: 09 Oct 2024
https://github.com/ntinouldinho/machine-learning-classification-and-speech-generation
Explored Greek Parliament Proceedings and tried to classify each speech to a corresponding parliamentary political party.
artificial-intelligence classification-machine-learning machine-learning neural-networks pandas python sklearn spacy
Last synced: 08 Dec 2024
https://github.com/thekartikeyamishra/resumeevaluatorapp
The Automated Resume Evaluator is a Python-based application that helps evaluate resumes against job descriptions. It calculates an Applicant Tracking System (ATS) score, which is the percentage of keywords from the job description found in the resume.
flask machine-learning matplotlib nlp nltk pypdf python scikit-learn spacy textblob
Last synced: 09 Dec 2024
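The ATS score described in the entry above (the percentage of job-description keywords found in the resume) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the repository's actual code: the function name `keyword_match_score`, the tokenizer regex, and the small `STOP_WORDS` set are all assumptions made for the example.

```python
import re

# Hypothetical stop-word list; a real tool would likely use a fuller one.
STOP_WORDS = {"a", "an", "and", "in", "of", "or", "the", "to", "with"}


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its distinct word-like tokens."""
    return set(re.findall(r"[a-z0-9+#]+", text.lower()))


def keyword_match_score(job_description: str, resume: str) -> float:
    """Percentage of distinct job-description keywords that occur in the resume."""
    keywords = tokenize(job_description) - STOP_WORDS
    if not keywords:
        return 0.0  # no keywords to match against
    found = keywords & tokenize(resume)
    return 100.0 * len(found) / len(keywords)


if __name__ == "__main__":
    job = "Python developer with Flask and SQL experience"
    cv = "Built Flask apps in Python"
    print(keyword_match_score(job, cv))
```

Here the job description yields five keywords (python, developer, flask, sql, experience), two of which appear in the resume. Matching on exact tokens is the simplest choice; lemmatizing both texts first (for example with spaCy) would make the match less brittle.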
https://github.com/centrefordigitalhumanities/textminer
A script to detect named entities and store them in an Elasticsearch annotated_text field
annotation elasticsearch ner spacy
Last synced: 25 Dec 2024
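As a rough illustration of the idea in the entry above, the sketch below turns entity spans into the `[text](value)` markup that an Elasticsearch annotated_text field accepts. It assumes the spans are non-overlapping `(start, end, label)` character offsets, such as those an NER model reports, and that annotation values are URL-encoded. The function name `to_annotated_text` is hypothetical, and the repository itself may work differently.

```python
from urllib.parse import quote_plus


def to_annotated_text(text: str, entities: list[tuple[int, int, str]]) -> str:
    """Wrap each (start, end, label) span as [span text](label).

    Spans are assumed to be non-overlapping character offsets into `text`.
    """
    parts = []
    cursor = 0
    for start, end, label in sorted(entities):
        parts.append(text[cursor:start])  # plain text before the entity
        parts.append(f"[{text[start:end]}]({quote_plus(label)})")
        cursor = end
    parts.append(text[cursor:])  # remaining plain text after the last entity
    return "".join(parts)


if __name__ == "__main__":
    sentence = "Angela Merkel visited Paris."
    spans = [(0, 13, "PERSON"), (22, 27, "GPE")]
    print(to_annotated_text(sentence, spans))
```

The resulting string could then be indexed as the value of an annotated_text field. Using the entity label as the annotation value is one option; a normalized entity name would work equally well.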
https://github.com/francislauriano/chatsoftex
An innovative platform developed in Python that aims to automate and speed up the evaluation of innovation projects, using artificial intelligence and standardized criteria based on the Lei do Bem.
cryptography fernet firebase flask flask-jwt-extended hugging-face-transformers numpy openai pdfplumber postgresql pyjwt pymupdf-fitz pypdf2 python pytorch scikit-learn scipy spacy sqlalchemy tensorflow
Last synced: 09 Dec 2024
https://github.com/viniciusmecosta/cv_classifier
A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.
catboost fastapi python3 sklearn spacy
Last synced: 09 Oct 2024
https://github.com/ledsouza/nlp-article-classification
This project aims to develop a machine learning model capable of classifying news articles into different categories based on their titles. Two different word embedding models (CBOW and Skip-gram) are trained and used to vectorize the article titles. These vectorized representations are then used to train a Logistic Regression classifier.
gensim-word2vec natural-language-processing nlp nlp-machine-learning pandas python scikit-learn spacy spacy-nlp
Last synced: 03 Dec 2024
https://github.com/sydney-informatics-hub/clause-segmenter
A clause segmenting tool utilising Python's spaCy
Last synced: 09 Oct 2024
https://github.com/aranzadata/moviereviewclassifier
BERT-based sentiment analysis model for 45,000 movie reviews, achieving an F1 score of 0.88 by leveraging advanced text preprocessing techniques with NLTK and spaCy
Last synced: 09 Dec 2024
https://github.com/kavyachouhan/manasvi
An AI-powered chatbot built with Django and spaCy that provides real-time emotional support. Manasvi uses natural language processing (NLP) and sentiment analysis to engage users in meaningful conversations about mental health, offering personalized responses based on emotional tone.
chatbot django machine-learning mental-health mental-health-chatbot nlp python sentiment-analysis spacy text-processing web-app
Last synced: 09 Dec 2024
https://github.com/richackashyap/using-bart-model-and-named-entity-recognition-to-summarize-text-and-create-a-mind-map-
Generation of mind maps based on any given paragraph
Last synced: 18 Dec 2024
https://github.com/rafelafrance/angiospermtraiter
Using rule-based parsers to extract information from plant treatments
Last synced: 09 Dec 2024
https://github.com/fyt3rp4til/tfidf-emotiondetection
multinomial-naive-bayes n-grams random-forest spacy tfidf-vectorizer
Last synced: 09 Oct 2024
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 13 Dec 2024
https://github.com/galal-pic/gd-project
annotations data fine-tuning ner nlp python spacy
Last synced: 23 Dec 2024
https://github.com/crodriguez1a/kaggle-la-jobs
Helping the City of Los Angeles to structure and analyze its job descriptions
kaggle linguistic-analysis ml nlu python spacy
Last synced: 16 Dec 2024
https://github.com/tony-stone-code/codealpha_simple_chatbot
This is a simple chatbot, built with Python.
ai bot-development chatbot css flask flask-application flask-web htlm5 javascript python python3 spacy spacy-nlp web-development
Last synced: 23 Nov 2024
https://github.com/parthapray/pii_scrubbing_llm
This repo contains code for a heuristic PII scrubbing search performed before calling an LLM (local and remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 20 Dec 2024
https://github.com/viniciusds2020/nlp_classificacao_texto_spacy
Machine learning project: text classification with NLTK, spaCy, and Sklearn
logistic-regression machine-learning nlp nlp-machine-learning nltk-python pt-br random-forest-classifier spacy
Last synced: 10 Dec 2024
https://github.com/victowang/wikigame
A Python script to play the Wikipedia game
nlp python spacy wikigame wikipedia-game
Last synced: 05 Jan 2025
https://github.com/oroszgy/mltools
Common utility methods and classes to ease work with sklearn, spaCy, pandas, and matplotlib
data-science machine-learning nlp pandas sklearn sklearn-compatible spacy tools
Last synced: 08 Dec 2024
https://github.com/camara94/spacy
nlp nlp-machine-learning space-invaders spacy
Last synced: 23 Dec 2024
https://github.com/camara94/nlp-basique
In this tutorial, we discover together the basics of NLP in AI
gensim nlp nlp-keywords-extraction nlp-machine-learning pytorch sklearn spacy spacy-nlp tensorflow
Last synced: 23 Dec 2024
https://github.com/giuliosmall/twitter-trending-topics-pipeline
This project demonstrates trending topic detection using Apache Spark and MinIO. It processes Twitter JSON data with PySpark, leveraging distributed data processing and cloud storage. The entire project is containerized with Docker for easy deployment across architectures.
docker minio nlp pyspark pytest spacy spark streamlit
Last synced: 11 Dec 2024
https://github.com/lfoppiano/docker-image-spacy
Docker image for shipping spaCy
Last synced: 18 Dec 2024
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 20 Dec 2024
https://github.com/dagmawi-22/hotel-ai
Hotel Customer Support Chatbot Rest API
django nltk pyspellchecker python spacy
Last synced: 17 Dec 2024
https://github.com/manik2000/radiohead-lyrics
NLP analysis of Radiohead's song lyrics.
embeddings huggingface-transformers nlp spacy
Last synced: 17 Dec 2024
https://github.com/michabirklbauer/hgb_dse_text_mining_solutions
Solutions for the practical part of the lecture Text Mining
deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining
Last synced: 04 Jan 2025
https://github.com/rahul1582/named-entity-recognition
A Keras implementation of a bidirectional LSTM for Named Entity Recognition.
bidirectional-lstm keras named-entity-recognition spacy tensorflow
Last synced: 13 Dec 2024
https://github.com/brianj-4/ai-race-engineer
AI Race Engineer for the F1 Games
ai f1-22 intent-classification named-entity-recognition natural-language-processing nlp spacy
Last synced: 13 Dec 2024
https://github.com/f1uctus/webanno2spacy
Convert WebAnno TSVs to spaCy Doc objects.
spacy spacy-extension webanno webanno-tsv
Last synced: 09 Oct 2024
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 18 Dec 2024
https://github.com/adishtienmetz/context-game
A context word guessing game. Try to guess the word in the fewest tries!
Last synced: 09 Oct 2024
https://github.com/kr1shnasomani/summarai
Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)
natural-language-processing pytextrank pytorch spacy transformers
Last synced: 21 Dec 2024
https://github.com/saifinohwal/sentiment-analysis
Sentiment analysis of Steve Jobs speech
lemmetization nlp spacy summarization tokenization wordcloud-visualization
Last synced: 18 Dec 2024
https://github.com/touradbaba/nlp-notebooks
This repository contains Jupyter notebooks on various NLP techniques, including text processing, classification, sentiment analysis, and topic modeling.
machine-learning nlp nltk sentiment-analysis spacy text-classification text-processing topic-modeling
Last synced: 09 Oct 2024
https://github.com/arya-io/ner-entitylinker
A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.
ai disambiguation entityextraction entitylinking machinelearning namedentityrecognition naturallanguageprocessing nlp python spacy streamlit textprocessing wikipediaapi
Last synced: 11 Jan 2025