Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2025-01-24 00:29:24 UTC
- JSON Representation
https://github.com/rkirlew/custom-resume-ner-model-development-with-spacy
I developed a custom Named Entity Recognition (NER) model using spaCy. The process involved manually annotating data, training the model, and evaluating its performance on unseen text. This project provided hands-on experience in working with NLP models, data annotation, and model training pipelines.
machine-learning named-entity-recognition ner spacy spacy-nlp
Last synced: 11 Jan 2025
https://github.com/sudip-13/nlp
This repo for tutorial NLP dialog flow chat bot back end configured
dialogflow fastapi fasttext mogodb ner regex spacy tf-idf
Last synced: 14 Oct 2024
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 14 Oct 2024
https://github.com/serenasensini/medspacy-tutorial
Use case to show medspaCy functionalities.
medspacy nlp nlp-machine-learning spacy spacy-nlp spacy-pipeline
Last synced: 21 Jan 2025
https://github.com/aditya172926/text_summarization
Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.
glove machine-learning nlp python scikit-learn spacy
Last synced: 24 Jan 2025
https://github.com/innerdoc/spacy-for-datashare
Let spaCy do the parsing of Named Entities for documents in the Datashare platform
datashare elasticsearch named-entity-recognition natural-language-processing spacy
Last synced: 21 Jan 2025
https://github.com/debugger404/multilanguage-pos
Named Entity Recognition with SpaCy - 🌐📝 Repository for NER using SpaCy's MultiLanguage module. Supports multiple languages.
multilanguage named-entity-recognition ner python3 spacy
Last synced: 22 Dec 2024
https://github.com/tomhalloin/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 16 Nov 2024
https://github.com/bghorvath/TextMiningTheBechdelTest
Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test
bechdel bechdel-test coreference-resolution neuralcoref spacy
Last synced: 16 Nov 2024
https://github.com/sloev/spacy_onnx_sentiment_english
english sentiment model for spacy
onnx-models sentiment-analysis spacy spacy-pipeline
Last synced: 08 Dec 2024
https://github.com/fferegrino/zeldakg
A TLOZ inspired knowledge graph
infobox knowledge-graph nltk pandas python spacy wikidata
Last synced: 15 Dec 2024
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 14 Oct 2024
https://github.com/turbolent/spacy-thrift-docker
spacy-thrift as a Docker container
docker named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy thrift
Last synced: 08 Dec 2024
https://github.com/karimosman89/legal-document-nlp
Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements.Use NLP techniques for named entity recognition and text classification.Streamline the review process for legal teams by automating information extraction.
nltk python scikit-learn spacy
Last synced: 28 Dec 2024
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 13 Dec 2024
https://github.com/centrefordigitalhumanities/textminer
A script to detect named entities and store them in an Elasticsearch annotated_text field
annotation elasticsearch ner spacy
Last synced: 25 Dec 2024
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/ivangael/nlp-chatbot-api
A NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 31 Oct 2024
https://github.com/woranov/spacy-lazy-docbin
Lazy-loadable and indexable spaCy DocBins
Last synced: 18 Jan 2025
https://github.com/naveen3830/splashtop_analysis
This repository contains the code for my webapp splashtop website analysis.
nlp-keywords-extraction python spacy streamlit
Last synced: 07 Dec 2024
https://github.com/bglid/job-application-helper
Project to incorporate web scraping of job applications and then analyze them using NLP methods.
nlp spacy streamlit text-processing webscraping
Last synced: 07 Dec 2024
https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries
A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.
clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization
Last synced: 21 Dec 2024
https://github.com/luis54929/oscarbot
OscarBot: Chatbot de IA personalizado para el área de tecnología del Banco de Occidente. Asistente inteligente para procesos internos y consultas hacia tecnología..
ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy
Last synced: 21 Dec 2024
https://github.com/jamnicki/bachelor_thesis_project
System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning
active-learning active-learning-in-nlp annotation-tool argilla kpwr named-entity-recognition nlp optimization sampling-methods sequence-labeling sequential-data spacy
Last synced: 21 Dec 2024
https://github.com/rohetoric/text-vector-visualisation
Website: https://rohetoric.github.io/text-vector-visualisation/
data-science data-visualization fasttext fasttext-embeddings machine-learning python3 spacy spacy-nlp tensorflow tensorflow-examples tensorflow-experiments tensorflow-tutorials tensorflow1 tensorflow2
Last synced: 21 Dec 2024
https://github.com/imvladikon/quora-question-pair
duplicates detection experiments on Quora Question Pairs (QQP)
Last synced: 02 Jan 2025
https://github.com/ahmedabdalkreem/grammer-auto-correct
In this project work to make classification between the phase is correct or wrong if phase is right print the correct phase if phase is wrong be input of Transfer Learning and print the phase begore correct.
decision-trees logistic-regression machine-learning matplotlib-pyplot naive-bayes-classifier nlp nltk-library pandas-library python random-forest sklearn spacy svm-model transfer-learning
Last synced: 16 Jan 2025
https://github.com/oroszgy/mltools
Common utility methods and classes to ease the work with sklearn, spacy, pandas, matplotlib
data-science machine-learning nlp pandas sklearn sklearn-compatible spacy tools
Last synced: 08 Dec 2024
https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods
Text-Summarizer-Using-NLP-and-TF-IDF-Methods
Last synced: 09 Dec 2024
https://github.com/benevanio/nasa-api-astro
Projeto utilizando a API da nasa.
apdo api api-client api-rest api-server astronomy css frond-end-development html5 javascipt javascipt-ai javascript nasa-api nasa-data react-router reactjs space spaceship spacy
Last synced: 29 Nov 2024
https://github.com/zofiaqlt/nlp_libraries_tweets_analysis
🎯 Exploration of NLP libraries (nltk, spacy) and tweets analysis - use of Python and JupyterLab (Data collection, Cleaning, EDA, Classification, and Data Visualization)
Last synced: 12 Jan 2025
https://github.com/kr1shnasomani/summarai
Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)
natural-language-processing pytextrank pytorch spacy transformers
Last synced: 21 Dec 2024
https://github.com/lfoppiano/docker-image-spacy
Docker image for shipping spacy
Last synced: 18 Dec 2024
https://github.com/isabelleysseric/question-answering
Building a Natural Language Question & Answer Search Engine with corpus in Python language.
corpus deep-learning nlp qa question-answering spacy whoosh
Last synced: 30 Dec 2024
https://github.com/d5555/textcat_dataset_imdb
Movie Review Dataset for binary sentiment classification
categories dataset spacy textcat textcategorizer
Last synced: 02 Jan 2025
https://github.com/veldhub/veld_data__akp_ner_linkedcat
data veld containg machine inferenced named entities and context data.
nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 21 Jan 2025
https://github.com/emmy-bradfield/hilly_xmas
A simple ChatBot built using openAI's davinci 003 as a gift for a dear friend of ours
machine-learning natural-language-processing openai python spacy
Last synced: 21 Jan 2025
https://github.com/aydan-moon/news_headlines_ner
Named Entity Recognition (NER) model for analyzing entities in news headlines using spaCy and trained on the CoNLL-2003 dataset.
conll-2003 ner nlp python spacy
Last synced: 21 Jan 2025
https://github.com/veldhub/veld_code__spacy
Code velds encapsulating usage of spaCy.
Last synced: 21 Jan 2025
https://github.com/yathartharora/twitter_bot
A twitter bot using tweepy API and phrasematching
nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot
Last synced: 07 Jan 2025
https://github.com/praadnya/govt-circular-analysis
Uses OCR and NER techniques for parsing Goverment Circulars
annotations graphdb ner ocr spacy
Last synced: 07 Jan 2025
https://github.com/403errors/ai-docparser
An application framework developed using the latest AI technologies to extract the values of specific pre-defined keys from a given PDF document. Also generating a document summary using the key & values extracted in the while doing so.
automation csv-export nlp pdf-files python3 regex reinforcement-learning spacy
Last synced: 21 Jan 2025
https://github.com/veldhub/veld_chain__apis_ner_transform_to_gold
Chain velds encapsulating extraction and conversion of gold data.
named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 21 Jan 2025
https://github.com/veldhub/veld_data__apis_spacy_ner_models
spacy NER models, trained on APIS ÖBL data.
named-entity-recognition ner nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 21 Jan 2025
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 20 Dec 2024
https://github.com/veldhub/veld_chain__train_spacy_apis_ner
Chain velds encapsulating a spacy NER training setup on APIS data.
named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 21 Jan 2025
https://github.com/veldhub/veld_chain__akp_ner_inference
A chain veld encapsulating NER inference.
named-entity-recognition ner nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 21 Jan 2025
https://github.com/veldhub/veld_chain__apis_ner_evaluate_old_models
Chain velds encapsulating evalution of old spacy models.
named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 21 Jan 2025
https://github.com/veldhub/veld_chain__mara_load_and_publish_models
Chain velds for publishing self-trained MARA models to huggingface.
Last synced: 21 Jan 2025
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spacy and Sklearn
Last synced: 02 Jan 2025
https://github.com/yashaswini-lankalapalli/text-summarization
Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.
Last synced: 12 Oct 2024
https://github.com/michabirklbauer/hgb_dse_text_mining
Contents for the practical part of the lecture Text Mining
deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining
Last synced: 09 Nov 2024
https://github.com/raul23/nlp
Performing various NLP tasks with different Python libraries
cld2 compact-language-detector langdetect langid language-classification nameparser natural-language-processing nlp nltk python spacy textcat
Last synced: 13 Jan 2025
https://github.com/stephenombuya/ai-powered-writing-assistant
An advanced writing assistant that helps users improve their writing through grammar checking, style analysis, and intelligent suggestions.
flask-application pytest python3 spacy sqlalchemy sqlite3 textblob-sentiment-analysis writing-assistant
Last synced: 09 Jan 2025
https://github.com/free-analytics/nlp
自然言語処理
ai chatbot jupyter-notebook llm machine-learning nlp python spacy transformers
Last synced: 09 Jan 2025
https://github.com/philippeitis/nlp_specifier
Formal verification for natural language software documentation
natural-language-processing nlp spacy
Last synced: 21 Jan 2025
https://github.com/ashenoooone/semantic-book-analyzer
Веб-сервис для извлечения ключевых слов из введения книг по дискретной математике в формате PDF. Фронтенд: React.js, Webpack, FSD, RTK, TypeScript. Бэкенд: FastAPI, FastAPI Users, SQLAlchemy, Pydantic, Pymorphy3, Spacy. Включает авторизацию, регистрацию и историю запросов. 📚🔍
fastapi fastapi-users nlp pymorphy2 pymorphy3 python3 reactjs rtk rtkquery spacy spacy-nlp sqlalchemy typescript
Last synced: 15 Jan 2025
https://github.com/samarthhchinivar/nlp-codebasics-playlist
This is a GitHub repository containing Jupyter notebooks and Python scripts related to natural language processing (NLP) concepts and techniques covered in the "NLP with Python" playlist by Codebasics YouTube channel. The notebooks cover topics such as text preprocessing, feature extraction using Python libraries NLTK, SpaCy
nlp-machine-learning nltk python3 spacy
Last synced: 06 Jan 2025
https://github.com/izuna385/arxiv-checker-backend
This is an API and backend modules to return accepted papers related to natural language processing from arxiv.
docker fastapi natural-language-processing pytest spacy tdd tdd-python
Last synced: 06 Jan 2025
https://github.com/mugambi645/spacy-text-classification
Text classification with spacy
Last synced: 11 Nov 2024
https://github.com/blue-codes-yep/AI.AT
AI-Powered Text-To-Speech Script Generator This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.
ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp
Last synced: 06 Jan 2025
https://github.com/bonysmoke/speliuk
A more accurate spelling correction for the Ukrainian language.
correction kenlm spacy spelling symspell ukrainian
Last synced: 10 Oct 2024
https://github.com/zackakil/nlp-using-word-vectors
Code resources for Central London Data Science Project Nights meetup on word vectors
machine-learning natural-language-processing nlp python spacy word-embeddings word-vectors
Last synced: 13 Nov 2024
https://github.com/salma-4/nlp-task
Preprocessing using NLTK ,SPACY
nltk-library python spacy svm-model
Last synced: 22 Jan 2025
https://github.com/iv4n-ga6l/nlp-chatbot-api
A NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 25 Jan 2025
https://github.com/rggh/api-4
Using FastAPI with spaCy to identify entities
Last synced: 07 Dec 2024
https://github.com/rfdzan/summarize-search-result
extractive text summarization with a handful of different libraries
natural-language-processing python spacy
Last synced: 28 Dec 2024
https://github.com/pythonicforge/e.c.h.o-mini
A miniature model of ECHO intended for my portfolio
ai express javascript nltk python spacy
Last synced: 22 Jan 2025
https://github.com/imvladikon/spacy-trankit
💥 Trankit models directly in spaCy💥
nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit
Last synced: 30 Nov 2024
https://github.com/ssciwr/argumentation-management
Annotator combining different NLP pipelines.
corpus-linguistics cwb hacktoberfest natural-language-processing nlp part-of-speech python sentencizer spacy tokenization
Last synced: 18 Jan 2025
https://github.com/shiv010hbtu/sentiment-analysis
Sentiment Analysis
django pandas python spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/vanheemstrasystems/spacy
SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Last synced: 17 Nov 2024
https://github.com/lilivalgo/nlp-for-ipcc-climate-reports
This project combines web scraping, PDF processing, and Natural Language Processing (NLP) to extract and analyze IPCC climate reports. It automates downloading PDFs, processes file validation, and applies NLP for data insights.
beautifulsoup4 matplotlib nlp pandas pypdf2 python requests seaborn spacy text-analysis text-processing webscraping
Last synced: 17 Nov 2024
https://github.com/asaficontact/stack_classifier_project
We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using Regular Expressions, we removed HTML tags and punctuation. We also utilized spaCy to tokenize, lemmatize and remove stop words. Using Keras, we built a 4 layered artificial neural network with a 20% dropout rate using relu and softmax activation functions. We also utilized the adam optimizer and categorical cross-entropy loss function which classified 11 tags 88% successfully.
cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization
Last synced: 22 Dec 2024
https://github.com/chewzzz1014/fyp-ner-archive
final-year-project flair fyp machine-learning ner nlp project spacy transformer
Last synced: 18 Jan 2025
https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect
Модель NER для определения названий компаний, стандартов госта и названий товаров с точностью 97%
named-entity-recognition ner python spacy spacy-models
Last synced: 09 Oct 2024
https://github.com/blacksujit/quantumlens
QuantumLens is a cutting-edge, AI-powered information assistant designed to revolutionize how you interact with and process information. By leveraging advanced machine learning algorithms and natural language processing techniques.
ai bert bert-embeddings dataanalysis information integration-flow intellij-idea ml model models nlp-machine-learning processing project research spacy spacy-models spacy-nlp spacy-pipeline summeriza summerization
Last synced: 15 Dec 2024
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 08 Dec 2024
https://github.com/raniasakrr/breakthrough-hire
The project aims to help job seekers understand the essential qualifications required for specific jobs and assess how well their skills match those positions. Additionally, it assists recruiters in improving their resume selection processes by analyzing and comprehending job advertisements.
bert cvanalysis flask ner nlp python scraping sentence-similarity spacy sqlalchemy transformer
Last synced: 09 Oct 2024
https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch
Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch
elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec
Last synced: 22 Dec 2024
https://github.com/viniciusmecosta/cv_classifier
A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.
catboost fastapi python3 sklearn spacy
Last synced: 09 Oct 2024
https://github.com/zevio/pcu_nlp
NLP pipeline (spacy.io) for PCU project
component natural-language-processing nlp nlp-pipeline pcu pcu-nlp pipeline python spacy
Last synced: 07 Dec 2024
https://github.com/miweru/vrt_spacy
corpora linguistic-corpora linguistics nlp spacy vrt wrapper
Last synced: 07 Dec 2024
https://github.com/xwiz/spacy_symspell
Spacy symspell extension
spacy spelling-correction spelling-suggestions symspell
Last synced: 07 Dec 2024
https://github.com/praju-1/deep_learning
This repository include Deep_learning concept which is subset of machine learning which is based on Neural Networking.
keras nltk pandas python sklearn spacy statistics tensorflow
Last synced: 15 Dec 2024
https://github.com/sydney-informatics-hub/clause-segmenter
A clause segmenting tool utilising Python's SpaCy
Last synced: 09 Oct 2024
https://github.com/arkadiuszkaros/nlp-book-pos-extractor
This project focuses on extracting sentences from the text of two popular book series: Harry Potter and Game of Thrones. Using Natural Language Processing (NLP) techniques powered by spaCy, the project aims to identify and analyze the parts of speech (POS) for each word in a sentence.
extractor nlp part-of-speech-tagging python spacy
Last synced: 07 Dec 2024
https://github.com/nxgeo/id-svo-extractor
id-svo-extractor: Extract SVO triples from Indonesian text.
artificial-intelligence indonesian-language indonesian-linguistics indonesian-nlp information-extraction knowledge-extraction knowledge-representation natural-language-processing nlp python rdf-triples spacy spacy-stanza stanza text-analysis triple-extraction
Last synced: 07 Dec 2024