Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2024-12-24 00:24:28 UTC
- JSON Representation
https://github.com/gugarosa/brainy
🧠 An intelligent Python-inspired Machine Learning API for training NLP-based models.
api machine-learning nlp python spacy
Last synced: 07 Dec 2024
https://github.com/fferegrino/zeldakg
A TLOZ inspired knowledge graph
infobox knowledge-graph nltk pandas python spacy wikidata
Last synced: 15 Dec 2024
https://github.com/datarohit/nlp-course-files
The files in this Repo are files for the online NLP-Course from Udemy.com which I completed.
nlp nlp-machine-learning nltk numpy panda python sklearn spacy
Last synced: 23 Dec 2024
https://github.com/devbm7/qgen
Question Generator System
json ml nlp nltk pandas pymupdf-fitz python3 pytorch regular-expressions smtp spacy streamlit t5-large transformers wikipedia-api
Last synced: 10 Dec 2024
https://github.com/turbolent/spacy-thrift-docker
spacy-thrift as a Docker container
docker named-entity-recognition ner nlp part-of-speech part-of-speech-tagger pos python service spacy thrift
Last synced: 08 Dec 2024
https://github.com/parthapray/nlp_pipeline_openai
This repo contains nlp pipeline and openai API integration
gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud
Last synced: 26 Dec 2024
https://github.com/ivangael/nlp-chatbot-api
A NLP project leveraging NLTK for extracting weather data.
flask nlp-api nlp-chatbot nltk python spacy transformers
Last synced: 31 Oct 2024
https://github.com/lexxai/goit_python_ds_hw_12
Модуль 12. Основи NLP.
nlp nlp-machine-learning nlp-spacy nltk nltk-tokenizer spacy spacy-nlp
Last synced: 18 Dec 2024
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests, and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 22 Dec 2024
https://github.com/lilivalgo/nlp-analysis-of-un-climate-change-reports
This project uses Natural Language Processing (NLP) techniques to analyze large amounts of textual data from UN reports on climate change. By applying NLP, the project aims to extract valuable information that can shed light on critical aspects of climate change
beautifulsoup4 matplotlib pandas pypdf2 seaborn spacy text-analysis text-processing webscraping
Last synced: 12 Oct 2024
https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review
"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"
matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis
Last synced: 12 Oct 2024
https://github.com/galal-pic/gd-project
annotations data fine-tuning ner nlp python spacy
Last synced: 23 Dec 2024
https://github.com/fyt3rp4til/tfidf-emotiondetection
multinomial-naive-bayes n-grams random-forest spacy tfidf-vectorizer
Last synced: 09 Oct 2024
https://github.com/richackashyap/using-bart-model-and-named-entity-recognition-to-summarize-text-and-create-a-mind-map-
Generation of mind maps based on any given paragraph
Last synced: 18 Dec 2024
https://github.com/pedcapa/nlpower
FastAPI-based service designed to provide real-time text analysis. It leverages some Natural Language Processing (NLP) libraries to offer functionalities such as sentiment analysis, keyword extraction, and text summarization.
Last synced: 09 Oct 2024
https://github.com/surbhi242singh/text_summarizer
machine-learning nlp spacy tokenization
Last synced: 09 Oct 2024
https://github.com/arnabd64/spacy-ner-hf-space
A webapp built using Gradio for demonstrating the capabilities of the Spacy NER pipeline.
gradio huggingface-spaces named-entity-recognition nlp spacy spacy-pipeline token-classification
Last synced: 09 Oct 2024
https://github.com/rrayhka/indonesian-ner-spacy
Fine-tuning SpaCy for Indonesian Named Entity Recognition (NER) with custom dataset.
indonesian named-entity-recognition ner nlp spacy
Last synced: 09 Oct 2024
https://github.com/2pa4ul2/mcq-quiz-maker-nlp
Quizzable a quiz generator for short reviews with Spacy and NLTK
flask nlp nltk python question-generation quizapp spacy
Last synced: 09 Oct 2024
https://github.com/thjbdvlt/quelquhui
tokenizer for french
french french-nlp nlp spacy tokenizer-nlp
Last synced: 09 Oct 2024
https://github.com/shwetajanwekar/capstone-project_1
Capstone project_1 include python code for best fit regression model, SQL feature store and tableau dashboard
accuracy create engine lasso-regression-model linear-regression mysql-database pandas ridge-regression seaborn skewness sklearn-library spacy sqlalchemy standard-deviation transformation variense xtrain ytrain
Last synced: 21 Dec 2024
https://github.com/malcolmgreaves/py_ml_img
A Python 3 image for NLP & ML. Includes spaCy & NLTK model data.
docker-image machine-learning nlp nltk python3 spacy
Last synced: 13 Dec 2024
https://github.com/caterinatasinato/machine-learning-nlp-projects
Projects I worked on as Trainee in Data Analytics at ProfessionAI
gensim matplotlib nltk pandas sklearn spacy
Last synced: 19 Dec 2024
https://github.com/kivanc57/nlp_data_visualization
This project provides Python scripts for analyzing and visualizing text data using efficient NLP methods. It includes tools for creating bar plots, histograms, pie charts, treemaps, violin plots, and word clouds, using libraries such as matplotlib, seaborn, wordcloud, spacy, and textblob.
data-science matplotlib nlp parsing plotting python spacy visualization
Last synced: 09 Oct 2024
https://github.com/prashver/nlp-driven-video-summarizer-and-insight-tool
An NLP-powered tool for transcribing, summarizing, and indexing podcast content, with video-to-audio conversion and multilingual support.
flask-application huggingface-transformers keyword-extraction named-entity-recognition natural-language-processing ntlk spacy speech-to-text speech-translation text-summarization topic-modeling
Last synced: 18 Dec 2024
https://github.com/thjbdvlt/spacy-turlututu
french morphological analysis model for spacy
french french-nlp morphological-analysis nlp part-of-speech-tagging pos-tagging spacy
Last synced: 18 Dec 2024
https://github.com/touradbaba/nlp-notebooks
This repository contains Jupyter notebooks on various NLP techniques, including text processing, classification, sentiment analysis, and topic modeling.
machine-learning nlp nltk sentiment-analysis spacy text-classification text-processing topic-modeling
Last synced: 09 Oct 2024
https://github.com/adishtienmetz/context-game
A context word guessing game. Try to guess the word in minimum tries!
Last synced: 09 Oct 2024
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 18 Dec 2024
https://github.com/f1uctus/webanno2spacy
Convert WebAnno TSVs to spaCy's Doc-s.
spacy spacy-extension webanno webanno-tsv
Last synced: 09 Oct 2024
https://github.com/prateekrajsrivastav/question-answering-model
This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.
huggingface-transformers matplotlib nltk numpy pandas seaborn spacy
Last synced: 20 Dec 2024
https://github.com/parthapray/pii_scrubbing_llm
This repo contains codes about PII scrubbing heuristics search before calling to LLM (local and remote)
chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn
Last synced: 20 Dec 2024
https://github.com/sydney-informatics-hub/clause-segmenter
A clause segmenting tool utilising Python's SpaCy
Last synced: 09 Oct 2024
https://github.com/viniciusmecosta/cv_classifier
A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.
catboost fastapi python3 sklearn spacy
Last synced: 09 Oct 2024
https://github.com/raniasakrr/breakthrough-hire
The project aims to help job seekers understand the essential qualifications required for specific jobs and assess how well their skills match those positions. Additionally, it assists recruiters in improving their resume selection processes by analyzing and comprehending job advertisements.
bert cvanalysis flask ner nlp python scraping sentence-similarity spacy sqlalchemy transformer
Last synced: 09 Oct 2024
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 08 Dec 2024
https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect
Модель NER для определения названий компаний, стандартов госта и названий товаров с точностью 97%
named-entity-recognition ner python spacy spacy-models
Last synced: 09 Oct 2024
https://github.com/salma-4/nlp-task
Preprocessing using NLTK ,SPACY
nltk-library python spacy svm-model
Last synced: 09 Oct 2024
https://github.com/asaficontact/stack_classifier_project
We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using Regular Expressions, we removed HTML tags and punctuation. We also utilized spaCy to tokenize, lemmatize and remove stop words. Using Keras, we built a 4 layered artificial neural network with a 20% dropout rate using relu and softmax activation functions. We also utilized the adam optimizer and categorical cross-entropy loss function which classified 11 tags 88% successfully.
cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization
Last synced: 22 Dec 2024
https://github.com/etienne-bobo/information-retreival_project
In this project, it was a question of designing a model capable of recognizing terms related to our field which is: INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES.
information-retrieval nlp prodigy spacy
Last synced: 11 Nov 2024
https://github.com/sadegh15khedry/comments-sentiment-analysis
text classification on comments using an ANN model.
collections deep-learning keras nlp numpy pandas python sentiment-analysis sklearn spacy unicodedata
Last synced: 11 Nov 2024
https://github.com/bonysmoke/speliuk
A more accurate spelling correction for the Ukrainian language.
correction kenlm spacy spelling symspell ukrainian
Last synced: 10 Oct 2024
https://github.com/philippeitis/nlp_specifier
Formal verification for natural language software documentation
natural-language-processing nlp spacy
Last synced: 12 Oct 2024
https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon
An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.
matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud
Last synced: 23 Nov 2024
https://github.com/woranov/spacy-lazy-docbin
Lazy-loadable and indexable spaCy DocBins
Last synced: 17 Nov 2024
https://github.com/yashaswini-lankalapalli/text-summarization
Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.
Last synced: 12 Oct 2024
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 17 Nov 2024
https://github.com/direct-phonology/phony
phonology in spaCy!
linguistics nlp phonology python spacy
Last synced: 19 Nov 2024
https://github.com/jonas-jonas/text_mining
Sentiment Analysis using spaCy
jupyter-notebook nlp sentiment-analysis spacy
Last synced: 20 Dec 2024
https://github.com/ahmedabdalkreem/grammer-auto-correct
In this project work to make classification between the phase is correct or wrong if phase is right print the correct phase if phase is wrong be input of Transfer Learning and print the phase begore correct.
decision-trees logistic-regression machine-learning matplotlib-pyplot naive-bayes-classifier nlp nltk-library pandas-library python random-forest sklearn spacy svm-model transfer-learning
Last synced: 16 Nov 2024
https://github.com/centrefordigitalhumanities/textminer
A script to detect named entities and store them in an Elasticsearch annotated_text field
annotation elasticsearch ner spacy
Last synced: 25 Dec 2024
https://github.com/rahul1582/text-summarisation-using-spacy
A Text Summarizer deployed to Heroku
heroku nlp spacy text-summarisation
Last synced: 13 Dec 2024
https://github.com/izuna385/arxiv-checker-backend
This is an API and backend modules to return accepted papers related to natural language processing from arxiv.
docker fastapi natural-language-processing pytest spacy tdd tdd-python
Last synced: 07 Dec 2024
https://github.com/samarthhchinivar/nlp-codebasics-playlist
This is a GitHub repository containing Jupyter notebooks and Python scripts related to natural language processing (NLP) concepts and techniques covered in the "NLP with Python" playlist by Codebasics YouTube channel. The notebooks cover topics such as text preprocessing, feature extraction using Python libraries NLTK, SpaCy
nlp-machine-learning nltk python3 spacy
Last synced: 10 Nov 2024
https://github.com/naveen3830/splashtop_analysis
This repository contains the code for my webapp splashtop website analysis.
nlp-keywords-extraction python spacy streamlit
Last synced: 07 Dec 2024
https://github.com/bglid/job-application-helper
Project to incorporate web scraping of job applications and then analyze them using NLP methods.
nlp spacy streamlit text-processing webscraping
Last synced: 07 Dec 2024
https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries
A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.
clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization
Last synced: 21 Dec 2024
https://github.com/luis54929/oscarbot
OscarBot: Chatbot de IA personalizado para el área de tecnología del Banco de Occidente. Asistente inteligente para procesos internos y consultas hacia tecnología..
ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy
Last synced: 21 Dec 2024
https://github.com/jamnicki/bachelor_thesis_project
System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning
active-learning active-learning-in-nlp annotation-tool argilla kpwr named-entity-recognition nlp optimization sampling-methods sequence-labeling sequential-data spacy
Last synced: 21 Dec 2024
https://github.com/rohetoric/text-vector-visualisation
Website: https://rohetoric.github.io/text-vector-visualisation/
data-science data-visualization fasttext fasttext-embeddings machine-learning python3 spacy spacy-nlp tensorflow tensorflow-examples tensorflow-experiments tensorflow-tutorials tensorflow1 tensorflow2
Last synced: 21 Dec 2024
https://github.com/oroszgy/mltools
Common utility methods and classes to ease the work with sklearn, spacy, pandas, matplotlib
data-science machine-learning nlp pandas sklearn sklearn-compatible spacy tools
Last synced: 08 Dec 2024
https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods
Text-Summarizer-Using-NLP-and-TF-IDF-Methods
Last synced: 09 Dec 2024
https://github.com/benevanio/nasa-api-astro
Projeto utilizando a API da nasa.
apdo api api-client api-rest api-server astronomy css frond-end-development html5 javascipt javascipt-ai javascript nasa-api nasa-data react-router reactjs space spaceship spacy
Last synced: 29 Nov 2024
https://github.com/kr1shnasomani/summarai
Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)
natural-language-processing pytextrank pytorch spacy transformers
Last synced: 21 Dec 2024
https://github.com/lfoppiano/docker-image-spacy
Docker image for shipping spacy
Last synced: 18 Dec 2024
https://github.com/tony-stone-code/codealpha_simple_chatbot
This is a simple chatbot, built with python.
ai bot-development chatbot css flask flask-application flask-web htlm5 javascript python python3 spacy spacy-nlp web-development
Last synced: 23 Nov 2024
https://github.com/rfdzan/summarize-search-result
extractive text summarization with a handful of different libraries
natural-language-processing python spacy
Last synced: 07 Nov 2024
https://github.com/darkrockmountain/spacy-ewc
A spaCy library for Named Entity Recognition with Elastic Weight Consolidation.
catastrophic-forgetting clasificacion entity-recognition ewc labeling machine-learning machine-learning-algorithms model ner nlp spacy spacy-nlp thinc
Last synced: 08 Nov 2024
https://github.com/yathartharora/twitter_bot
A twitter bot using tweepy API and phrasematching
nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot
Last synced: 10 Nov 2024
https://github.com/imvladikon/quora-question-pair
duplicates detection experiments on Quora Question Pairs (QQP)
Last synced: 09 Nov 2024
https://github.com/isabelleysseric/question-answering
Building a Natural Language Question & Answer Search Engine with corpus in Python language.
corpus deep-learning nlp qa question-answering spacy whoosh
Last synced: 08 Nov 2024
https://github.com/xettrisomeman/speechandtext
Practicing NLP using spacy and Sklearn
Last synced: 09 Nov 2024
https://github.com/d5555/textcat_dataset_imdb
Movie Review Dataset for binary sentiment classification
categories dataset spacy textcat textcategorizer
Last synced: 09 Nov 2024
https://github.com/michabirklbauer/hgb_dse_text_mining
Contents for the practical part of the lecture Text Mining
deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining
Last synced: 09 Nov 2024
https://github.com/raul23/nlp
Performing various NLP tasks with different Python libraries
cld2 compact-language-detector langdetect langid language-classification nameparser natural-language-processing nlp nltk python spacy textcat
Last synced: 14 Nov 2024
https://github.com/victowang/wikigame
A python script to play the Wikipedia game
nlp python spacy wikigame wikipedia-game
Last synced: 09 Nov 2024
https://github.com/simeonhristov99/ati
Ati is a web-based application for predicting which famous classic Bulgarian novelist wrote a piece of text (short or long).
authorship-attribution embeddings jupyter-notebook multiclass-classification nlp optuna pycaret python3 scraping-websites spacy transformer
Last synced: 14 Nov 2024
https://github.com/medspacy/nlp_postprocessor
A spaCy component for executing custom logic at the end of a pipeline.
clinical-nlp medspacy nlp nlp-library pipeline spacy
Last synced: 11 Nov 2024
https://github.com/mugambi645/spacy-text-classification
Text classification with spacy
Last synced: 11 Nov 2024
https://github.com/arya-io/ner-entitylinker
A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.
ai disambiguation entityextraction entitylinking machinelearning namedentityrecognition naturallanguageprocessing nlp python spacy streamlit textprocessing wikipediaapi
Last synced: 12 Nov 2024
https://github.com/maxzirps/lyrics-sentiment-analysis
Analyse lyrics for their sentiment score
nlp pandas sentiment-analysis spacy spacy-nlp
Last synced: 13 Nov 2024
https://github.com/zofiaqlt/nlp_libraries_tweets_analysis
🎯 Exploration of NLP libraries (nltk, spacy) and tweets analysis - use of Python and JupyterLab (Data collection, Cleaning, EDA, Classification, and Data Visualization)
Last synced: 13 Nov 2024
https://github.com/zackakil/nlp-using-word-vectors
Code resources for Central London Data Science Project Nights meetup on word vectors
machine-learning natural-language-processing nlp python spacy word-embeddings word-vectors
Last synced: 13 Nov 2024
https://github.com/hariprasath-v/machinehack_intel_oneapi_hackathon_the_llm_challenge
Generate a response for the question from pre-defined text using LLM(Extracted Question-Answering(QA) Model).
accuracy exploratory-data-analysis extractive-question-answering huggingface machine-learning matplotlib nlp nltk numpy pandas python seaborn sklearn spacy spellchecker wordcloud
Last synced: 13 Nov 2024
https://github.com/lucas54neves/dependency-parsing
Repository of the project for the Introduction to Natural Language Processing discipline of the Computer Science course at the University of Lavras, whose task objective is to explore the parsing of dependencies, using the SpaCy tool.
dependency-parsing nlp python spacy spacy-nlp
Last synced: 13 Nov 2024
https://github.com/rggh/api-4
Using FastAPI with spaCy to identify entities
Last synced: 07 Dec 2024
https://github.com/araobp/bach-network
J. S. Bach's network with spaCy(NLP)
Last synced: 17 Nov 2024
https://github.com/ashenoooone/semantic-book-analyzer
Веб-сервис для извлечения ключевых слов из введения книг по дискретной математике в формате PDF. Фронтенд: React.js, Webpack, FSD, RTK, TypeScript. Бэкенд: FastAPI, FastAPI Users, SQLAlchemy, Pydantic, Pymorphy3, Spacy. Включает авторизацию, регистрацию и историю запросов. 📚🔍
fastapi fastapi-users nlp pymorphy2 pymorphy3 python3 reactjs rtk rtkquery spacy spacy-nlp sqlalchemy typescript
Last synced: 15 Nov 2024
https://github.com/imvladikon/spacy-trankit
💥 Trankit models directly in spaCy💥
nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit
Last synced: 30 Nov 2024
https://github.com/ssciwr/argumentation-management
Annotator combining different NLP pipelines.
corpus-linguistics cwb hacktoberfest natural-language-processing nlp part-of-speech python sentencizer spacy tokenization
Last synced: 17 Nov 2024
https://github.com/shiv010hbtu/sentiment-analysis
Sentiment Analysis
django pandas python spacy tensorflow
Last synced: 17 Nov 2024
https://github.com/vanheemstrasystems/spacy
SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Last synced: 17 Nov 2024