Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/veldhub/veld_data__akp_ner_linkedcat

Data veld containing machine-inferred named entities and context data.

nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/veldhub/veld_code__spacy

Code velds encapsulating usage of spaCy.

nlp spacy spacy-nlp

Last synced: 21 Jan 2025

https://github.com/pythonicforge/e.c.h.o-mini

A miniature model of ECHO intended for my portfolio

ai express javascript nltk python spacy

Last synced: 22 Jan 2025

https://github.com/imvladikon/spacy-trankit

💥 Trankit models directly in spaCy💥

nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit

Last synced: 28 Jan 2025

https://github.com/403errors/ai-docparser

An application framework developed using the latest AI technologies to extract the values of specific pre-defined keys from a given PDF document. It also generates a document summary from the keys and values extracted along the way.

automation csv-export nlp pdf-files python3 regex reinforcement-learning spacy

Last synced: 21 Jan 2025

https://github.com/veldhub/veld_chain__apis_ner_transform_to_gold

Chain velds encapsulating extraction and conversion of gold data.

named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/veldhub/veld_chain__train_spacy_apis_ner

Chain velds encapsulating a spacy NER training setup on APIS data.

named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/e3oroush/music_sorting

A simple project for categorizing your local music. Find and delete duplicate music files on your local machine.

duplication-detection mediainfo music-duplication-detection music-information-retrieval python spacy

Last synced: 29 Jan 2025

https://github.com/vanheemstrasystems/spacy

SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.

spacy

Last synced: 17 Nov 2024

https://github.com/lilivalgo/nlp-for-ipcc-climate-reports

This project combines web scraping, PDF processing, and Natural Language Processing (NLP) to extract and analyze IPCC climate reports. It automates downloading PDFs, processes file validation, and applies NLP for data insights.

beautifulsoup4 matplotlib nlp pandas pypdf2 python requests seaborn spacy text-analysis text-processing webscraping

Last synced: 17 Nov 2024

https://github.com/veldhub/veld_chain__apis_ner_evaluate_old_models

Chain velds encapsulating evaluation of old spaCy models.

named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/veldhub/veld_chain__mara_load_and_publish_models

Chain velds for publishing self-trained MARA models to huggingface.

nlp spacy spacy-nlp

Last synced: 21 Jan 2025

https://github.com/tbarlow12/wiki-answer

I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions

nlp python question-answering spacy wikipedia

Last synced: 02 Feb 2025

https://github.com/blacksujit/quantumlens

QuantumLens is a cutting-edge, AI-powered information assistant designed to revolutionize how you interact with and process information by leveraging advanced machine learning algorithms and natural language processing techniques.

ai bert bert-embeddings dataanalysis information integration-flow intellij-idea ml model models nlp-machine-learning processing project research spacy spacy-models spacy-nlp spacy-pipeline summeriza summerization

Last synced: 08 Feb 2025

https://github.com/xettrisomeman/speechandtext

Practicing NLP using spacy and Sklearn

nlp sklearn spacy

Last synced: 02 Jan 2025

https://github.com/yashaswini-lankalapalli/text-summarization

Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.

nlp python spacy transformers

Last synced: 12 Oct 2024

https://github.com/parthapray/pii_scrubbing_llm

This repo contains code for a heuristic PII-scrubbing search that runs before calling an LLM (local and remote).

chatgpt-api claude-api cloud edge fastapi hybrid llm ner-spacy ollama-api pii pii-detection scrubbing spacy sqlalchemy uvicorn

Last synced: 12 Feb 2025

https://github.com/prateekrajsrivastav/question-answering-model

This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.

huggingface-transformers matplotlib nltk numpy pandas seaborn spacy

Last synced: 12 Feb 2025

https://github.com/aubainmbk/analyse-des-avis-clients-amazon

Uses sentiment analysis and clustering on Amazon reviews for marketing and customer satisfaction purposes.

clustering marketing-analytics nlp-machine-learning nltk pca spacy vader-sentiment-analysis

Last synced: 05 Feb 2025

https://github.com/samarthhchinivar/nlp-codebasics-playlist

This is a GitHub repository containing Jupyter notebooks and Python scripts related to natural language processing (NLP) concepts and techniques covered in the "NLP with Python" playlist by Codebasics YouTube channel. The notebooks cover topics such as text preprocessing, feature extraction using Python libraries NLTK, SpaCy

nlp-machine-learning nltk python3 spacy

Last synced: 06 Jan 2025

https://github.com/wesslen/spacy-ecfr-ner

spaCy-Prodigy workflow for NER Citation model on eCFR Banking Regulation

nlp prodigy spacy

Last synced: 13 Feb 2025

https://github.com/medspacy/nlp_postprocessor

A spaCy component for executing custom logic at the end of a pipeline.

clinical-nlp medspacy nlp nlp-library pipeline spacy

Last synced: 09 Jan 2025

https://github.com/tristan-mcinnis/spacy-models-setup-and-testing

A Python utility for downloading, storing, and testing Spacy language models for English and Chinese NLP tasks.

chinese english nlp python simple-project spacy testing

Last synced: 10 Feb 2025

https://github.com/thekartikeyamishra/documentsummarizer

The Document Summarizer is a Python-based application that extracts summaries from uploaded text and PDF documents using Natural Language Processing (NLP) techniques. This project includes a basic GUI to interact with the application, upload documents, and view the summarized content.

machine-learning nlp nlp-machine-learning pdfplumber python spacy tkinter tkinter-gui

Last synced: 02 Feb 2025

https://github.com/ayaz-amin/speechpos

A simple Python script that tags speech with parts of speech.

deep-learning machine-learning python3 spacy

Last synced: 29 Jan 2025

https://github.com/arkadiuszkaros/nlp-book-pos-extractor

This project focuses on extracting sentences from the text of two popular book series: Harry Potter and Game of Thrones. Using Natural Language Processing (NLP) techniques powered by spaCy, the project aims to identify and analyze the parts of speech (POS) for each word in a sentence.

extractor nlp part-of-speech-tagging python spacy

Last synced: 02 Feb 2025

https://github.com/izuna385/arxiv-checker-backend

An API and backend modules that return accepted papers related to natural language processing from arXiv.

docker fastapi natural-language-processing pytest spacy tdd tdd-python

Last synced: 02 Feb 2025

https://github.com/leosimoes/coursera-usp-pln-i

Activities from the course "Processamento Neural de Linguagem Natural em Português I" (Neural Natural Language Processing in Portuguese I), offered by USP through Coursera.

nlp pln python spacy

Last synced: 30 Jan 2025

https://github.com/blue-codes-yep/AI.AT

AI-Powered Text-To-Speech Script Generator. This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.

ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp

Last synced: 06 Jan 2025

https://github.com/rfdzan/summarize-search-result

Extractive text summarization with a handful of different libraries.

natural-language-processing python spacy

Last synced: 28 Dec 2024

https://github.com/cmucheru/chatbot

A conversational chatbot for embedding in a site.

chatbot spacy

Last synced: 08 Feb 2025

https://github.com/jonas-jonas/text_mining

Sentiment Analysis using spaCy

jupyter-notebook nlp sentiment-analysis spacy

Last synced: 13 Feb 2025

https://github.com/elbersb/depdistance

Calculation of dependency distance

conll conll-u spacy udpipe

Last synced: 02 Feb 2025

https://github.com/francislauriano/chatsoftex

A platform developed in Python that aims to automate and speed up the evaluation of technological innovation projects, using artificial intelligence and standardized criteria based on the Lei do Bem.

cryptography fernet firebase flask flask-jwt-extended hugging-face-transformers numpy openai pdfplumber postgresql pyjwt pymupdf-fitz pypdf2 python pytorch scikit-learn scipy spacy sqlalchemy tensorflow

Last synced: 03 Feb 2025

https://github.com/rafelafrance/angiospermtraiter

Using rule-based parsers to extract information from plant treatments

botany python spacy

Last synced: 09 Dec 2024

https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries

A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.

clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization

Last synced: 13 Feb 2025

https://github.com/luis54929/oscarbot

OscarBot: A custom AI chatbot for the technology department of Banco de Occidente. An intelligent assistant for internal processes and technology-related inquiries.

ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy

Last synced: 13 Feb 2025

https://github.com/sohaamir/website_projects

Doing some analytics (scraping, app development) on my GitHub website

nltk requests scrapy spacy streamlit

Last synced: 13 Feb 2025

https://github.com/jamnicki/bachelor_thesis_project

System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning

active-learning active-learning-in-nlp annotation-tool argilla kpwr named-entity-recognition nlp optimization sampling-methods sequence-labeling sequential-data spacy

Last synced: 13 Feb 2025

https://github.com/crodriguez1a/kaggle-la-jobs

Helping the City of Los Angeles to structure and analyze its job descriptions

kaggle linguistic-analysis ml nlu python spacy

Last synced: 09 Feb 2025

https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect

NER model for detecting company names, GOST standards, and product names with 97% accuracy.

named-entity-recognition ner python spacy spacy-models

Last synced: 10 Feb 2025

https://github.com/camara94/nlp-basique

In this tutorial, we discover together the basics of NLP in AI.

gensim nlp nlp-keywords-extraction nlp-machine-learning pytorch sklearn spacy spacy-nlp tensorflow

Last synced: 23 Dec 2024

https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon

An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.

matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud

Last synced: 23 Jan 2025

https://github.com/kr1shnasomani/summarai

Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)

natural-language-processing pytextrank pytorch sentencepiece spacy transformers

Last synced: 14 Feb 2025

https://github.com/f1uctus/webanno2spacy

Convert WebAnno TSVs to spaCy Doc objects.

spacy spacy-extension webanno webanno-tsv

Last synced: 08 Feb 2025

https://github.com/rahul1582/named-entity-recognition

A Keras implementation of a bidirectional LSTM for Named Entity Recognition.

bidirectional-lstm keras named-entity-recognition spacy tensorflow

Last synced: 06 Feb 2025

https://github.com/vuchkov/tdd-python-llp

TDD in Python with Flask/Rest API and ML/LLP covered by Selenium tests - test-driven development, large language processing, machine learning

flask llp ml python rest-api selenium selenium-python spacy tdd tests

Last synced: 06 Feb 2025

https://github.com/abinashsahoo007/project-resume-classification

The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.

corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud

Last synced: 10 Feb 2025

https://github.com/prashver/nlp-driven-video-summarizer-and-insight-tool

An NLP-powered tool for transcribing, summarizing, and indexing podcast content, with video-to-audio conversion and multilingual support.

flask-application huggingface-transformers keyword-extraction named-entity-recognition natural-language-processing ntlk spacy speech-to-text speech-translation text-summarization topic-modeling

Last synced: 10 Feb 2025

https://github.com/wanjage/charles-burney-digital

Digital preparation, enrichment, and geovisualization of a travelogue by the music historian Charles Burney, using Transkribus, spaCy NER, and Nodegoat.

geovisualisierung ner nlp nodegoat reisebericht spacy

Last synced: 10 Feb 2025

https://github.com/atharvapathak/customer_service_chatbot

The Customer Service Chatbot repository includes a range of features for building custom chatbots that can handle customer service queries and support requests. These features include NLP capabilities and pre-built dialog flows that help chatbots understand and respond to customers.

chatbot database dialogflow nlp nltk reinforcement-learning restful-api spacy tensorflow

Last synced: 10 Feb 2025

https://github.com/ahmedkhaled404/ner-with-spacy

Named entity recognition using traditional NLP methods

machine-learning matplotlib ner nlp nlp-machine-learning python spacy

Last synced: 10 Feb 2025

https://github.com/sudeatesoglu/nlp-document-processor

An NLP tool for processing documents in different formats, with similarity score detection, highlighting of a given pattern and of similar words between PDFs, and NER extraction.

nlp spacy text-processing

Last synced: 10 Feb 2025

https://github.com/muhammadshavaiz/ai_learning

Google Colab notebooks showcasing PyTorch implementations and experiments. Covers deep learning techniques, including neural networks and NLP concepts.

deep-learning nlp python pytorch spacy

Last synced: 10 Feb 2025

https://github.com/michaelkinfu/hknews-headline-analysis

The Hong Kong News headline analysis project was conducted by the Chinese University of Hong Kong Library.

beautifulsoup deep-learning digital-scholarship folium historical-newspapers machine-learning spacy yolov5

Last synced: 10 Feb 2025

https://github.com/foxbenjaminfox/simil

CLI for semantic string similarity

glove machine-learning python spacy string-similarity

Last synced: 10 Feb 2025

https://github.com/kivanc57/nlp_data_visualization

This project provides Python scripts for analyzing and visualizing text data using efficient NLP methods. It includes tools for creating bar plots, histograms, pie charts, treemaps, violin plots, and word clouds, using libraries such as matplotlib, seaborn, wordcloud, spacy, and textblob.

data-science matplotlib nlp parsing plotting python spacy visualization

Last synced: 08 Feb 2025

https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review

"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"

matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis

Last synced: 02 Feb 2025

https://github.com/karimosman89/resume-screening

Screen resumes to identify the best candidates. Build a machine learning model that screens resumes and ranks candidates based on job descriptions. Streamline the hiring process for HR departments by automating candidate screening.

machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing

Last synced: 25 Dec 2024

https://github.com/etienne-bobo/information-retreival_project

This project designs a model capable of recognizing terms related to our field, which is: INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES.

information-retrieval nlp prodigy spacy

Last synced: 10 Jan 2025

https://github.com/malcolmgreaves/py_ml_img

A Python 3 image for NLP & ML. Includes spaCy & NLTK model data.

docker-image machine-learning nlp nltk python3 spacy

Last synced: 07 Feb 2025

https://github.com/arnabd64/spacy-ner-hf-space

A webapp built using Gradio for demonstrating the capabilities of the Spacy NER pipeline.

gradio huggingface-spaces named-entity-recognition nlp spacy spacy-pipeline token-classification

Last synced: 08 Feb 2025

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 26 Dec 2024

https://github.com/parthapray/nlp_pipeline_openai

This repo contains nlp pipeline and openai API integration

gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud

Last synced: 26 Dec 2024

https://github.com/rggh/api-4

Using FastAPI with spaCy to identify entities

docker fastapi python spacy

Last synced: 02 Feb 2025

https://github.com/arya-io/ner-entitylinker

A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.

ai disambiguation entityextraction entitylinking machinelearning namedentityrecognition naturallanguageprocessing nlp python spacy streamlit textprocessing wikipediaapi

Last synced: 11 Jan 2025

https://github.com/maxzirps/lyrics-sentiment-analysis

Analyse lyrics for their sentiment score

nlp pandas sentiment-analysis spacy spacy-nlp

Last synced: 12 Jan 2025

https://github.com/asaficontact/stack_classifier_project

We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using regular expressions, we removed HTML tags and punctuation. We also used spaCy to tokenize, lemmatize, and remove stop words. Using Keras, we built a 4-layer artificial neural network with a 20% dropout rate and ReLU and softmax activation functions. With the Adam optimizer and a categorical cross-entropy loss function, the network classified 11 tags 88% successfully.

cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization

Last synced: 14 Feb 2025

https://github.com/aidan-zamfir/the-iliad

Data analysis & relationship network for the characters of Homer's Iliad

data data-analysis dataframes networks networkx python selenium spacy webscraping

Last synced: 12 Jan 2025

https://github.com/ledsouza/nlp-article-classification

This project aims to develop a machine learning model capable of classifying news articles into different categories based on their titles. Two different word embedding models (CBOW and Skip-gram) are trained and used to vectorize the article titles. These vectorized representations are then used to train a Logistic Regression classifier.

gensim-word2vec natural-language-processing nlp nlp-machine-learning pandas python scikit-learn spacy spacy-nlp

Last synced: 30 Jan 2025

https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch

Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch

elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec

Last synced: 14 Feb 2025
