Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/henx117/chatbot

My chatbot python project

chatbot python python3 spacy

Last synced: 14 Oct 2024

https://github.com/oroszgy/spacy-tokenizer-benchmark

Quick and dirty scripts to measure the performance of spaCy

benchmark natural-language-processing nlp python spacy tokenizer

Last synced: 19 Oct 2024

https://github.com/omar7tech/text-summarization

This repository explores the process of automatic text summarization using traditional methods and modern NLP models. It includes steps for text cleaning, word frequency analysis, and summarization, along with a comparison of summaries generated by different transformer models.

natural-language-processing python spacy text-summarization tokenization

Last synced: 31 Oct 2024

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 14 Oct 2024

https://github.com/tomhalloin/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 03 Aug 2024

https://github.com/toshimelonhead/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 13 Aug 2024

https://github.com/florensadimer/nlp_ner_soccer_pt-br

Anotação Manual e Comparação com Modelos Treinados

annotation llm machine-learning ner nlp spacy

Last synced: 21 Oct 2024

https://github.com/bghorvath/TextMiningTheBechdelTest

Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test

bechdel bechdel-test coreference-resolution neuralcoref spacy

Last synced: 03 Aug 2024

https://github.com/kailejie/ner

This repository implements Named Entity Recognition (NER) using spaCy, NLTK, and BERT (from the Hugging Face Transformers library). The project runs on a Streamlit web application, allowing users to upload a CSV file containing subject lines to perform NER and visualize the results. It can be run locally or on Google Colab.

bert ner nltk spacy

Last synced: 31 Oct 2024

https://github.com/thyripian/core

This repository contains the Centralized Operational Reporting Engine (CORE), designed for processing diverse datasets and integrating with Elasticsearch, PostgreSQL, and SQLite. It features a React-based UI for interacting with the backend, offering data extraction, processing, and search functionalities.

api csv data-science elasticsearch flask fullstack-development javascript pandas postgresql python react spacy sqlite

Last synced: 26 Oct 2024

https://github.com/srstevenson/keyword-extractor

Extract keywords from plain text documents

nlp spacy tf-idf

Last synced: 04 Aug 2024

https://github.com/miteshgupta07/ats-scoring-system

An ATS (Applicant Tracking System) scoring system that evaluates and ranks resumes based on keyword matching and relevance.

ats ats-system nlp python resume-parser spacy

Last synced: 31 Oct 2024

https://github.com/charlesyuan02/named_entity_recognition

Utilizing Spacy and Tensorflow to train custom Named Entity Recognizers.

conll-2003 named-entity-recognition ner nlp spacy transformer

Last synced: 31 Oct 2024

https://github.com/inshh04/codealpha_chatbotforfaqs_inshanadeem

The FAQ Chatbot is a Python-based conversational agent designed to interact with users and respond to frequently asked questions. It offers a simple and engaging way to provide automated responses, handle polite interactions like thanking the user, and end conversations gracefully. This project serves as a basic template for building more advanced.

chatbot faqbot faqchatbot faqs keyword-extraction nlp nlp-machine-learning progressive-web-app project python python3 pythonprojects spacy spacy-nlp

Last synced: 31 Oct 2024

https://github.com/datarohit/nlp-course-files

The files in this Repo are files for the online NLP-Course from Udemy.com which I completed.

nlp nlp-machine-learning nltk numpy panda python sklearn spacy

Last synced: 05 Nov 2024

https://github.com/tbarlow12/wiki-answer

I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions

nlp python question-answering spacy wikipedia

Last synced: 19 Oct 2024

https://github.com/oroszgy/mltools

Common utility methods and classes to ease the work with sklearn, spacy, pandas, matplotlib

data-science machine-learning nlp pandas sklearn sklearn-compatible spacy tools

Last synced: 19 Oct 2024

https://github.com/randika00/ism-web-automation-y23cp-web

Web scraping refers to the extraction of data from a website. Be it a spreadsheet or an API.

2captcha-api beautifulsoup regex scrapy selenium spacy webdriver

Last synced: 20 Oct 2024

https://github.com/crodriguez1a/kaggle-la-jobs

Helping the City of Los Angeles to structure and analyze its job descriptions

kaggle linguistic-analysis ml nlu python spacy

Last synced: 28 Oct 2024

https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods

Text-Summarizer-Using-NLP-and-TF-IDF-Methods

nlp spacy text-summarization

Last synced: 21 Oct 2024

https://github.com/kavyachouhan/manasvi

An AI-powered chatbot built with Django and spaCy that provides real-time emotional support. Manasvi uses natural language processing (NLP) and sentiment analysis to engage users in meaningful conversations about mental health, offering personalized responses based on emotional tone.

chatbot django machine-learning mental-health mental-health-chatbot nlp python sentiment-analysis spacy text-processing web-app

Last synced: 21 Oct 2024

https://github.com/rafelafrance/angiospermtraiter

Using rule-based parsers to extract information from plant treatments

botany python spacy

Last synced: 21 Oct 2024

https://github.com/nanditha-prabhu/qa-system-via-srl

Question Answering System via Semantic Role Labeling Using Token Classification and Parsing Techniques

bert qa-system spacy srl

Last synced: 31 Oct 2024

https://github.com/ahmedkhaled404/ner-with-spacy

Named entity recognition using traditional NLP methods

machine-learning matplotlib ner nlp nlp-machine-learning python spacy

Last synced: 31 Oct 2024

https://github.com/thjbdvlt/spacy-viceverser

lemmatisation du français avec hunspell et spacy

french hunspell lemmatization nlp nlp-french spacy

Last synced: 31 Oct 2024

https://github.com/kazkozdev/novelgenerator

NovelGenerator - AI-powered fiction book generator that uses Ollama's LLMs to create complete novels with coherent plot structures, developed characters and multiple writing styles.

ai-novels fiction-generator nlp novel-writing ollama python spacy text-generation

Last synced: 02 Nov 2024

https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries

A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.

clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization

Last synced: 03 Nov 2024

https://github.com/sohaamir/website_projects

Doing some analytics (scraping, app development) on my GitHub website

nltk requests scrapy spacy streamlit

Last synced: 03 Nov 2024

https://github.com/imvladikon/spacy-trankit

💥 Trankit models directly in spaCy💥

nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit

Last synced: 14 Oct 2024

https://github.com/lfoppiano/docker-image-spacy

Docker image for shipping spacy

docker image spacy

Last synced: 30 Oct 2024

https://github.com/wanjage/charles-burney-digital

Digitale Aufbereitung, Anreicherung und Geovisualisierung eines Reiseberichts des Musikhistorikers Charles Burney, mithilfe von Transkribus, Spacy-NER und Nodegoat

geovisualisierung ner nlp nodegoat reisebericht spacy

Last synced: 31 Oct 2024

https://github.com/presizhai/rmp-ai-assistant

This project implements a RAG system for a Rate My Professor service, leveraging Pinecone for vector storage and OpenAI for text embeddings. It preprocesses professor reviews using SpaCy for cleaning and sentiment analysis, enabling the AI assistant to provide more nuanced recommendations and insights based on student queries.

generative-ai large-language-model natural-language-processing openai software-development software-engineering spacy

Last synced: 31 Oct 2024

https://github.com/thjbdvlt/litteralement

schéma de base de données postgresql EAV hybride pour l'analyse de textes en français

eav french nlp nlp-french postgresql spacy sql

Last synced: 31 Oct 2024

https://github.com/foxbenjaminfox/simil

CLI for semantic string similarity

glove machine-learning python spacy string-similarity

Last synced: 01 Nov 2024

https://github.com/thjbdvlt/spacy-presque

normalisation de mots (français) pour spacy

french nlp normalization spacy spacy-extensions

Last synced: 31 Oct 2024

https://github.com/trikztr/gptscrape

GPTScrape: A tool for web scraping that uses spaCy for NLP and GPT4All for converting scraped text into structured JSON.

ai data-extraction data-scraping gpt gpt4all llm npl python scraping spacy spacy-nlp web-scraping

Last synced: 31 Oct 2024

https://github.com/asaficontact/stack_classifier_project

We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using Regular Expressions, we removed HTML tags and punctuation. We also utilized spaCy to tokenize, lemmatize and remove stop words. Using Keras, we built a 4 layered artificial neural network with a 20% dropout rate using relu and softmax activation functions. We also utilized the adam optimizer and categorical cross-entropy loss function which classified 11 tags 88% successfully.

cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization

Last synced: 05 Nov 2024

https://github.com/giuliosmall/twitter-trending-topics-pipeline

This project demonstrates trending topic detection using Apache Spark and MinIO. It processes Twitter JSON data with PySpark, leveraging distributed data processing and cloud storage. The entire project is containerized with Docker for easy deployment across architectures.

docker minio nlp pyspark pytest spacy spark streamlit

Last synced: 24 Oct 2024

https://github.com/luis54929/oscarbot

OscarBot: Chatbot de IA personalizado para el área de tecnología del Banco de Occidente. Asistente inteligente para procesos internos y consultas hacia tecnología..

ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy

Last synced: 03 Nov 2024

https://github.com/arnabd64/spacy-ner-hf-space

A webapp built using Gradio for demonstrating the capabilities of the Spacy NER pipeline.

gradio huggingface-spaces named-entity-recognition nlp spacy spacy-pipeline token-classification

Last synced: 09 Oct 2024

https://github.com/pedcapa/nlpower

FastAPI-based service designed to provide real-time text analysis. It leverages some Natural Language Processing (NLP) libraries to offer functionalities such as sentiment analysis, keyword extraction, and text summarization.

fastapi nlp nltk spacy

Last synced: 09 Oct 2024

https://github.com/dmytrovoytko/mlops-spacy-sentiment-analysis

MLOps project Training and Deployment of Spacy model for Sentiment analysis

amazon ml-engineering mlflow mlops nlp prefect sentiment-analysis spacy text-classification

Last synced: 31 Oct 2024

https://github.com/rrayhka/indonesian-ner-spacy

Fine-tuning SpaCy for Indonesian Named Entity Recognition (NER) with custom dataset.

indonesian named-entity-recognition ner nlp spacy

Last synced: 09 Oct 2024

https://github.com/salma-4/nlp-task

Preprocessing using NLTK ,SPACY

nltk-library python spacy svm-model

Last synced: 09 Oct 2024

https://github.com/coueghlani/nlp

Proyecto de Procesamiento de Lenguaje Natural y Análisis de Datos

mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy

Last synced: 31 Oct 2024

https://github.com/jonas-jonas/text_mining

Sentiment Analysis using spaCy

jupyter-notebook nlp sentiment-analysis spacy

Last synced: 02 Nov 2024

https://github.com/meefs/entseeker

entseeker is a command-line tool for Named Entity Recognition (NER) and web entity searches in text files. It uses spaCy's NLP capabilities for standard named entities and custom rules for web-related entities.

ai named-entity-recognition spacy spacy-nlp text-classification text-processing

Last synced: 31 Oct 2024

https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect

Модель NER для определения названий компаний, стандартов госта и названий товаров с точностью 97%

named-entity-recognition ner python spacy spacy-models

Last synced: 09 Oct 2024

https://github.com/bonysmoke/speliuk

A more accurate spelling correction for the Ukrainian language.

correction kenlm spacy spelling symspell ukrainian

Last synced: 10 Oct 2024

https://github.com/sukanyadutta52/sentiment-analysis

An Analysis of How Machine Perceives Women and How Women Feel about Themselves As a Result of This Perception: Sentiment Analysis

flair matplotlib nltk-library pandas regular-expression sentiment-analysis spacy textblob vader-sentiment-analysis women-beauty-standard

Last synced: 17 Oct 2024

https://github.com/philippeitis/nlp_specifier

Formal verification for natural language software documentation

natural-language-processing nlp spacy

Last synced: 12 Oct 2024

https://github.com/xettrisomeman/speechandtext

Practicing NLP using spacy and Sklearn

nlp sklearn spacy

Last synced: 11 Oct 2024

https://github.com/ivangael/nlp-chatbot-api

A NLP project leveraging NLTK for extracting weather data.

flask nlp-api nlp-chatbot nltk python spacy transformers

Last synced: 31 Oct 2024

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 31 Oct 2024

https://github.com/2pa4ul2/mcq-quiz-maker-nlp

Quizzable a quiz generator for short reviews with Spacy and NLTK

flask nlp nltk python question-generation quizapp spacy

Last synced: 09 Oct 2024

https://github.com/woranov/spacy-lazy-docbin

Lazy-loadable and indexable spaCy DocBins

spacy spacy-extension

Last synced: 12 Oct 2024

https://github.com/yashaswini-lankalapalli/text-summarization

Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.

nlp python spacy transformers

Last synced: 12 Oct 2024

https://github.com/manik2000/radiohead-lyrics

NLP analysis of Radiohead's songs lyrics.

embeddings huggingface-transformers nlp spacy

Last synced: 09 Oct 2024

https://github.com/oroszgy/cookiecutter-ml-flask

Cookiecutter template for training and serving machine learning models with scikit-learn, spacy, Flask and Docker

docker flask flask-application machine-learning nlp rest-api scikit-learn spacy

Last synced: 19 Oct 2024

https://github.com/muhammadshavaiz/ai_learning

Google Colab notebooks showcasing PyTorch implementations and experiments. Covers deep learning techniques, including neural networks and NLP concepts.

deep-learning nlp python pytorch spacy

Last synced: 31 Oct 2024

https://github.com/raniasakrr/breakthrough-hire

The project aims to help job seekers understand the essential qualifications required for specific jobs and assess how well their skills match those positions. Additionally, it assists recruiters in improving their resume selection processes by analyzing and comprehending job advertisements.

bert cvanalysis flask ner nlp python scraping sentence-similarity spacy sqlalchemy transformer

Last synced: 09 Oct 2024

https://github.com/viniciusmecosta/cv_classifier

A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.

catboost fastapi python3 sklearn spacy

Last synced: 09 Oct 2024

https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review

"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"

matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis

Last synced: 12 Oct 2024

https://github.com/sydney-informatics-hub/clause-segmenter

A clause segmenting tool utilising Python's SpaCy

nlp python spacy

Last synced: 09 Oct 2024

https://github.com/vanheemstrasystems/spacy

SpaCy - spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.

spacy

Last synced: 12 Oct 2024

https://github.com/f1uctus/webanno2spacy

Convert WebAnno TSVs to spaCy's Doc-s.

spacy spacy-extension webanno webanno-tsv

Last synced: 09 Oct 2024

https://github.com/abinashsahoo007/project-resume-classification

The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.

corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud

Last synced: 31 Oct 2024

https://github.com/lilivalgo/nlp-analysis-of-un-climate-change-reports

This project uses Natural Language Processing (NLP) techniques to analyze large amounts of textual data from UN reports on climate change. By applying NLP, the project aims to extract valuable information that can shed light on critical aspects of climate change

beautifulsoup4 matplotlib pandas pypdf2 seaborn spacy text-analysis text-processing webscraping

Last synced: 12 Oct 2024

https://github.com/adishtienmetz/context-game

A context word guessing game. Try to guess the word in minimum tries!

python3 spacy sqlite3

Last synced: 09 Oct 2024

https://github.com/direct-phonology/phony

phonology in spaCy!

linguistics nlp phonology python spacy

Last synced: 12 Oct 2024

https://github.com/etienne-bobo/information-retreival_project

In this project, it was a question of designing a model capable of recognizing terms related to our field which is: INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES.

information-retrieval nlp prodigy spacy

Last synced: 11 Oct 2024

https://github.com/cano1998/sentiment-analysis-report-for-amazon-product-reviews

Sentiment analysis of Amazon product reviews. The analysis provides insights into customer sentiment and opinions regarding specific products sold on Amazon.

pdf pdf-generation sentiment-analysis spacy text-blob

Last synced: 31 Oct 2024

spaCy Awesome Lists
spaCy Categories