Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

spaCy

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

https://github.com/lilivalgo/analisis_reportes_onu_cambio_climatico

Web Scraping, manipulación de files.PDF, NPL con SpaCy

beautifulsoup4 pandas pypdf2 python requests spacy wordcloud

Last synced: 02 Feb 2025

https://github.com/tomhalloin/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 16 Nov 2024

https://github.com/bghorvath/TextMiningTheBechdelTest

Text mining movie scripts to explore long-term trend of female representation in movies according to the Bechdel test

bechdel bechdel-test coreference-resolution neuralcoref spacy

Last synced: 16 Nov 2024

https://github.com/ccoreilly/spacy-catala-generator

Training and dataset used for the catalan spacy model

catala catalan catalan-language spacy spacy-models

Last synced: 17 Dec 2024

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 14 Oct 2024

https://github.com/araobp/bach-network

J. S. Bach's network with spaCy(NLP)

graphology spacy visjs

Last synced: 17 Jan 2025

https://github.com/gugarosa/brainy

🧠 An intelligent Python-inspired Machine Learning API for training NLP-based models.

api machine-learning nlp python spacy

Last synced: 02 Feb 2025

https://github.com/randika00/ism-web-automation-y23cp-web

Web scraping refers to the extraction of data from a website. Be it a spreadsheet or an API.

2captcha-api beautifulsoup regex scrapy selenium spacy webdriver

Last synced: 03 Feb 2025

https://github.com/fferegrino/zeldakg

A TLOZ inspired knowledge graph

infobox knowledge-graph nltk pandas python spacy wikidata

Last synced: 15 Dec 2024

https://github.com/jtlicardo/process-visualizer-web

Web interface for the process-visualizer project

bert bpmn nlp openai spacy

Last synced: 15 Nov 2024

https://github.com/codebasics/ner-resume-parser

A tutorial for NER Resume Parser to get the keywords out of a resume.

mlflow mlflow-tracking nlp python spacy spacy-models spacy-nlp

Last synced: 16 Jan 2025

https://github.com/jash271/youglance

Package for analyzing Youtube Videos from searching by relevant entities to analyzing sentiments and clustering different parts of the video according to your liking

cosine-similarity named-entity-recognition ner nlp nltk python sentiment-analysis spacy tfidf topic-modeling

Last synced: 27 Jan 2025

https://github.com/oroszgy/spacy-tokenizer-benchmark

Quick and dirty scripts to measure the performance of spaCy

benchmark natural-language-processing nlp python spacy tokenizer

Last synced: 03 Feb 2025

https://github.com/srstevenson/keyword-extractor

Extract keywords from plain text documents

nlp spacy tf-idf

Last synced: 20 Nov 2024

https://github.com/debugger404/multilanguage-pos

Named Entity Recognition with SpaCy - 🌐📝 Repository for NER using SpaCy's MultiLanguage module. Supports multiple languages.

multilanguage named-entity-recognition ner python3 spacy

Last synced: 22 Dec 2024

https://github.com/ivangael/nlp-chatbot-api

A NLP project leveraging NLTK for extracting weather data.

flask nlp-api nlp-chatbot nltk python spacy transformers

Last synced: 31 Oct 2024

https://github.com/oroszgy/mltools

Common utility methods and classes to ease the work with sklearn, spacy, pandas, matplotlib

data-science machine-learning nlp pandas sklearn sklearn-compatible spacy tools

Last synced: 03 Feb 2025

https://github.com/oroszgy/cookiecutter-ml-flask

Cookiecutter template for training and serving machine learning models with scikit-learn, spacy, Flask and Docker

docker flask flask-application machine-learning nlp rest-api scikit-learn spacy

Last synced: 03 Feb 2025

https://github.com/ahmedabdalkreem/grammer-auto-correct

In this project work to make classification between the phase is correct or wrong if phase is right print the correct phase if phase is wrong be input of Transfer Learning and print the phase begore correct.

decision-trees logistic-regression machine-learning matplotlib-pyplot naive-bayes-classifier nlp nltk-library pandas-library python random-forest sklearn spacy svm-model transfer-learning

Last synced: 16 Jan 2025

https://github.com/woranov/spacy-lazy-docbin

Lazy-loadable and indexable spaCy DocBins

spacy spacy-extension

Last synced: 18 Jan 2025

https://github.com/centrefordigitalhumanities/textminer

A script to detect named entities and store them in an Elasticsearch annotated_text field

annotation elasticsearch ner spacy

Last synced: 25 Dec 2024

https://github.com/rahul1582/text-summarisation-using-spacy

A Text Summarizer deployed to Heroku

heroku nlp spacy text-summarisation

Last synced: 13 Dec 2024

https://github.com/simeonhristov99/ati

Ati is a web-based application for predicting which famous classic Bulgarian novelist wrote a piece of text (short or long).

authorship-attribution embeddings jupyter-notebook multiclass-classification nlp optuna pycaret python3 scraping-websites spacy transformer

Last synced: 13 Jan 2025

https://github.com/veldhub/veld_data__akp_ner_linkedcat

data veld containg machine inferenced named entities and context data.

nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/veldhub/veld_code__spacy

Code velds encapsulating usage of spaCy.

nlp spacy spacy-nlp

Last synced: 21 Jan 2025

https://github.com/ntinouldinho/machine-learning-classification-and-speech-generation

Explored Greek Parliament Proceedings and tried to classify each speech to a corresponding parliamentary political party.

artificial-intelligence classification-machine-learning machine-learning neural-networks pandas python sklearn spacy

Last synced: 03 Feb 2025

https://github.com/403errors/ai-docparser

An application framework developed using the latest AI technologies to extract the values of specific pre-defined keys from a given PDF document. Also generating a document summary using the key & values extracted in the while doing so.

automation csv-export nlp pdf-files python3 regex reinforcement-learning spacy

Last synced: 21 Jan 2025

https://github.com/thekartikeyamishra/resumeevaluatorapp

The Automated Resume Evaluator is a Python-based application that helps evaluate resumes against job descriptions. It calculates an Applicant Tracking System (ATS) score, which is the percentage of keywords from the job description found in the resume.

flask machine-learning matplotlib nlp nltk pypdf python scikit-learn spacy textblob

Last synced: 03 Feb 2025

https://github.com/naveen3830/splashtop_analysis

This repository contains the code for my webapp splashtop website analysis.

nlp-keywords-extraction python spacy streamlit

Last synced: 07 Dec 2024

https://github.com/bglid/job-application-helper

Project to incorporate web scraping of job applications and then analyze them using NLP methods.

nlp spacy streamlit text-processing webscraping

Last synced: 07 Dec 2024

https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries

A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.

clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization

Last synced: 21 Dec 2024

https://github.com/luis54929/oscarbot

OscarBot: Chatbot de IA personalizado para el área de tecnología del Banco de Occidente. Asistente inteligente para procesos internos y consultas hacia tecnología..

ai banco-de-occidente banking banking-applications chatbot chatterbot machine-learning nlp python3 spacy

Last synced: 21 Dec 2024

https://github.com/jamnicki/bachelor_thesis_project

System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning

active-learning active-learning-in-nlp annotation-tool argilla kpwr named-entity-recognition nlp optimization sampling-methods sequence-labeling sequential-data spacy

Last synced: 21 Dec 2024

https://github.com/imvladikon/quora-question-pair

duplicates detection experiments on Quora Question Pairs (QQP)

fasttext nlp paraphrase spacy

Last synced: 02 Jan 2025

https://github.com/veldhub/veld_chain__apis_ner_transform_to_gold

Chain velds encapsulating extraction and conversion of gold data.

named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/jonas-jonas/text_mining

Sentiment Analysis using spaCy

jupyter-notebook nlp sentiment-analysis spacy

Last synced: 20 Dec 2024

https://github.com/veldhub/veld_chain__train_spacy_apis_ner

Chain velds encapsulating a spacy NER training setup on APIS data.

named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/veldhub/veld_chain__apis_ner_evaluate_old_models

Chain velds encapsulating evalution of old spacy models.

named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner

Last synced: 21 Jan 2025

https://github.com/zofiaqlt/nlp_libraries_tweets_analysis

🎯 Exploration of NLP libraries (nltk, spacy) and tweets analysis - use of Python and JupyterLab (Data collection, Cleaning, EDA, Classification, and Data Visualization)

nlp nltk python spacy

Last synced: 12 Jan 2025

https://github.com/veldhub/veld_chain__mara_load_and_publish_models

Chain velds for publishing self-trained MARA models to huggingface.

nlp spacy spacy-nlp

Last synced: 21 Jan 2025

https://github.com/sohaamir/website_projects

Doing some analytics (scraping, app development) on my GitHub website

nltk requests scrapy spacy streamlit

Last synced: 21 Dec 2024

https://github.com/kr1shnasomani/summarai

Text summarizer using NLP (Extractive Summarization & Abstractive Summarization)

natural-language-processing pytextrank pytorch spacy transformers

Last synced: 21 Dec 2024

https://github.com/tbarlow12/wiki-answer

I wanted to create a question answerer for Wikipedia articles. This project reads articles and responds to a set of questions, getting 31% accuracy on the test set of questions

nlp python question-answering spacy wikipedia

Last synced: 02 Feb 2025

https://github.com/lfoppiano/docker-image-spacy

Docker image for shipping spacy

docker image spacy

Last synced: 18 Dec 2024

https://github.com/xettrisomeman/speechandtext

Practicing NLP using spacy and Sklearn

nlp sklearn spacy

Last synced: 02 Jan 2025

https://github.com/angelospanag/kleio-bot

A bot that aggregates the last 50 tweets of each political party currently in the Greek parliament and creates a word cloud for each daily

bot nlp python spacy twitter

Last synced: 03 Jan 2025

https://github.com/isabelleysseric/question-answering

Building a Natural Language Question & Answer Search Engine with corpus in Python language.

corpus deep-learning nlp qa question-answering spacy whoosh

Last synced: 30 Dec 2024

https://github.com/rtmigo/spacy_installer_py

Installing and removing spaCy language models from Python code, without using the command line

install nlp pip python spacy uninstall

Last synced: 21 Jan 2025

https://github.com/yashaswini-lankalapalli/text-summarization

Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.

nlp python spacy transformers

Last synced: 12 Oct 2024

https://github.com/d5555/textcat_dataset_imdb

Movie Review Dataset for binary sentiment classification

categories dataset spacy textcat textcategorizer

Last synced: 02 Jan 2025

https://github.com/amishra15/text-summarizer-using-nlp-and-tf-idf-methods

Text-Summarizer-Using-NLP-and-TF-IDF-Methods

nlp spacy text-summarization

Last synced: 04 Feb 2025

https://github.com/aranzadata/moviereviewclassifier

Modelo de análisis de sentimientos basado en BERT para 45,000 reseñas de películas, logrando una puntuación F1 de 0.88 al aprovechar técnicas avanzadas de preprocesamiento de texto con NLTK y SpaCy

bert-embeddings nltk spacy

Last synced: 04 Feb 2025

https://github.com/emmy-bradfield/hilly_xmas

A simple ChatBot built using openAI's davinci 003 as a gift for a dear friend of ours

machine-learning natural-language-processing openai python spacy

Last synced: 21 Jan 2025

https://github.com/aydan-moon/news_headlines_ner

Named Entity Recognition (NER) model for analyzing entities in news headlines using spaCy and trained on the CoNLL-2003 dataset.

conll-2003 ner nlp python spacy

Last synced: 21 Jan 2025

https://github.com/paulo-santos-ds/analise_de_sentimentos_em_criticas_de_filmes

Este projeto visa desenvolver um sistema para filtrar e categorizar resenhas de filmes

lgbm math matplotlib nltk pandas python re sklearn spacy torch

Last synced: 21 Jan 2025

https://github.com/yathartharora/twitter_bot

A twitter bot using tweepy API and phrasematching

nlp phrase-extraction spacy spacy-nlp twitter twitter-api twitter-bot

Last synced: 07 Jan 2025

https://github.com/praadnya/govt-circular-analysis

Uses OCR and NER techniques for parsing Goverment Circulars

annotations graphdb ner ocr spacy

Last synced: 07 Jan 2025

https://github.com/samarthhchinivar/nlp-codebasics-playlist

This is a GitHub repository containing Jupyter notebooks and Python scripts related to natural language processing (NLP) concepts and techniques covered in the "NLP with Python" playlist by Codebasics YouTube channel. The notebooks cover topics such as text preprocessing, feature extraction using Python libraries NLTK, SpaCy

nlp-machine-learning nltk python3 spacy

Last synced: 06 Jan 2025

https://github.com/stephenombuya/ai-powered-writing-assistant

An advanced writing assistant that helps users improve their writing through grammar checking, style analysis, and intelligent suggestions.

flask-application pytest python3 spacy sqlalchemy sqlite3 textblob-sentiment-analysis writing-assistant

Last synced: 09 Jan 2025

https://github.com/philippeitis/nlp_specifier

Formal verification for natural language software documentation

natural-language-processing nlp spacy

Last synced: 21 Jan 2025

https://github.com/thekartikeyamishra/documentsummarizer

The Document Summarizer is a Python-based application that extracts summaries from uploaded text and PDF documents using Natural Language Processing (NLP) techniques. This project includes a basic GUI to interact with the application, upload documents, and view the summarized content.

machine-learning nlp nlp-machine-learning pdfplumber python spacy tkinter tkinter-gui

Last synced: 02 Feb 2025

https://github.com/arkadiuszkaros/nlp-book-pos-extractor

This project focuses on extracting sentences from the text of two popular book series: Harry Potter and Game of Thrones. Using Natural Language Processing (NLP) techniques powered by spaCy, the project aims to identify and analyze the parts of speech (POS) for each word in a sentence.

extractor nlp part-of-speech-tagging python spacy

Last synced: 02 Feb 2025

https://github.com/ashenoooone/semantic-book-analyzer

Веб-сервис для извлечения ключевых слов из введения книг по дискретной математике в формате PDF. Фронтенд: React.js, Webpack, FSD, RTK, TypeScript. Бэкенд: FastAPI, FastAPI Users, SQLAlchemy, Pydantic, Pymorphy3, Spacy. Включает авторизацию, регистрацию и историю запросов. 📚🔍

fastapi fastapi-users nlp pymorphy2 pymorphy3 python3 reactjs rtk rtkquery spacy spacy-nlp sqlalchemy typescript

Last synced: 15 Jan 2025

https://github.com/mugambi645/basic-spacy-nlp

Basic NLP with spacy

nlp spacy

Last synced: 16 Jan 2025

https://github.com/izuna385/arxiv-checker-backend

This is an API and backend modules to return accepted papers related to natural language processing from arxiv.

docker fastapi natural-language-processing pytest spacy tdd tdd-python

Last synced: 02 Feb 2025

https://github.com/blue-codes-yep/AI.AT

AI-Powered Text-To-Speech Script Generator This web application uses AI to generate captivating and informative video scripts based on user inputs. It is still under development, but it has the potential to be a useful tool.

ai automation chatbot flask langchain-python llm nlp python3 react reactjs spacy spacy-nlp

Last synced: 06 Jan 2025

https://github.com/bonysmoke/speliuk

A more accurate spelling correction for the Ukrainian language.

correction kenlm spacy spelling symspell ukrainian

Last synced: 10 Oct 2024

https://github.com/rfdzan/summarize-search-result

extractive text summarization with a handful of different libraries

natural-language-processing python spacy

Last synced: 28 Dec 2024

https://github.com/mugambi645/spacy-text-classification

Text classification with spacy

machine-learning nlp spacy

Last synced: 11 Nov 2024

https://github.com/elbersb/depdistance

Calculation of dependency distance

conll conll-u spacy udpipe

Last synced: 02 Feb 2025

https://github.com/zackakil/nlp-using-word-vectors

Code resources for Central London Data Science Project Nights meetup on word vectors

machine-learning natural-language-processing nlp python spacy word-embeddings word-vectors

Last synced: 13 Nov 2024

https://github.com/salma-4/nlp-task

Preprocessing using NLTK ,SPACY

nltk-library python spacy svm-model

Last synced: 22 Jan 2025

https://github.com/iv4n-ga6l/nlp-chatbot-api

A NLP project leveraging NLTK for extracting weather data.

flask nlp-api nlp-chatbot nltk python spacy transformers

Last synced: 25 Jan 2025

https://github.com/pythonicforge/e.c.h.o-mini

A miniature model of ECHO intended for my portfolio

ai express javascript nltk python spacy

Last synced: 22 Jan 2025

https://github.com/imvladikon/spacy-trankit

💥 Trankit models directly in spaCy💥

nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit

Last synced: 28 Jan 2025

https://github.com/asaficontact/stack_classifier_project

We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using Regular Expressions, we removed HTML tags and punctuation. We also utilized spaCy to tokenize, lemmatize and remove stop words. Using Keras, we built a 4 layered artificial neural network with a 20% dropout rate using relu and softmax activation functions. We also utilized the adam optimizer and categorical cross-entropy loss function which classified 11 tags 88% successfully.

cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization

Last synced: 22 Dec 2024

https://github.com/an0nx/ner-model-for-company-names-gost-and-product-names-detect

Модель NER для определения названий компаний, стандартов госта и названий товаров с точностью 97%

named-entity-recognition ner python spacy spacy-models

Last synced: 09 Oct 2024

spaCy Awesome Lists
spaCy Categories