An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with text-vectorization

A curated list of projects in awesome lists tagged with text-vectorization .

https://github.com/contextlab/hypertools

A Python toolbox for gaining geometric insights into high-dimensional data

data-visualization data-wrangling high-dimensional-data python text-vectorization time-series topic-modeling visualization

Last synced: 29 Jan 2026

https://github.com/ContextLab/hypertools

A Python toolbox for gaining geometric insights into high-dimensional data

data-visualization data-wrangling high-dimensional-data python text-vectorization time-series topic-modeling visualization

Last synced: 07 Apr 2025

https://github.com/amansrivastava17/bns-short-text-similarity

📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.

bns bns-vectorizer cosine-similarity nlp short-text-semantic-similarity term-frequency text-classification text-similarity text-vectorization tf-idf

Last synced: 18 Aug 2025

https://github.com/sergio11/headline_generation_lstm_transformers

Explore advanced neural networks for crafting captivating headlines! Compare LSTM 🔄 and Transformer 🔀 models through interactive notebooks 📓 and easy-to-use wrapper classes 🛠️. Ideal for content creators and data enthusiasts aiming to automate and enhance headline generation ✨.

deep-learning lstm lstm-model lstm-neural-networks model-comparison model-training model-training-and-evaluation natural-language-processing text-generation text-vectorization transformers

Last synced: 02 Apr 2025

https://github.com/rosette-api-community/visualize-embeddings

A simple Python script for transforming a corpus of documents into text vectors suitable for visualization

machine-learning natural-language-processing nlp python text-embedding text-vectorization tsv visualization

Last synced: 28 Feb 2025

https://github.com/markiskorova/machine-learning-nlp-predict-author

Machine Learning & Natural Language Processing: Predict the author of literary text snippets. Built with TensorFlow and Keras, this project trains an LSTM model on classic literature to identify writing style and authorship.

keras machine-learning natural-language-processing python tensorflow text-tokenization text-vectorization

Last synced: 23 Jan 2026

https://github.com/vidhi1290/scienceqa-insights-exploring-with-llms

Predictive Text Analysis project! This repository contains code for predicting answers to science exam questions using advanced natural language processing techniques. Check out the code and results!

interactive-visualizations kaggle kaggle-competition machine-learning multi-class-classification nlp nlp-machine-learning predictive-text-analysis random-forest-classifier text-analysis text-vectorization

Last synced: 28 Mar 2025

https://github.com/ganesh2409/course-recommendation-system

🚀 Course Recommendation System is a machine learning-powered web application designed to recommend similar courses from Coursera's vast dataset of over 3,000 courses. Built using Python, Scikit-learn, and Streamlit, the app preprocesses course data, applies text vectorization, and leverages cosine similarity to offer personalized recommendations.

cosine-similarity data-science docker machine-learning nlp python recommendation-system streamlit-webapp text-vectorization

Last synced: 28 Feb 2025

https://github.com/rid17pawar/sentiment-analysis-model-experiments

Experiments in the field of Sentiment Analysis using ML Algorithms namely Logistic Regression, Naive Bayes along with tfidf, one hot encoding, bag of words vectorization. Different MLP and RNN models viz. LSTM, GRU, Bidirectional LSTM. Lastly, state of the art BERT model

bag-of-words bert bidirectional-lstm gru logistic-regression lstm ml-algorithms naive-bayes neural-networks one-hot-encoding rnn sentiment-analysis sentiment-classification text-vectorization tfidf tfidf-vectorizer transformer-architecture twitter-sentiment-analysis

Last synced: 12 Dec 2025

https://github.com/avd1729/movie-reviews-classification

IMDB movie review classification using neural network (text-vectorization v/s word-embeddings)

nlp text-vectorization word-embeddings

Last synced: 17 Jan 2026

https://github.com/lorenzorottigni/ml-text-vectorization

Machine Learning course of Piero Savastano 3: CountVectorizer

counter-vectorizer machine-learning text-vectorization

Last synced: 19 Jun 2025

https://github.com/samp1012/email_sms_spam_detector

An Email/SMS spam classifier that aims to identify and distinguish between spam and non-spam messages.

multinomial-naive-bayes naive-bayes-classifier natural-language-processing numpy pandas python scikit-learn spam-detection text-vectorization tokenization

Last synced: 30 Dec 2025

https://github.com/sayamalt/e-commerce-text-classification

Successfully established a machine learning model that can accurately classify an e-commerce product into one of four categories, namely "Books", "Clothing & Accessories", "Household" and "Electronics", based on the product's description.

categorical-encoding cross-validation exploratory-data-analysis hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation text-classification text-preprocessing text-vectorization

Last synced: 09 Nov 2025

https://github.com/vlada-pv/prediction-sociolinguistic-data-based-on-the-diaries-texts-of-the-prozhito-project

The repository contains notebooks created for collecting and preprocessing the corpus of diary entries and for experiments on creating models for predicting gender, age groups of authors and the time period of text creation.

author-profiling bag-of-words bilstm convol convolutional-neural-networks deep-learning diary-entries logistic-regression naive-bayes-classifier neural-networks recurrent-neural-networks sociolinguistics text-preprocessing text-vectorization tf-idf-vectorizer word-embeddings

Last synced: 13 Jul 2025