Projects in Awesome Lists tagged with text-vectorization
A curated list of projects in awesome lists tagged with text-vectorization .
https://github.com/contextlab/hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
data-visualization data-wrangling high-dimensional-data python text-vectorization time-series topic-modeling visualization
Last synced: 29 Jan 2026
https://github.com/ContextLab/hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
data-visualization data-wrangling high-dimensional-data python text-vectorization time-series topic-modeling visualization
Last synced: 07 Apr 2025
https://github.com/mkearney/wactor
Word Factor Vectors
r r-package rstats text text-classification text-processing text-vectorization word-embeddings word-vectors word2vec
Last synced: 18 Oct 2025
https://github.com/amansrivastava17/bns-short-text-similarity
📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
bns bns-vectorizer cosine-similarity nlp short-text-semantic-similarity term-frequency text-classification text-similarity text-vectorization tf-idf
Last synced: 18 Aug 2025
https://github.com/sergio11/headline_generation_lstm_transformers
Explore advanced neural networks for crafting captivating headlines! Compare LSTM 🔄 and Transformer 🔀 models through interactive notebooks 📓 and easy-to-use wrapper classes 🛠️. Ideal for content creators and data enthusiasts aiming to automate and enhance headline generation ✨.
deep-learning lstm lstm-model lstm-neural-networks model-comparison model-training model-training-and-evaluation natural-language-processing text-generation text-vectorization transformers
Last synced: 02 Apr 2025
https://github.com/rosette-api-community/visualize-embeddings
A simple Python script for transforming a corpus of documents into text vectors suitable for visualization
machine-learning natural-language-processing nlp python text-embedding text-vectorization tsv visualization
Last synced: 28 Feb 2025
https://github.com/markiskorova/machine-learning-nlp-predict-author
Machine Learning & Natural Language Processing: Predict the author of literary text snippets. Built with TensorFlow and Keras, this project trains an LSTM model on classic literature to identify writing style and authorship.
keras machine-learning natural-language-processing python tensorflow text-tokenization text-vectorization
Last synced: 23 Jan 2026
https://github.com/vidhi1290/scienceqa-insights-exploring-with-llms
Predictive Text Analysis project! This repository contains code for predicting answers to science exam questions using advanced natural language processing techniques. Check out the code and results!
interactive-visualizations kaggle kaggle-competition machine-learning multi-class-classification nlp nlp-machine-learning predictive-text-analysis random-forest-classifier text-analysis text-vectorization
Last synced: 28 Mar 2025
https://github.com/ganesh2409/course-recommendation-system
🚀 Course Recommendation System is a machine learning-powered web application designed to recommend similar courses from Coursera's vast dataset of over 3,000 courses. Built using Python, Scikit-learn, and Streamlit, the app preprocesses course data, applies text vectorization, and leverages cosine similarity to offer personalized recommendations.
cosine-similarity data-science docker machine-learning nlp python recommendation-system streamlit-webapp text-vectorization
Last synced: 28 Feb 2025
https://github.com/rid17pawar/sentiment-analysis-model-experiments
Experiments in the field of Sentiment Analysis using ML Algorithms namely Logistic Regression, Naive Bayes along with tfidf, one hot encoding, bag of words vectorization. Different MLP and RNN models viz. LSTM, GRU, Bidirectional LSTM. Lastly, state of the art BERT model
bag-of-words bert bidirectional-lstm gru logistic-regression lstm ml-algorithms naive-bayes neural-networks one-hot-encoding rnn sentiment-analysis sentiment-classification text-vectorization tfidf tfidf-vectorizer transformer-architecture twitter-sentiment-analysis
Last synced: 12 Dec 2025
https://github.com/avd1729/movie-reviews-classification
IMDB movie review classification using neural network (text-vectorization v/s word-embeddings)
nlp text-vectorization word-embeddings
Last synced: 17 Jan 2026
https://github.com/lorenzorottigni/ml-text-vectorization
Machine Learning course of Piero Savastano 3: CountVectorizer
counter-vectorizer machine-learning text-vectorization
Last synced: 19 Jun 2025
https://github.com/samp1012/email_sms_spam_detector
An Email/SMS spam classifier that aims to identify and distinguish between spam and non-spam messages.
multinomial-naive-bayes naive-bayes-classifier natural-language-processing numpy pandas python scikit-learn spam-detection text-vectorization tokenization
Last synced: 30 Dec 2025
https://github.com/sayamalt/e-commerce-text-classification
Successfully established a machine learning model that can accurately classify an e-commerce product into one of four categories, namely "Books", "Clothing & Accessories", "Household" and "Electronics", based on the product's description.
categorical-encoding cross-validation exploratory-data-analysis hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation text-classification text-preprocessing text-vectorization
Last synced: 09 Nov 2025
https://github.com/vlada-pv/prediction-sociolinguistic-data-based-on-the-diaries-texts-of-the-prozhito-project
The repository contains notebooks created for collecting and preprocessing the corpus of diary entries and for experiments on creating models for predicting gender, age groups of authors and the time period of text creation.
author-profiling bag-of-words bilstm convol convolutional-neural-networks deep-learning diary-entries logistic-regression naive-bayes-classifier neural-networks recurrent-neural-networks sociolinguistics text-preprocessing text-vectorization tf-idf-vectorizer word-embeddings
Last synced: 13 Jul 2025
https://github.com/lkethridge/machine_learning_for_texts
A Machine Learning Project using Texts from TripleTen
bag-of-words bert embeddings language-representations lemmatization machine-learning-for-text-classification n-grams regular-expressions sentiment-analysis text-vectorization tf-idf word-embeddings word2vec
Last synced: 24 Jun 2025