Projects in Awesome Lists tagged with count-vectorizer
A curated list of projects in awesome lists tagged with count-vectorizer.
https://github.com/ksdkamesh99/spam-classifier
A natural language processing project on SMS data that predicts whether a message is spam or ham. It compares the accuracy of several ML algorithms (multinomial naive Bayes, logistic regression, SVM, decision trees) and applies data cleaning and processing techniques such as PorterStemmer, CountVectorizer, TF-IDF Vectorizer, and WordNetLemmatizer. It is also implemented using an LSTM and word embeddings, reaching an accuracy of 97.84%.
bag-of-words count-vectorizer decision-tree-classifier embeddings logistic-regression lstm-neural-networks multinomial-naive-bayes naive-bayes-classifier porter-stemmer sms-spam-detection support-vector-machines tfidf-vectorizer wordnetlemmatizer
Last synced: 12 May 2025
https://github.com/shaadclt/fake-news-detection-decisiontreeclassifier
This project involves detecting fake news using a decision tree classifier in Jupyter Notebook. Fake news detection is an important task in the field of natural language processing and machine learning, as it helps identify and filter out misleading or false information.
count-vectorizer decision-tree-classifier
Last synced: 09 Apr 2025
https://github.com/bhattbhavesh91/tf-idf-example
A simple scikit-learn-based example demonstrating how TF-IDF works.
count-vectorizer sklearn tf-idf tf-idf-calculation
Last synced: 20 Apr 2026
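For readers unfamiliar with the weighting, here is a minimal sketch of the textbook TF-IDF formula (term frequency times log(N / document frequency)) in plain Python. It is not taken from the repository above, and the sample documents are made up. scikit-learn's TfidfVectorizer uses a smoothed IDF and normalizes each row by default, so its numbers will differ from these.

```python
import math
from collections import Counter

def tokenize(text):
    # Lowercase and split on whitespace; a real tokenizer would do more.
    return text.lower().split()

def tf_idf(docs):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Made-up documents for illustration only.
docs = ["the cat sat", "the dog sat", "the bird flew away"]
weights = tf_idf(docs)
for doc, w in zip(docs, weights):
    print(doc, w)
```

A term that appears in every document ("the" here) gets weight zero, while a term unique to one document ("cat") is weighted highest within it.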
https://github.com/kvarun07/ecom-product-classifier
Text Classification: Predicting product categories from their text descriptions.
count-vectorizer logistic-regression naive-bayes-classifier natural-language-processing neural-network support-vector-machine text-classification tf-idf-vectorizer
Last synced: 10 Apr 2025
https://github.com/anshul21107/movie-recommender-system
A movie recommendation system implemented using content-based filtering.
cosine-similarity count-vectorizer machine-learning recommender-system streamlit
Last synced: 17 Apr 2026
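The tags on this entry (count-vectorizer, cosine-similarity) suggest the usual content-based approach: turn each item's text into word counts and rank other items by cosine similarity. The following is a rough plain-Python sketch of that idea, not the repository's code; the catalog, titles, and the `recommend` helper are hypothetical.

```python
import math
from collections import Counter

def count_vector(text):
    # Bag-of-words counts: token -> number of occurrences.
    return Counter(text.lower().split())

def cosine_similarity(a, b):
    # Dot product over shared terms divided by the product of vector norms.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def recommend(title, catalog, top_n=2):
    """Rank the other titles by similarity of their tag strings."""
    target = count_vector(catalog[title])
    scores = [
        (other, cosine_similarity(target, count_vector(tags)))
        for other, tags in catalog.items() if other != title
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]

# Hypothetical catalog: titles and tag strings are made up for illustration.
catalog = {
    "Movie A": "space adventure robot future",
    "Movie B": "space robot war future",
    "Movie C": "romance comedy paris",
}
print(recommend("Movie A", catalog))
```

Because the vectors hold raw counts, frequent words dominate the score; a real system would typically drop stop words or switch to TF-IDF weights.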
https://github.com/shubhamgoyal575/spam_detective
This project uses machine learning to classify messages as spam or ham based on text analysis. It includes data preprocessing, feature extraction (TF-IDF), and classification models like Logistic Regression and Naive Bayes for accurate spam detection. Built with Python and Scikit-Learn. 🚀
count-vectorizer data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis logistic-regression machine-learning machine-learning-algorithms naive-bayes natural-language-processing spam-detection tfidf-vectorizer
Last synced: 02 Jul 2025
https://github.com/datarohit/imdb-reviews-sentiment-analysis
An NLP sentiment analysis project that uses the Bernoulli naive Bayes algorithm to predict whether an IMDB movie review is positive or negative.
bernoulli-naive-bayes classification count-vectorizer multinomial-naive-bayes nltk numpy pandas porter-stemmer regex sentiment-analysis sklearn
Last synced: 20 Sep 2025
https://github.com/varunkhurana07/ecom-product-classifier
Text Classification: Predicting product categories from their text descriptions.
count-vectorizer logistic-regression naive-bayes-classifier natural-language-processing neural-network support-vector-machine text-classification tf-idf-vectorizer
Last synced: 06 Jul 2025
https://github.com/jeffreywijaya100/youtube-comment-textmining
Scraping at least 100 Indonesian-language YouTube comments related to machine learning.
api-key count-vectorizer machine-learning scraping text-mining tfidf-vectorizer word-cloud youtube-api-v3 youtube-comment-scraper
Last synced: 28 Mar 2025
https://github.com/prneidhardt/natural-language-processing
Twitter US Airline Sentiment
count-vectorizer sentiment-analysis text-processing tf-idf-vectorizer vectorization
Last synced: 03 Feb 2026
https://github.com/lorenzorottigni/ml-sentiment-analysis
Machine Learning course of Piero Savastano 4: CountVectorizer, BernoulliNB, accuracy_score, pandas
accuracy-score count-vectorizer machine-learning movies-reviews pandas sentiment-analysis
Last synced: 01 Aug 2025
https://github.com/justmirr/hate-speech-detection
Built using Python, Streamlit, and NLTK, the Hate Speech Detection App employs a Decision Tree Classifier for identifying hate speech in text. It features real-time speech input, NLP preprocessing, and a user-friendly Streamlit interface, offering both visual and text-to-speech result presentation.
count-vectorizer nltk python sklearn streamlit
Last synced: 14 Feb 2026
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 02 Feb 2026
https://github.com/jeffreywijaya100/ecommerce-product-textmining
Classification modeling using product data from an e-commerce site, following the given requirements.
classification-report count-vectorizer hyperparameter-tuning machine-learning nltk optuna random-forest-classifier svm-classifier text-mining text-representation tfidf-vectorizer
Last synced: 11 Jul 2025
https://github.com/apfirebolt/spam_email_classifier
An email classifier using CountVectorizer and a naive Bayes strategy. PyQt5 is used for the GUI.
count-vectorizer naive-bayes-classifier pandas pyqt5 python scikit-learn
Last synced: 27 Mar 2025
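As an illustration of how word counts feed a naive Bayes classifier, here is a small plain-Python sketch of multinomial naive Bayes with add-one smoothing. It is an assumption about the general technique, not the repository's implementation; the `train` and `predict` helpers and the training messages are made up.

```python
import math
from collections import Counter, defaultdict

def tokenize(text):
    return text.lower().split()

def train(messages, labels):
    """Collect per-class word counts and class frequencies."""
    word_counts = defaultdict(Counter)
    class_counts = Counter(labels)
    for text, label in zip(messages, labels):
        word_counts[label].update(tokenize(text))
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, class_counts, vocab

def predict(text, model):
    word_counts, class_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in class_counts:
        # Log prior plus summed log likelihoods with add-one smoothing.
        score = math.log(class_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocab:
                continue  # ignore words never seen in training
            score += math.log(
                (word_counts[label][word] + 1) / (total_words + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Made-up training data for illustration only.
messages = ["win a free prize now", "free cash offer",
            "meeting at noon", "lunch at noon tomorrow"]
labels = ["spam", "spam", "ham", "ham"]
model = train(messages, labels)
print(predict("free prize offer", model))
```

Working in log space avoids numeric underflow when many small probabilities are multiplied, and the smoothing keeps a word unseen in one class from zeroing out that class entirely.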