An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with count-vectorizer

A curated list of projects in awesome lists tagged with count-vectorizer.

https://github.com/ksdkamesh99/spam-classifier

A Natural Language Processing project on SMS data that predicts whether a message is spam or ham. It compares the accuracy of several ML algorithms (multinomial naive Bayes, logistic regression, SVM, decision trees) and applies data cleaning and processing techniques such as PorterStemmer, CountVectorizer, TF-IDF Vectorizer, and WordNetLemmatizer. An LSTM with word embeddings reaches an accuracy of 97.84%.

bag-of-words count-vectorizer decision-tree-classifier embeddings logistic-regression lstm-neural-networks multinomial-naive-bayes naive-bayes-classifier porter-stemmer sms-spam-detection support-vector-machines tfidf-vectorizer wordnetlemmatizer

Last synced: 12 May 2025

https://github.com/shaadclt/fake-news-detection-decisiontreeclassifier

This project involves detecting fake news using a decision tree classifier in Jupyter Notebook. Fake news detection is an important task in the field of natural language processing and machine learning, as it helps identify and filter out misleading or false information.

count-vectorizer decision-tree-classifier

Last synced: 09 Apr 2025

https://github.com/bhattbhavesh91/tf-idf-example

A simple scikit-learn-based example demonstrating how TF-IDF works.

count-vectorizer sklearn tf-idf tf-idf-calculation

Last synced: 20 Apr 2026
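
For readers unfamiliar with the TF-IDF weighting that the entry above demonstrates, here is a minimal pure-Python sketch. It is not code from the linked repository. The function name `tfidf` and the toy documents are illustrative assumptions, and the formula used (smoothed IDF followed by L2 normalization) follows the documented defaults of scikit-learn's `TfidfVectorizer`, with simple whitespace tokenization swapped in for its tokenizer.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return (vocabulary, rows) where each row is an L2-normalized TF-IDF vector."""
    # Naive whitespace tokenization (an assumption made for brevity).
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n_docs = len(docs)

    # Document frequency: in how many documents does each term appear?
    df = Counter(tok for toks in tokenized for tok in set(toks))

    # Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1 for t in vocab}

    rows = []
    for toks in tokenized:
        counts = Counter(toks)  # raw term counts, as a count vectorizer would give
        raw = [counts[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(v * v for v in raw)) or 1.0
        rows.append([v / norm for v in raw])
    return vocab, rows


docs = ["the cat sat", "the dog sat", "the dog barked"]
vocab, rows = tfidf(docs)
for doc, row in zip(docs, rows):
    print(doc, [round(v, 3) for v in row])
```

Terms that occur in every document (here "the") receive the lowest weight, while terms unique to one document (such as "cat") receive the highest.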

https://github.com/anshul21107/movie-recommender-system

A movie recommendation system implemented using content-based filtering.

cosine-similarity count-vectorizer machine-learning recommender-system streamlit

Last synced: 17 Apr 2026
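
As a rough illustration of the content-based filtering approach named in the entry above, the sketch below builds count vectors from tag strings and ranks items by cosine similarity. It is not taken from the linked repository. The movie titles, tags, and the helper names `count_vector`, `cosine`, and `recommend` are all hypothetical.

```python
import math
from collections import Counter


def count_vector(text, vocab):
    """Bag-of-words vector: one raw count per vocabulary term."""
    counts = Counter(text.lower().split())
    return [counts[t] for t in vocab]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical catalogue: title -> descriptive tags.
movies = {
    "Space Quest": "space adventure aliens",
    "Galaxy Wars": "space battle aliens",
    "Love Letters": "romance drama paris",
}

vocab = sorted({t for tags in movies.values() for t in tags.split()})
vectors = {title: count_vector(tags, vocab) for title, tags in movies.items()}


def recommend(title, k=1):
    """Return the k titles whose tag vectors are most similar to `title`."""
    scores = [
        (cosine(vectors[title], vec), other)
        for other, vec in vectors.items()
        if other != title
    ]
    scores.sort(reverse=True)
    return [other for _, other in scores[:k]]


print(recommend("Space Quest"))
```

Because similarity is computed only from item descriptions, no user ratings are needed, which is the defining trait of content-based filtering.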

https://github.com/shubhamgoyal575/spam_detective

This project uses machine learning to classify messages as spam or ham based on text analysis. It includes data preprocessing, feature extraction (TF-IDF), and classification models like Logistic Regression and Naive Bayes for accurate spam detection. Built with Python and Scikit-Learn. 🚀

count-vectorizer data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis logistic-regression machine-learning machine-learning-algorithms naive-bayes natural-language-processing spam-detection tfidf-vectorizer

Last synced: 02 Jul 2025

https://github.com/datarohit/imdb-reviews-sentiment-analysis

An NLP sentiment analysis project that uses the Bernoulli Naive Bayes algorithm to predict whether an IMDB movie review is positive or negative.

bernoulli-naive-bayes classification count-vectorizer multinomial-naive-bayes nltk numpy pandas porter-stemmer regex sentiment-analysis sklearn

Last synced: 20 Sep 2025

https://github.com/jeffreywijaya100/youtube-comment-textmining

Scraping at least 100 Indonesian-language YouTube comments related to machine learning.

api-key count-vectorizer machine-learning scraping text-mining tfidf-vectorizer word-cloud youtube-api-v3 youtube-comment-scraper

Last synced: 28 Mar 2025

https://github.com/lorenzorottigni/ml-sentiment-analysis

From Piero Savastano's Machine Learning course (4): CountVectorizer, BernoulliNB, accuracy_score, pandas

accuracy-score count-vectorizer machine-learning movies-reviews pandas sentiment-analysis

Last synced: 01 Aug 2025

https://github.com/justmirr/hate-speech-detection

Built using Python, Streamlit, and NLTK, the Hate Speech Detection App employs a Decision Tree Classifier for identifying hate speech in text. It features real-time speech input, NLP preprocessing, and a user-friendly Streamlit interface, offering both visual and text-to-speech result presentation.

count-vectorizer nltk python sklearn streamlit

Last synced: 14 Feb 2026

https://github.com/abinashsahoo007/project-resume-classification

A document classification solution intended to significantly reduce manual effort in HRM. It aims to achieve a higher level of accuracy and automation with minimal human intervention.

corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud

Last synced: 02 Feb 2026

https://github.com/apfirebolt/spam_email_classifier

An email classifier using CountVectorizer and a Naive Bayes strategy. PyQt5 is used for the GUI.

count-vectorizer naive-bayes-classifier pandas pyqt5 python scikit-learn

Last synced: 27 Mar 2025
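
The combination of word counts and Naive Bayes that recurs in the spam classifiers listed above can be sketched in a few lines of pure Python. This is an illustrative multinomial Naive Bayes with Laplace (add-one) smoothing, not the implementation of any listed project or of scikit-learn. The training messages and the function names `train` and `predict` are assumptions.

```python
import math
from collections import Counter, defaultdict


def train(samples):
    """Collect per-class document counts, per-class word counts, and the vocabulary."""
    class_docs = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in samples:
        tokens = text.lower().split()
        class_docs[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_docs, word_counts, vocab


def predict(model, text):
    """Return the class with the highest log posterior for `text`."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in class_docs:
        # Log prior: fraction of training documents with this label.
        score = math.log(class_docs[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in text.lower().split():
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Laplace-smoothed log likelihood of the word given the class.
            score += math.log(
                (word_counts[label][tok] + 1) / (total_words + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy training set.
samples = [
    ("win free prize now", "spam"),
    ("free cash offer", "spam"),
    ("meeting at noon", "ham"),
    ("lunch at noon tomorrow", "ham"),
]
model = train(samples)
print(predict(model, "free prize"))
print(predict(model, "noon meeting"))
```

Working in log space avoids numeric underflow when many word probabilities are multiplied, and the add-one smoothing keeps a single unseen word from zeroing out a class.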