An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with countvectorizer

A curated list of projects in awesome lists tagged with countvectorizer.

https://github.com/avannaldas/emailsclassification

Classification of emails received on a mass distribution group

countvectorizer email-classifier scikit-learn sklearn text-classification tfidf

Last synced: 01 Jul 2025

https://github.com/sanjaikumar-28/sentimental-analysis

This repository explores the world of restaurant reviews, using Support Vector Machines (SVM) and CountVectorizer to predict the sentiment (positive or negative) expressed in each review. By analyzing textual data, we aim to provide valuable insights for restaurants and improve the overall customer experience.

countvectorizer jupyter-notebook machine-learning pipeline python sentiment-analysis support-vector-machines

Last synced: 07 May 2025

https://github.com/geekquad/text-learning

Basic usage of NLTK. Implements concepts such as stemming, TF-IDF, and text.CountVectorizer.

corpus countvectorizer nltk sklearn stopwords tfidf

Last synced: 24 Feb 2025

https://github.com/elifftosunn/bert-bank-model

A Turkish BERT-based model that analyzes people's bank complaints and classifies each one into one of eight categories.

countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer

Last synced: 15 Mar 2025

https://github.com/redinfinitypro/redditsuggestor

Rating: (6/10) The project uses Python libraries and APIs to analyze Reddit data, predict user input, suggest new titles based on cosine similarity, calculate combined scores, and output the best suggestion.

countvectorizer difflib filedialog multinomialnb openpyxl pandas praw random scikit threading tkinter ttk

Last synced: 06 Apr 2025

https://github.com/akashkg03/spam-email-classification

This notebook builds a spam email classifier using Naive Bayes, with CountVectorizer as the feature extraction technique.

classification countvectorizer jupiter-notebook naive-bayes-classifier pandas python

Last synced: 25 Feb 2025

https://github.com/RedInfinityPro/RedditSuggestor

Rating: (6/10) The project uses Python libraries and APIs to analyze Reddit data, predict user input, suggest new titles based on cosine similarity, calculate combined scores, and output the best suggestion.

countvectorizer difflib filedialog multinomialnb openpyxl pandas praw random scikit threading tkinter ttk

Last synced: 30 Sep 2025

https://github.com/sayamalt/text-similarity-quantifier

A machine learning model that computes a similarity score between two text paragraphs taken as input from a webpage.

bag-of-words cosine-similarity cosine-similarity-scores countvectorizer flask machine-learning nlp pandas python text-preprocessing tfidf

Last synced: 09 Nov 2025

https://github.com/somjit101/nlp-stackeroverflow-tag-prediction

A multi-class classification problem where the objective is to read a question posted on the popular reference website StackOverflow and predict the primary topics it deals with, i.e. the tags the question will be associated with.

bag-of-words countvectorizer logistic-regression multi-class-classification multiclass-logistic-regression natural-language-processing nlp one-vs-rest onevsrestclassifier stackoverflow-tags stemming text-mining tf-idf tfidf-vectorizer word-cloud

Last synced: 06 Mar 2025

https://github.com/proxzima/twitter-sentiment-analysis

Sentiment analysis of tweets using vaderSentiment, CountVectorizer and KMeans

countvectorizer kmeans nlp sentiment-analysis twitter vadersentiment

Last synced: 22 Mar 2025

https://github.com/nazmulhasannihal/fake-news-detection-naive-bayes-and-linearsvc-approaches

An end-to-end data science project: load, clean, and analyze news articles, then train and test text classification models using both LinearSVC and Naive Bayes. Evaluate results, interpret key features, and gain actionable insights. The project includes clear code, exploratory data analysis (EDA), and practical suggestions for further improvement.

countvectorizer jupyter-notebook linearsvc navie-bayes-algorithm python tfidfvectorization

Last synced: 27 Jun 2025

https://github.com/dhruvv1402/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf

Last synced: 31 Mar 2025

https://github.com/elmezianech/classifyreviews_nlp

Revolutionize customer feedback analysis with our NLP Insights Analyzer. Utilize cutting-edge text preprocessing to transform raw reviews into a machine-friendly format. Explore sentiment models, such as Logistic Regression and Naive Bayes, employing cross-validation for model robustness.

accuracy-score ai countvectorizer cross-validation joblib logistic-regression machine-learning ml multinomialnb naive-bayes-classifier nltk-library numpy pandas randomforestclassifier svc

Last synced: 05 Dec 2025

https://github.com/sudarshanc00/movie-recommendation-system

This project is a content-based movie recommendation system built using Python and Streamlit. The system suggests movies similar to a user-selected movie by analyzing plot descriptions and using machine learning techniques like cosine similarity.

cosine-similarity countvectorizer nltk-python python recommender-system

Last synced: 03 Apr 2025

https://github.com/parag000/content-based-movie-recommender

This project builds a content-based movie recommendation system using the TMDB dataset. By combining metadata features like cast, genres, and directors into a "metadata soup," it calculates movie similarity with vectorizers (Count) and cosine similarity. Ideal for learning content-based filtering and text vectorization techniques.

cosine-similarity countvectorizer recommendation-system scikit-learn tfidf-vectorizer vectorization

Last synced: 12 Jul 2025

https://github.com/jasmeetsingh7314/recommendation-system

A recommendation system for games, written in Python in a Jupyter notebook.

cosine-similarity countvectorizer machine-learning python

Last synced: 20 Jul 2025

https://github.com/elmezianech/email-spam-ham-classifier-nb-simple

Count Vectorizer Naive Bayes Email Classifier: This Python project utilizes a simple Naive Bayes approach with Count Vectorizer to classify emails as spam or ham. The implementation focuses on word frequency for classification.

countvectorizer emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 31 Dec 2025

https://github.com/bk0712/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

accuracy-metrics bernoulli-naive-bayes countvectorizer dataset faker heroku kaggle machine-learning matplotlib nltk numpy python pytorch tf-idf

Last synced: 26 Jun 2025
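
Several of the projects listed above pair count vectors with cosine similarity to compare texts. The following is a minimal pure-Python sketch of that idea, not code from any listed repository and not scikit-learn's CountVectorizer implementation. The function names (tokenize, count_vectors, cosine_similarity), the tokenizer regex, and the sample documents are all assumptions made for illustration; real tokenizers may treat short words and punctuation differently.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def count_vectors(documents):
    """Return (vocabulary, vectors) with one term-count vector per document."""
    counts = [Counter(tokenize(doc)) for doc in documents]
    # The vocabulary is the sorted set of all terms seen in any document.
    vocabulary = sorted(set().union(*counts))
    # Counter returns 0 for terms a document does not contain.
    vectors = [[c[term] for term in vocabulary] for c in counts]
    return vocabulary, vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length count vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical sample documents, invented for this sketch.
docs = [
    "space adventure with a robot crew",
    "robot crew on a space adventure",
    "romantic comedy in Paris",
]
vocabulary, vectors = count_vectors(docs)

if __name__ == "__main__":
    print(vocabulary)
    print(cosine_similarity(vectors[0], vectors[1]))
    print(cosine_similarity(vectors[0], vectors[2]))
```

Because count vectors ignore word order, the first two sample documents score high even though their words are arranged differently, while the third shares no terms with the first and scores zero.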