Projects in Awesome Lists tagged with countvectorizer
A curated list of projects in awesome lists tagged with countvectorizer.
https://github.com/avannaldas/emailsclassification
Classification of emails received on a mass distribution group
countvectorizer email-classifier scikit-learn sklearn text-classification tfidf
Last synced: 01 Jul 2025
https://github.com/kool-cool/kool-cool-movie-recommendations-flask
Recommends movies based on movie metadata, using the TMDB Movie Metadata dataset from Kaggle.
countvectorizer datapreprocessing flask machine-learning machine-learning-algorithms moviemetadata movierecommendation movies-recommendation natural-language-processing nlp python reccomendersystem reccommendation tfidfvectorization webapp
Last synced: 31 Dec 2025
https://github.com/tanjib-rafi/spam-sms-detection
ann countvectorizer spam-classification spam-detection spam-sms-detection
Last synced: 10 Sep 2025
https://github.com/sanjaikumar-28/sentimental-analysis
This repository explores the world of restaurant reviews, using Support Vector Machines (SVM) and CountVectorizer to predict the sentiment (positive or negative) expressed in each review. By analyzing textual data, we aim to provide valuable insights for restaurants and improve the overall customer experience.
countvectorizer jupyter-notebook machine-learning pipeline python sentiment-analysis support-vector-machines
Last synced: 07 May 2025
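The combination described in the entry above (CountVectorizer features fed to a Support Vector Machine inside a pipeline) can be sketched roughly as follows. This is a minimal illustration with scikit-learn, not the repository's actual code: the toy reviews, the labels, and the choice of `LinearSVC` as the SVM variant are all assumptions made for the example.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy reviews, invented for illustration only.
train_reviews = [
    "great food and friendly service",
    "loved the delicious pasta",
    "amazing experience great staff",
    "terrible food and rude service",
    "hated the cold pasta",
    "awful experience rude staff",
]
train_labels = ["positive"] * 3 + ["negative"] * 3

# CountVectorizer turns each review into a vector of word counts;
# LinearSVC then fits a linear SVM on those count vectors.
model = make_pipeline(CountVectorizer(), LinearSVC(random_state=0))
model.fit(train_reviews, train_labels)

new_reviews = ["great delicious food", "rude staff and awful service"]
predictions = [str(label) for label in model.predict(new_reviews)]
print(predictions)
```

Wrapping both steps in one pipeline means the vocabulary learned during `fit` is reused automatically when predicting on new text.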
https://github.com/5hraddha/sentiment-analysis
An innovative system for filtering and categorizing movie reviews
countvectorizer dummyclassifier lgbmclassifier logisticregression matplotlib minmaxscaler nltk nltk-stopwords nltk-tokenizer numpy pandas seaborn spacy tfidfvectorizer torch tqdm transformers
Last synced: 18 Jan 2026
https://github.com/geekquad/text-learning
Basic usage of NLTK, with implementations of concepts such as stemming, TF-IDF, and text.CountVectorizer.
corpus countvectorizer nltk sklearn stopwords tfidf
Last synced: 24 Feb 2025
https://github.com/elifftosunn/bert-bank-model
A Turkish BERT-based model that analyzes customers' bank complaints and classifies each one into one of eight categories.
countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer
Last synced: 15 Mar 2025
https://github.com/redinfinitypro/redditsuggestor
Rating: (6/10) The project uses Python libraries and APIs to analyze Reddit data, predict user input, suggest new titles based on cosine similarity, calculate combined scores, and output the best suggestion.
countvectorizer difflib filedialog multinomialnb openpyxl pandas praw random scikit threading tkinter ttk
Last synced: 06 Apr 2025
https://github.com/akashkg03/spam-email-classification
This notebook builds a spam email classifier using Naive Bayes, with CountVectorizer for feature extraction.
classification countvectorizer jupiter-notebook naive-bayes-classifier pandas python
Last synced: 25 Feb 2025
https://github.com/sayamalt/text-similarity-quantifier
A machine learning model that computes the similarity score between two text paragraphs entered on a webpage.
bag-of-words cosine-similarity cosine-similarity-scores countvectorizer flask machine-learning nlp pandas python text-preprocessing tfidf
Last synced: 09 Nov 2025
https://github.com/somjit101/nlp-stackeroverflow-tag-prediction
A multi-class classification problem where the objective is to read a question posted on the popular reference website StackOverflow and predict the primary topics it deals with, i.e., the tags the question will be associated with.
bag-of-words countvectorizer logistic-regression multi-class-classification multiclass-logistic-regression natural-language-processing nlp one-vs-rest onevsrestclassifier stackoverflow-tags stemming text-mining tf-idf tfidf-vectorizer word-cloud
Last synced: 06 Mar 2025
https://github.com/proxzima/twitter-sentiment-analysis
Sentiment analysis of tweets using vaderSentiment, CountVectorizer and KMeans
countvectorizer kmeans nlp sentiment-analysis twitter vadersentiment
Last synced: 22 Mar 2025
https://github.com/hayatiyrtgl/flm_reviews_analysis
Sentiment analysis using ML classifiers for text data.
countvectorizer machine-learning natural-language-processing natural-language-understanding nltk python python3 random-forest random-forest-classifier scikitlearn-machine-learning sentiment-analysis sentiment-classification sklearn sklearn-classify
Last synced: 01 Jul 2025
https://github.com/nazmulhasannihal/fake-news-detection-naive-bayes-and-linearsvc-approaches
An end-to-end data science project: load, clean, and analyze news articles, then train and test text classification models using both LinearSVC and Naive Bayes. Evaluate results, interpret key features, and gain actionable insights. The project includes clear code, exploratory data analysis (EDA), and practical suggestions for further improvement.
countvectorizer jupyter-notebook linearsvc navie-bayes-algorithm python tfidfvectorization
Last synced: 27 Jun 2025
https://github.com/dhruvv1402/spam-detection-python-
This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.
countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf
Last synced: 31 Mar 2025
https://github.com/sanjanahombal/sentiment-analysis-using-neural-networks
This project explores sentiment analysis using neural networks
convolutional-neural-networks countvectorizer deep-learning keras matplotlib neural-network numpy python tensorflow tfidf-vectorizer
Last synced: 06 Jan 2026
https://github.com/ompreetham/nlp-sentiment-analysis-document-scoring-method
NLP Sentiment Analysis Document Scoring Method
countvectorizer document joblib nlkt nlp numpy pandas project re scoring sentiment-analysis sklearn
Last synced: 31 Dec 2025
https://github.com/elmezianech/classifyreviews_nlp
Revolutionize customer feedback analysis with our NLP Insights Analyzer. Utilize cutting-edge text preprocessing to transform raw reviews into a machine-friendly format. Explore sentiment models, such as Logistic Regression and Naive Bayes, employing cross-validation for model robustness.
accuracy-score ai countvectorizer cross-validation joblib logistic-regression machine-learning ml multinomialnb naive-bayes-classifier nltk-library numpy pandas randomforestclassifier svc
Last synced: 05 Dec 2025
https://github.com/sudarshanc00/movie-recommendation-system
This project is a content-based movie recommendation system built using Python and Streamlit. The system suggests movies similar to a user-selected movie by analyzing plot descriptions and using machine learning techniques like cosine similarity.
cosine-similarity countvectorizer nltk-python python recommender-system
Last synced: 03 Apr 2025
https://github.com/shifnyahmed/homeservicerecommendationsystem
A website for hiring service providers that also recommends service providers to the user.
bootstrap bootstrap4 bootstrap5 countvectorizer css django e-commerce-project flask html html5 javascript jinja2 machine-learning python python-3 python3 recommendation-engine recommendation-system sqlite sqllite-database
Last synced: 14 Oct 2025
https://github.com/parag000/content-based-movie-recommender
This project builds a content-based movie recommendation system using the TMDB dataset. By combining metadata features like cast, genres, and directors into a "metadata soup," it calculates movie similarity with vectorizers (Count) and cosine similarity. Ideal for learning content-based filtering and text vectorization techniques.
cosine-similarity countvectorizer recommendation-system scikit-learn tfidf-vectorizer vectorization
Last synced: 12 Jul 2025
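The "metadata soup" idea in the entry above can be illustrated with a small pure-Python sketch: each movie's metadata is joined into one string, converted to word counts, and compared by cosine similarity. This shows the underlying arithmetic only and is not the repository's code or scikit-learn's implementation; the titles, the soup strings, and the helper names `count_vector`, `cosine_similarity`, and `recommend` are hypothetical.

```python
import math
from collections import Counter

# Hypothetical "metadata soup" per movie: genres, cast, and director
# joined into a single whitespace-separated string.
SOUPS = {
    "Movie A": "action scifi actor_x director_y",
    "Movie B": "action thriller actor_x director_z",
    "Movie C": "romance comedy actor_w director_v",
}


def count_vector(text):
    """Bag-of-words counts, similar in spirit to a count vectorizer."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(title, soups, top_n=2):
    """Return the top_n other titles ranked by similarity to `title`."""
    query = count_vector(soups[title])
    scored = [
        (other, cosine_similarity(query, count_vector(soup)))
        for other, soup in soups.items()
        if other != title
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [other for other, _ in scored[:top_n]]


print(recommend("Movie A", SOUPS, top_n=1))
```

Here "Movie A" and "Movie B" share two of their four tokens, so their similarity is 2 / (2 * 2) = 0.5, while "Movie C" shares none and scores 0.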
https://github.com/jasmeetsingh7314/recommendation-system
A recommendation system for games, written in Python in a Jupyter notebook.
cosine-similarity countvectorizer machine-learning python
Last synced: 20 Jul 2025
https://github.com/elmezianech/email-spam-ham-classifier-nb-simple
Count Vectorizer Naive Bayes Email Classifier: This Python project utilizes a simple Naive Bayes approach with Count Vectorizer to classify emails as spam or ham. The implementation focuses on word frequency for classification.
countvectorizer emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering
Last synced: 31 Dec 2025
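A word-frequency spam/ham classifier of the kind described in the entry above might look like the sketch below, which pairs scikit-learn's `CountVectorizer` with `MultinomialNB`. It is an assumption-laden illustration, not the project's implementation: the messages and labels are invented, and the real project may preprocess text differently.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical toy messages, invented for illustration only.
messages = [
    "win a free prize now",
    "free cash offer click now",
    "claim your free prize",
    "are we meeting for lunch today",
    "please review the attached report",
    "see you at the meeting tomorrow",
]
labels = ["spam"] * 3 + ["ham"] * 3

# Word counts feed a multinomial Naive Bayes model, which scores each
# class by how often the message's words occurred in that class.
classifier = make_pipeline(CountVectorizer(), MultinomialNB())
classifier.fit(messages, labels)

unseen = ["free prize offer", "lunch meeting tomorrow"]
predicted = [str(label) for label in classifier.predict(unseen)]
print(predicted)
```

With the default `alpha=1.0` smoothing, words never seen in a class still get a small non-zero probability, so an unfamiliar word does not zero out a class score.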
https://github.com/munavarhs/analysisofjakeandhyde
Analysis of Jake and Hyde (Contrasting communities)
classification-algorithm countvectorizer evaluation-metrics glove-embeddings machine-learning-algorithms sentiment-analysis tf-idf
Last synced: 24 Feb 2025
https://github.com/abideen-olawuwo/language-detection
A language detection app
countvectorizer multinomial-naive-bayes numpy pandas
Last synced: 04 Mar 2025
https://github.com/bk0712/spam-detection-python-
This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.
accuracy-metrics bernoulli-naive-bayes countvectorizer dataset faker heroku kaggle machine-learning matplotlib nltk numpy python pytorch tf-idf
Last synced: 26 Jun 2025