Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with countvectorizer

A curated list of projects in awesome lists tagged with countvectorizer.

https://github.com/geekquad/text-learning

Basic usage of NLTK, with implementations of concepts such as stemming, TF-IDF, and text.CountVectorizer.

corpus countvectorizer nltk sklearn stopwords tfidf

Last synced: 10 Nov 2024

https://github.com/parag000/content-based-movie-recommender

This project builds a content-based movie recommendation system using the TMDB dataset. It combines metadata features such as cast, genres, and directors into a "metadata soup," vectorizes that text with a count vectorizer, and ranks movies by cosine similarity. Ideal for learning content-based filtering and text vectorization techniques.

cosine-similarity countvectorizer recommendation-system scikit-learn tfidf-vectorizer vectorization

Last synced: 23 Nov 2024
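
The "metadata soup" approach described for content-based-movie-recommender can be illustrated with a minimal sketch. This is not the repository's code: it uses only the Python standard library in place of scikit-learn's CountVectorizer, and the catalogue, the titles, and the helper names (make_soup, count_vector, cosine, recommend) are hypothetical, chosen purely for illustration.

```python
import math
from collections import Counter

# Hypothetical toy catalogue; all titles and metadata are made up.
movies = {
    "Movie A": {"cast": ["alice smith", "bob jones"],
                "genres": ["action", "thriller"],
                "director": "carol lee"},
    "Movie B": {"cast": ["alice smith", "dan wu"],
                "genres": ["action", "adventure"],
                "director": "carol lee"},
    "Movie C": {"cast": ["eve kim", "frank roy"],
                "genres": ["romance", "comedy"],
                "director": "gina paz"},
}


def make_soup(meta):
    """Join cast, director, and genres into one space-separated string.

    Spaces inside names are removed so each person is a single token.
    """
    people = [name.replace(" ", "") for name in meta["cast"] + [meta["director"]]]
    return " ".join(people + meta["genres"])


def count_vector(text):
    """Bag-of-words counts: token -> number of occurrences."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


vectors = {title: count_vector(make_soup(meta)) for title, meta in movies.items()}


def recommend(title, k=2):
    """Return the k most similar other titles with their scores."""
    scores = [(other, cosine(vectors[title], vectors[other]))
              for other in vectors if other != title]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:k]


print(recommend("Movie A", k=1))
```

In this toy data, Movie A and Movie B share a cast member, a director, and a genre, so Movie B ranks first for Movie A, while Movie C shares no tokens and scores zero.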

https://github.com/somjit101/nlp-stackeroverflow-tag-prediction

A multi-class classification problem where the objective is to read a question posted on the popular reference website Stack Overflow and predict the primary topics it deals with, i.e. the tags the question will be associated with.

bag-of-words countvectorizer logistic-regression multi-class-classification multiclass-logistic-regression natural-language-processing nlp one-vs-rest onevsrestclassifier stackoverflow-tags stemming text-mining tf-idf tfidf-vectorizer word-cloud

Last synced: 16 Nov 2024

https://github.com/sayamalt/text-similarity-quantifier

A machine learning model that computes a similarity score between two text paragraphs taken as input from a webpage.

bag-of-words cosine-similarity cosine-similarity-scores countvectorizer flask machine-learning nlp pandas python text-preprocessing tfidf

Last synced: 07 Nov 2024

https://github.com/statute8234/redditsuggestor

Rating: (6/10) The project uses Python libraries and APIs to analyze Reddit data, predict user input, suggest new titles based on cosine similarity, calculate combined scores, and output the best suggestion.

countvectorizer difflib filedialog multinomialnb openpyxl pandas praw random scikit threading tkinter ttk

Last synced: 18 Nov 2024

https://github.com/sudarshanc00/movie-recommendation-system

This project is a content-based movie recommendation system built using Python and Streamlit. The system suggests movies similar to a user-selected movie by analyzing plot descriptions and using machine learning techniques like cosine similarity.

cosine-similarity countvectorizer nltk-python python recommender-system

Last synced: 16 Dec 2024

https://github.com/akashkg03/spam-email-classification

This notebook builds a spam email classifier using Naive Bayes, with features extracted by CountVectorizer.

classification countvectorizer jupiter-notebook naive-bayes-classifier pandas python

Last synced: 10 Nov 2024

https://github.com/elmezianech/classifyreviews_nlp

Revolutionize customer feedback analysis with our NLP Insights Analyzer. Utilize cutting-edge text preprocessing to transform raw reviews into a machine-friendly format. Explore sentiment models, such as Logistic Regression and Naive Bayes, employing cross-validation for model robustness.

accuracy-score ai countvectorizer cross-validation joblib logistic-regression machine-learning ml multinomialnb naive-bayes-classifier nltk-library numpy pandas randomforestclassifier svc

Last synced: 15 Nov 2024

https://github.com/elmezianech/email-spam-ham-classifier-nb-simple

Count Vectorizer Naive Bayes Email Classifier: This Python project utilizes a simple Naive Bayes approach with Count Vectorizer to classify emails as spam or ham. The implementation focuses on word frequency for classification.

countvectorizer emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 15 Nov 2024
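
The word-frequency approach shared by the spam classifier entries above can be sketched as a multinomial Naive Bayes over raw token counts. This is a standard-library illustration rather than the code of any listed repository, which would more likely use scikit-learn's CountVectorizer and MultinomialNB. The training messages and the helper names (tokenize, fit, predict) are hypothetical, and add-one (Laplace) smoothing is an assumption of this sketch.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy training set; real projects train on a labelled email corpus.
train = [
    ("win cash prize now", "spam"),
    ("claim your free prize", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch on monday", "ham"),
]


def tokenize(text):
    """Lowercase and split on whitespace."""
    return text.lower().split()


def fit(samples):
    """Collect per-class document counts, per-class word counts, and the vocabulary."""
    class_docs = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in samples:
        tokens = tokenize(text)
        class_docs[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_docs, word_counts, vocab


def predict(text, model):
    """Return the class with the highest log-posterior under add-one smoothing."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in class_docs:
        score = math.log(class_docs[label] / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # words outside the fitted vocabulary are ignored
            count = word_counts[label][token]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = fit(train)
print(predict("free cash prize", model))
print(predict("monday meeting", model))
```

Working in log space avoids numerical underflow when many small word probabilities are multiplied, and smoothing keeps a word seen in only one class from forcing the other class's probability to zero.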