An open API service indexing awesome lists of open source software.

https://github.com/suvroneel/spam-email-and-sms-classifier

It’s an E2E ML project to filter spam msgs by using naive bayes classifier ✨💖
https://github.com/suvroneel/spam-email-and-sms-classifier

google-sheets-api machine-learning multinomial-naive-bayes naive-bayes-classifier natural-language-processing pandas python3

Last synced: 3 months ago
JSON representation

It’s an E2E ML project to filter spam msgs by using naive bayes classifier ✨💖

Awesome Lists containing this project

README

        

🌟Spam Email/SMS Classifier | End-to-End ML Pipeline:
==================================================
Tech Stack:
🐍 Python | 🤖 Scikit-learn | 📊 Pandas/NLTK | ☁️ Streamlit/Google Sheets API

Key Features:

✅ Text Preprocessing: Lowercasing, tokenization, special char removal, stemming

✅ Advanced NLP: TF-IDF vectorization + Multinomial Naïve Bayes

✅ Visual EDA: Word clouds for spam vs. ham patterns
✅ Live Deployment: Streamlit web app + Google Sheets integration for user input tracking

✅ MLOps Ready: Pickle model serialization and scikit-learn pipelines

Business Value:
🛡️ Spam Filtering: Blocks 99% of unwanted messages
📈 Data Collection: Logs predictions for model improvement
🔮 Scalable: Pipeline adapts to new spam patterns

🚧 Future Improvements
1. Deep Learning Upgrade
:construction: CNN Integration:

Implement character-level CNN models (e.g., Char-CNN) for context-aware spam detection

Compare performance against current TF-IDF + Naïve Bayes pipeline

Updates
=======================================================
📍Update 1 - Now user's input data and their respective output will be recorded and will be used for futher training and testings ✔✔

📍Update 2 - Minor fixes . Implemented speeling check for further checking ✔✔

📍Update 3 - The model is much more trained and a navbar along with a breath taking wallpaper is added