Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/elmezianech/email-spam-ham-classifier-nb-hard
Naive Bayes Email Classifier: An implementation of a 'hard' Naive Bayes classifier in Python to categorize emails as spam or ham. This code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm.
ai emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering
Last synced: about 8 hours ago
- Host: GitHub
- URL: https://github.com/elmezianech/email-spam-ham-classifier-nb-hard
- Owner: elmezianech
- Created: 2023-11-13T21:36:31.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-11-13T21:41:28.000Z (about 1 year ago)
- Last Synced: 2023-11-13T22:34:21.691Z (about 1 year ago)
- Topics: ai, emails, jupyter-notebook, machine-learning, ml, naive-bayes, naive-bayes-classifier, numpy, pandas, python, spam-detection, spam-filtering
- Language: Jupyter Notebook
- Homepage:
- Size: 0 Bytes
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Email-Spam-Ham-Classifier-NB-hard
Naive Bayes Email Classifier: an implementation of a 'hard' Naive Bayes classifier in Python that categorizes emails as spam or ham. The code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm. The implementation computes probabilities directly and handles missing values using constant value imputation.

This project uses the "Email Dataset for Spam Detection" sourced from Kaggle. The dataset contains a collection of emails labeled as spam or ham, used for training and testing the classifier.
Link: https://www.kaggle.com/datasets/bhaskarreddy072/mail-datacsv
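
As a rough illustration of what computing Naive Bayes probabilities directly can look like, here is a minimal sketch using only the Python standard library. It is not the notebook's actual code: the function names (`tokenize`, `impute`, `train`, `log_posteriors`, `predict`), the empty-string fill value, the Laplace smoothing, the use of log-probabilities, and the toy messages are all assumptions made for this example.

```python
import math
import re
from collections import Counter

MISSING_TEXT = ""  # assumed constant used to fill in missing messages


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def impute(messages, fill=MISSING_TEXT):
    """Constant value imputation: replace missing (None) messages with `fill`."""
    return [fill if m is None else m for m in messages]


def train(messages, labels):
    """Count documents per class and word occurrences per class."""
    class_counts = Counter(labels)
    word_counts = {c: Counter() for c in class_counts}
    for text, label in zip(impute(messages), labels):
        word_counts[label].update(tokenize(text))
    vocab = set().union(*word_counts.values())
    return {"class_counts": class_counts, "word_counts": word_counts, "vocab": vocab}


def log_posteriors(model, text, alpha=1.0):
    """Return the unnormalized log P(class | text) for every class.

    Uses Laplace (add-alpha) smoothing, which is an assumption of this sketch,
    and skips words that never appeared in the training data.
    """
    total_docs = sum(model["class_counts"].values())
    vocab_size = len(model["vocab"])
    scores = {}
    for c, n_docs in model["class_counts"].items():
        counts = model["word_counts"][c]
        total_words = sum(counts.values())
        score = math.log(n_docs / total_docs)  # log prior
        for word in tokenize(text):
            if word not in model["vocab"]:
                continue
            # log likelihood of the word given the class
            score += math.log((counts[word] + alpha) / (total_words + alpha * vocab_size))
        scores[c] = score
    return scores


def predict(model, text):
    """Pick the class with the highest log posterior."""
    scores = log_posteriors(model, impute([text])[0])
    return max(scores, key=scores.get)


# Toy data invented for this example; the real project uses the Kaggle dataset.
messages = [
    "Win a free prize now",
    "Free cash offer, claim now",
    "Are we still meeting for lunch",
    None,  # a missing message, handled by constant imputation
    "See you at the meeting tomorrow",
]
labels = ["spam", "spam", "ham", "ham", "ham"]

model = train(messages, labels)
print(predict(model, "claim your free prize"))
print(predict(model, "lunch meeting tomorrow"))
```

Working in log space avoids numeric underflow when many small word probabilities are multiplied together, and the smoothing keeps a word that is unseen in one class from driving that class's probability to zero. Whether the notebook does either of these is not stated in the README.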