Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mehrab-kalantari/sms-spam-classifier

Building spam classifier using different algorithms and choosing the best one to build a streamlit application
https://github.com/mehrab-kalantari/sms-spam-classifier

classification english-nlp imbalanced-learning machine-learning nlp streamlit supervised-learning voting-classifier

Last synced: 4 days ago
JSON representation

Building spam classifier using different algorithms and choosing the best one to build a streamlit application

Awesome Lists containing this project

README

        

# SMS Spam Classifier
[Dataset on kaggle](https://www.kaggle.com/datasets/uciml/sms-spam-collection-dataset?resource=download)

## Contents
### Data cleaning
* Drop null columns
* Drop duplicates
* Label encoding
* Drop NaN values

### Data understanding and EDA
* Pie chart
* Bar chart
* Histogram
* Pair plot
* Correlation matrix

### Data preprocessing
* To lower case
* Tokenization
* Removing special characters
* Removing stopwords and punctuation
* Stemming

### Modeling
* Single classifiers
* Voting classifier

### Evaluation
Here we choose the best model from the following measures
* Accuracy
* Precision
* F1 score

![models](/sample/models.png)

### Deployment
Deploy our model on streamlit

---
### Sample
![s](sample/sample.jpg)