https://github.com/rmodi6/email-classification
Classifying emails into custom user labels
https://github.com/rmodi6/email-classification
email-dataset knn-classification machine-learning naive-bayes-classification natural-language-processing python
Last synced: 3 months ago
JSON representation
Classifying emails into custom user labels
- Host: GitHub
- URL: https://github.com/rmodi6/email-classification
- Owner: rmodi6
- Created: 2016-02-15T09:34:37.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2026-03-04T17:19:51.000Z (3 months ago)
- Last Synced: 2026-03-04T23:55:55.110Z (3 months ago)
- Topics: email-dataset, knn-classification, machine-learning, naive-bayes-classification, natural-language-processing, python
- Language: Python
- Homepage:
- Size: 448 KB
- Stars: 27
- Watchers: 2
- Forks: 7
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Email-Classification
### Abstract
In the world of Internet today, huge amount of data is transferred between computers in the form of emails. Consequently, it is getting difficult to sort the important emails manually from the unimportant ones. Email classification has been extensively studied and researched in the past but most of the research has been in the field of spam detection and filtering. This paper focuses on classifying emails into custom folders that are relevant to the user. We have used two different approaches here—Naïve Bayes classifier and k-nearest neighbors algorithm. The Naïve Bayes classifier is based on a probabilistic model, while the k-nearest neighbors algorithm is based on a similarity measure with the training emails. We propose the method of using these two approaches in email classification, analyze the performance of these algorithms, and compare their results. Then, we propose some future work for further optimization and better efficiency.
### Project Demo
[](https://www.youtube.com/watch?v=0bxs2zrR5fU "Click to play the video")
### Publication
A [technical paper](https://doi.org/10.1007/978-981-10-6875-1_9) of this project was published in Advances in Intelligent Systems and Computing (AISC, volume 564) series of Springer 2017