https://github.com/thomasnield/bayes_email_spam
Experimenting with Bayes and Email Spam Categorization
https://github.com/thomasnield/bayes_email_spam
Last synced: 3 months ago
JSON representation
Experimenting with Bayes and Email Spam Categorization
- Host: GitHub
- URL: https://github.com/thomasnield/bayes_email_spam
- Owner: thomasnield
- License: apache-2.0
- Created: 2018-02-26T00:20:25.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-03-19T21:13:49.000Z (about 8 years ago)
- Last Synced: 2025-07-28T14:50:04.566Z (10 months ago)
- Language: Kotlin
- Size: 17.6 KB
- Stars: 7
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Bayes Email Spam Filter
Experimenting with Bayes and Email Spam Categorization in Kotlin. No dependencies needed besides Kotlin 1.2 std-lib.
Chapter 13 of [O'Reilly Data Science from Scratch](https://github.com/joelgrus/data-science-from-scratch/blob/master/code-python3/naive_bayes.py) has been the most helpful resource for this project.
**OUTPUT:**
```
Score for an email containing message: "discount viagra wholesale, hurry while this offer lasts"
0.9990090904079181
Score for an email containing message: "interesting meeting on amazon cloud services discount program"
0.01754275128116032
Spammiest Words
viagra 0.7
this 0.5
for 0.5
prescription 0.5
hey 0.3
Hammiest (non-spam) Words
your 0.7857142857142857
amazon 0.5
for 0.35714285714285715
a 0.35714285714285715
meeting 0.35714285714285715
```