https://github.com/jonluca/reddit-research
Naive bayes filter for reddit
- Host: GitHub
- URL: https://github.com/jonluca/reddit-research
- Owner: jonluca
- Created: 2017-07-20T19:05:05.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2020-01-18T16:52:55.000Z (over 6 years ago)
- Last Synced: 2025-04-04T23:12:13.772Z (about 1 year ago)
- Language: JavaScript
- Homepage:
- Size: 5.86 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 1
Metadata Files:
- Readme: README.md
README
# Reddit Naive Bayes Filter
We will collect as much spam as possible from various subreddits to train our model. The trained model will be used within a Reddit bot to flag potentially bad users or posts.
## Format
The filter reads two files, `spam.txt` and `good.txt`, which are newline-delimited lists of spam post titles and valid post titles, respectively. It trains the model on those titles, then runs it against the test files `spam-test.txt` and `good-test.txt` to measure how accurately the model classifies titles it has not seen.
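
The train-then-test flow described above could look roughly like the sketch below. It is not the repository's actual code: the function names (`parseTitles`, `tokenize`, `train`, `classify`, `accuracy`) are hypothetical, the tokenizer and the Laplace (add-one) smoothing are assumptions, and the titles are made-up inline samples standing in for the contents of `spam.txt` and `good.txt`.

```javascript
// Minimal multinomial Naive Bayes sketch (assumed design, not the repo's code).

// Split newline-delimited text into non-empty titles.
function parseTitles(text) {
  return text.split("\n").map((line) => line.trim()).filter(Boolean);
}

// Lowercase a title and split it into simple word tokens (assumed tokenizer).
function tokenize(title) {
  return title.toLowerCase().match(/[a-z0-9']+/g) || [];
}

// Count words per class and collect the shared vocabulary.
function train(spamTitles, goodTitles) {
  const model = {
    spam: { docs: 0, total: 0, counts: new Map() },
    good: { docs: 0, total: 0, counts: new Map() },
    vocab: new Set(),
  };
  const add = (label, titles) => {
    const cls = model[label];
    for (const title of titles) {
      cls.docs += 1;
      for (const word of tokenize(title)) {
        cls.counts.set(word, (cls.counts.get(word) || 0) + 1);
        cls.total += 1;
        model.vocab.add(word);
      }
    }
  };
  add("spam", spamTitles);
  add("good", goodTitles);
  return model;
}

// Log prior plus the sum of add-one-smoothed log likelihoods for one class.
function logScore(model, label, words) {
  const cls = model[label];
  const totalDocs = model.spam.docs + model.good.docs;
  let score = Math.log(cls.docs / totalDocs);
  for (const word of words) {
    const count = cls.counts.get(word) || 0;
    score += Math.log((count + 1) / (cls.total + model.vocab.size));
  }
  return score;
}

// Pick the class with the higher log score.
function classify(model, title) {
  const words = tokenize(title);
  return logScore(model, "spam", words) > logScore(model, "good", words)
    ? "spam"
    : "good";
}

// Fraction of titles assigned the expected label (for the held-out test files).
function accuracy(model, titles, expectedLabel) {
  const hits = titles.filter((t) => classify(model, t) === expectedLabel).length;
  return hits / titles.length;
}

// Made-up sample data standing in for the contents of spam.txt and good.txt.
const spamText = "Free money click here\nWin free prizes now\nClick here for free followers";
const goodText = "What is your favorite book\nHow do I learn JavaScript\nAny tips for a first marathon";

const model = train(parseTitles(spamText), parseTitles(goodText));
console.log(classify(model, "free prizes click here"));
console.log(classify(model, "how do I learn to cook"));
```

To run this against real data, the inline strings would be replaced with file contents, for example `fs.readFileSync("spam.txt", "utf8")` from Node's `fs` module, and `accuracy` would be called with the parsed titles from `spam-test.txt` and `good-test.txt`. Working in log space avoids numeric underflow when many small probabilities are multiplied, and the add-one smoothing keeps a word unseen in one class from forcing that class's probability to zero.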
## Authors
Currently being written by /u/JonLuca, /u/Clearly-Coder, and /u/BB-r8