https://github.com/msikorski93/spam-detection-with-lstm-polish
Detecting spam (a typical binary classification problem) on Polish emails.
https://github.com/msikorski93/spam-detection-with-lstm-polish
emails embeddings lstm-neural-networks nlp polish-language spam-detection tensorflow word2vec
Last synced: 8 months ago
JSON representation
Detecting spam (a typical binary classification problem) on Polish emails.
- Host: GitHub
- URL: https://github.com/msikorski93/spam-detection-with-lstm-polish
- Owner: msikorski93
- Created: 2023-09-07T21:53:06.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-09-07T22:20:52.000Z (about 2 years ago)
- Last Synced: 2025-01-09T07:51:31.282Z (9 months ago)
- Topics: emails, embeddings, lstm-neural-networks, nlp, polish-language, spam-detection, tensorflow, word2vec
- Language: Jupyter Notebook
- Homepage:
- Size: 2.12 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Spam-Detection-With-LSTM-Polish




The notebook aimed to perform and demonstrate a binary classification problem - spam detection, on a dataset of Polish emails. To complete the task we developed a LSTM model which is a specific type of recurrent neural network (RNN). We proved the high effectiveness of its application for natural language processing (NLP) tasks and achieved an overall 97.06% accuracy. Before training the neural network we used word2vec technique to convert descriptions into embeddings with a pre-trained model specifically for Polish, developed by the Polish Academy of Science. The evaluation of the neural network was done with standard plots.
![]()