{"id":14958395,"url":"https://github.com/abdullahselek/spampy","last_synced_at":"2025-08-21T06:32:43.616Z","repository":{"id":57469543,"uuid":"136163739","full_name":"abdullahselek/spampy","owner":"abdullahselek","description":"Spam filtering module with Machine Learning using SVM (Support Vector Machines).","archived":false,"fork":false,"pushed_at":"2021-02-27T19:13:59.000Z","size":612,"stargazers_count":28,"open_issues_count":0,"forks_count":8,"subscribers_count":4,"default_branch":"master","last_synced_at":"2024-12-14T23:28:10.409Z","etag":null,"topics":["enron-spam-dataset","machine-learning","numpy","scipy","sklearn","sklearn-classify","spam-classification","support-vector-machines"],"latest_commit_sha":null,"homepage":"https://spampy.abdullahselek.com","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/abdullahselek.png","metadata":{"files":{"readme":"README.rst","changelog":null,"contributing":null,"funding":null,"license":"COPYING","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-06-05T10:55:41.000Z","updated_at":"2024-11-01T10:21:09.000Z","dependencies_parsed_at":"2022-09-19T10:11:55.596Z","dependency_job_id":null,"html_url":"https://github.com/abdullahselek/spampy","commit_stats":null,"previous_names":[],"tags_count":4,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/abdullahselek%2Fspampy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/abdullahselek%2Fspampy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/abdullahselek%2Fspampy/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/abdullahselek%2Fspampy/manifests","owner_url":"https://repos.ecosyste.m
s/api/v1/hosts/GitHub/owners/abdullahselek","download_url":"https://codeload.github.com/abdullahselek/spampy/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":230497984,"owners_count":18235610,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["enron-spam-dataset","machine-learning","numpy","scipy","sklearn","sklearn-classify","spam-classification","support-vector-machines"],"created_at":"2024-09-24T13:16:55.856Z","updated_at":"2024-12-19T20:27:32.982Z","avatar_url":"https://github.com/abdullahselek.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"spampy\n======\n\n.. image:: https://github.com/abdullahselek/spampy/workflows/spampy%20ci/badge.svg\n    :target: https://github.com/abdullahselek/spampy/actions\n\n.. image:: https://img.shields.io/pypi/v/spampy.svg\n    :target: https://pypi.python.org/pypi/spampy/\n\n.. image:: https://img.shields.io/pypi/pyversions/spampy.svg\n    :target: https://pypi.org/project/spampy\n\n.. image:: https://pepy.tech/badge/spampy\n    :target: https://pepy.tech/project/spampy\n\n.. image:: https://img.shields.io/conda/vn/conda-forge/spampy?logo=conda-forge\n    :target: https://anaconda.org/conda-forge/spampy\n\n.. image:: https://anaconda.org/conda-forge/spampy/badges/latest_release_date.svg\n    :target: https://anaconda.org/conda-forge/spampy\n\n.. image:: https://anaconda.org/conda-forge/spampy/badges/license.svg\n    :target: https://anaconda.org/conda-forge/spampy\n\nSpam filtering module with Machine Learning using SVM. 
**spampy** is a classifier that uses ``Support Vector Machines``\nto classify raw emails as spam or not spam.\n\nSupport vector machines (SVMs) are supervised learning models with associated learning algorithms that analyze data\nfor classification and regression analysis. Given a set of training examples, each marked as belonging to one or the other\nof two categories, an SVM training algorithm builds a model that assigns new examples to one category or the other, making\nit a non-probabilistic binary linear classifier.\n\nMany email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.\n**spampy** is a learning project that you can use to filter spam emails.\n\n**spampy** uses two different datasets for classification. One of the datasets is already included in the project under the ``spampy/datasets/`` folder.\nThe second dataset is the `enron-spam \u003chttp://www.aueb.gr/users/ion/data/enron-spam/\u003e`_ dataset, and inside the ``spampy`` folder I created a shell script which\ndownloads and extracts it for you.\n\nProject tree\n------------\n\n* email_processor ``Helper to collect features and labels from datasets.``\n* spam_classifier ``Classifies given raw emails.``\n* dataset_downloader ``Enron dataset downloader which uses dataset_downloader.sh``\n\nDependency List\n---------------\n\n* scikit_learn\n* scipy\n* numpy\n* nltk\n* click (for CLI)\n\nThe two main functions of ``spam_classifier`` classify a given raw email.\n\n* ``classify_email``\n* ``classify_email_with_enron``\n\nInstalling\n----------\n\nYou can install spampy from the Python Package Index::\n\n    $ pip install spampy\n\nInstall with conda from the Anaconda conda-forge channel::\n\n    $ conda install -c conda-forge spampy\n\nInstall from its source repository on GitHub::\n\n    $ pip install -e git+https://github.com/abdullahselek/spampy#egg=spampy\n\nCLI\n---\n\nFor available commands, run ``python -m spampy -h``\n\n.. 
code-block::\n\n    Spam filtering module with Machine Learning using SVM.\n    Usage\n      $ python spampy [\u003coptions\u003e]\n    Options\n      --help, -h              Display help message\n      --download, -d          Download enron dataset\n      --eclassify, -ec        Classify given raw email with enron dataset, prompts for raw email\n      --classify, -c          Classify given raw email, prompts for raw email\n      --version, -v           Display installed version\n    Examples\n      $ python spampy --help\n      $ python spampy --download\n      $ python spampy --eclassify\n      $ python spampy --classify\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fabdullahselek%2Fspampy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fabdullahselek%2Fspampy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fabdullahselek%2Fspampy/lists"}