{"id":16272352,"url":"https://github.com/lurenss/spam-filter","last_synced_at":"2025-04-08T15:47:56.742Z","repository":{"id":169101737,"uuid":"356529999","full_name":"lurenss/Spam-filter","owner":"lurenss","description":"Second assignment of Artificial Intelligence course held by Professor Andrea Torsello of Ca' Foscari University of Venice, spam detectors with SVM classification using linear, polynomial of degree 2, RBF kernels and Naive Bayes and k-NN ","archived":false,"fork":false,"pushed_at":"2021-09-12T12:38:33.000Z","size":145,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-14T12:19:57.844Z","etag":null,"topics":["k-nn","linear-kernel","machine-learning","naive-bayes-classifier","polynomial-kernel","rbf-kernel","svm-classifier"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/lurenss.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":"SupportVectorMachine.py","governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-04-10T09:12:40.000Z","updated_at":"2021-09-12T12:39:35.000Z","dependencies_parsed_at":null,"dependency_job_id":"ecaef00d-a229-4672-bb55-330fe9f1611b","html_url":"https://github.com/lurenss/Spam-filter","commit_stats":null,"previous_names":["lurenss/spam-filter"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lurenss%2FSpam-filter","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lurenss%2FSpam-filter/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lurenss%2FSpam-filter/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lurenss%2FSpam-filter/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/lurenss","download_url":"https://codeload.github.com/lurenss/Spam-filter/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247875525,"owners_count":21010931,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["k-nn","linear-kernel","machine-learning","naive-bayes-classifier","polynomial-kernel","rbf-kernel","svm-classifier"],"created_at":"2024-10-10T18:17:23.248Z","updated_at":"2025-04-08T15:47:51.734Z","avatar_url":"https://github.com/lurenss.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ASSIGNMENT \n\nWrite a spam filter using discrimitative and generative classifiers. Use the Spambase dataset which already represents spam/ham messages through a bag-of-words representations through a dictionary of 48 highly discriminative words and 6 characters. The first 54 features correspond to word/symbols frequencies; ignore features 55-57; feature 58 is the class label (1 spam/0 ham).\n\nPerform SVM classification using linear, polynomial of degree 2, and RBF kernels over the TF/IDF representation.\nCan you transform the kernels to make use of angular information only (i.e., no length)? Are they still positive definite kernels?\nClassify the same data also through a Naive Bayes classifier for continuous inputs, modelling each feature with a Gaussian distribution\n\nPerform k-NN clasification with k=5\nProvide the code, the models on the training set, and the respective performances in 10 way cross validation.\n\nExplain the differences between the three models.\n\n\n\nP.S. you can use a library implementation for SVM, but do implement the Naive Bayes on your own. As for k-NN, you can use libraries if you want, but it might just be easier to do it on your own.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flurenss%2Fspam-filter","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flurenss%2Fspam-filter","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flurenss%2Fspam-filter/lists"}