{"id":13305570,"url":"https://github.com/BaileeRice/Module20","last_synced_at":"2025-03-10T13:31:47.135Z","repository":{"id":63214578,"uuid":"554534457","full_name":"BaileeRice/Module20","owner":"BaileeRice","description":"Testing machine learning models to see if they can accurately determine the genre of songs","archived":false,"fork":false,"pushed_at":"2022-11-21T15:25:50.000Z","size":13809,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2023-05-30T00:20:42.446Z","etag":null,"topics":["excel","google-slides","kmeans","random-forest","sql","tableau"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/BaileeRice.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-10-20T01:09:09.000Z","updated_at":"2022-12-01T00:01:21.000Z","dependencies_parsed_at":"2023-01-23T01:31:21.576Z","dependency_job_id":null,"html_url":"https://github.com/BaileeRice/Module20","commit_stats":null,"previous_names":[],"tags_count":null,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaileeRice%2FModule20","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaileeRice%2FModule20/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaileeRice%2FModule20/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaileeRice%2FModule20/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/BaileeRice","download_url":"https://codeload.github.com/BaileeRice/Module20/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":242859654,"owners_count":20196978,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["excel","google-slides","kmeans","random-forest","sql","tableau"],"created_at":"2024-07-29T17:53:54.378Z","updated_at":"2025-03-10T13:31:44.990Z","avatar_url":"https://github.com/BaileeRice.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Module20\n## Segment #3\n\nFor this project, we were interested in seeing just how efficient a machine learning model will be at accurately predicting the genre of a song. We obtained dataset from Kaggle as it was one of the most popular ones out there, and from this dataset we decided to focus on four specific genres; Rock, R\u0026B, Country, and Rap. All genre were solely chosen based on the fact that they contained the most data collected by spotify as shown in the graph below.\n(https://www.kaggle.com/datasets/thedevastator/popularity-of-spotify-top-tracks-by-genre?select=rap_playlist_tracks.csv).\n\n\n![Screenshot (35)](https://user-images.githubusercontent.com/108035567/202359198-47dad07c-8e77-4593-bb97-fb51b2b11e0f.png)\n\n\n\nWith access to the many files containing different data about genre, we decided to only use the playlist track data and focused on 11 specific attrubutes as shown on the image below. These attributes were chosen because we felt as though they would increase the accuracy of the predictions made by the machine. \n\n![Screenshot (30)](https://user-images.githubusercontent.com/108035567/202317031-8bd8353f-dd4a-4a2d-834c-8db15c10eaa2.png)\n\n\nUsing the K-means clustering method we created an unsupervised machine learning model. The image below depicts a snipet of the code we used for the mlm. The number of clusters represents the four genres we'll focus on for the project.\n\n![Screenshot (33)](https://user-images.githubusercontent.com/108035567/202354197-a276d178-1cfb-41e0-a73a-23f4cb06991d.png)\n\n\nAlthough unsupervised machine learning was a successful at showing patterns within the data as it relates the genre, it was not the best at accurately predicting genre while taking into consideration the 11 attributes in the table above. The image below shows one of the patterns found using this model. The tableau link below will allow you to manipulate the attributes and see the patterns.\n\n![2022-11-16 (2)](https://user-images.githubusercontent.com/108035567/202358443-755b6166-9f1d-420a-9df0-16e3231e4444.png)\n\n\nA supervised version of the machine learning model was created, and upon testing, we found that it had a balanced accuracy score of 72.6%. We also found that speechiness, danceability, and acousticness were the top three biggest contributors of accurate genre prediction as shown in the image below.\n\n![image](https://user-images.githubusercontent.com/108035567/202345398-939cf29c-c16e-4986-9940-9c183b9472ce.png)\n\n\nSlides:https://docs.google.com/presentation/d/1y_DgnlQ9wYASojwS9idZCyP-hs8Qp631jknh44w6OYo/edit?usp=sharing\n\n_______________________________________\n\n#Tableau\n\nBaseline Dashboard -https://public.tableau.com/app/profile/melanie.taylor6095/viz/SpotifyMLClassifications/Baseline_1?publish=yes\nTrack Breakdown Classification - https://public.tableau.com/app/profile/melanie.taylor6095/viz/SpotifyMLClassifications/SongClassificationBreakdown?publish=yes\n_________________________________________\n\nBaseline Attributes Story - https://public.tableau.com/app/profile/melanie.taylor6095/viz/SpotifyMLClassifications/BaselineAttributes?publish=yes\n\nResults Story - https://public.tableau.com/app/profile/melanie.taylor6095/viz/SpotifyMLClassifications/Results?publish=yes\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FBaileeRice%2FModule20","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FBaileeRice%2FModule20","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FBaileeRice%2FModule20/lists"}