{"id":21176596,"url":"https://github.com/anshpg/popularity-based-book-recommendation-system","last_synced_at":"2026-04-28T12:36:26.724Z","repository":{"id":229352765,"uuid":"776512284","full_name":"ANSHPG/Popularity-Based-Book-Recommendation-System","owner":"ANSHPG","description":"I developed a popularity-based book recommendation system using Google Colab, leveraging libraries like pandas, numpy, and matplotlib. The dataset was obtained from Kaggle.com, consisting of three CSV files: books.csv, ratings.csv, and users.csv.","archived":false,"fork":false,"pushed_at":"2024-03-23T18:07:34.000Z","size":25092,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-03T05:34:36.620Z","etag":null,"topics":["collab","csv","kaggle-dataset","matplotlib-pyplot","numpy","pandas","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ANSHPG.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-03-23T18:01:50.000Z","updated_at":"2024-03-23T18:06:22.000Z","dependencies_parsed_at":null,"dependency_job_id":"827f5855-1044-4ae6-a246-1f5871240dd5","html_url":"https://github.com/ANSHPG/Popularity-Based-Book-Recommendation-System","commit_stats":null,"previous_names":["anshpg/-popularity-based-book-recommendation-system"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/ANSHPG/Popularity-Based-Book-Recommendation-System","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ANSHPG%2FPopularity-Based-Book-Recommendation-System","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ANSHPG%2FPopularity-Based-Book-Recommendation-System/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ANSHPG%2FPopularity-Based-Book-Recommendation-System/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ANSHPG%2FPopularity-Based-Book-Recommendation-System/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ANSHPG","download_url":"https://codeload.github.com/ANSHPG/Popularity-Based-Book-Recommendation-System/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ANSHPG%2FPopularity-Based-Book-Recommendation-System/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":264275912,"owners_count":23583331,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["collab","csv","kaggle-dataset","matplotlib-pyplot","numpy","pandas","python"],"created_at":"2024-11-20T17:03:30.762Z","updated_at":"2026-04-28T12:36:26.684Z","avatar_url":"https://github.com/ANSHPG.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Popularity-Based-Book-Recommendation-System\n\nI developed a popularity-based book recommendation system using Google Colab, leveraging libraries like pandas, numpy, and matplotlib. The dataset was obtained from Kaggle.com, consisting of three CSV files: books.csv, ratings.csv, and users.csv. For this project, I utilized only the books and ratings CSV files.\n\n![POPULARITY_BOOK_RECOMENDATION](https://github.com/ANSHPG/Popularity-Based-Book-Recommendation-System/assets/132222062/ce06315d-3de8-4acd-b1bc-1cc3ae8242e8)\n\nThe books dataset contained columns such as book title, author, year of publication, image, and ISBN, while the ratings dataset included columns like ISBN, user ID, and user rating. The initial steps involved data preprocessing, including handling null and duplicate values from both CSV files.\n\nNext, I merged both datasets based on the ISBN column. However, before merging, I removed the user ID column from the ratings dataset. Then, I used group-by operations to determine the total number of reviews each book received based on the book title, as some books had multiple ISBNs.\n\nWith the total user ratings calculated, I proceeded to find the mean ratings each book received. Subsequently, I filtered out books that received 250 or more user ratings and arranged them in descending order based on mean ratings.\n\nTo showcase the top 50 recommended books, I utilized the head(50) function and displayed them. Through exploratory data analysis (EDA), I discovered that most people gave ratings of 1.0 to books, followed by a spike at 4.5.\n\nFor reference, I drew inspiration from IMDb's movie recommendation system, incorporating a similar formula, which I explained and implemented in the notebook. The entire project, along with explanations, is documented in a mind-map format and is publicly available for use.\n\nThe repository containing the project is owned by Anshuman Pattnaik and is accessible to all for public use.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fanshpg%2Fpopularity-based-book-recommendation-system","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fanshpg%2Fpopularity-based-book-recommendation-system","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fanshpg%2Fpopularity-based-book-recommendation-system/lists"}