{"id":19235980,"url":"https://github.com/ankitrai259/movielens-dataset-analysis-sql","last_synced_at":"2026-06-23T09:31:46.048Z","repository":{"id":259154694,"uuid":"874818371","full_name":"AnkitRai259/MovieLens-Dataset-Analysis-SQL","owner":"AnkitRai259","description":"Analysis of MovieLens Dataset Using SQL","archived":false,"fork":false,"pushed_at":"2024-10-18T19:37:14.000Z","size":1210,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-23T13:03:32.457Z","etag":null,"topics":["postgresql","sql"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AnkitRai259.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-10-18T14:14:06.000Z","updated_at":"2024-10-18T20:13:42.000Z","dependencies_parsed_at":"2024-10-23T04:27:12.300Z","dependency_job_id":null,"html_url":"https://github.com/AnkitRai259/MovieLens-Dataset-Analysis-SQL","commit_stats":null,"previous_names":["ankitrai259/movielens-dataset-analysis-sql"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/AnkitRai259/MovieLens-Dataset-Analysis-SQL","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkitRai259%2FMovieLens-Dataset-Analysis-SQL","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkitRai259%2FMovieLens-Dataset-Analysis-SQL/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkitRai259%2FMovieLens-Dataset-Analysis-SQL/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkitRai259%2FMovieLens-Dataset-Analysis-SQL/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AnkitRai259","download_url":"https://codeload.github.com/AnkitRai259/MovieLens-Dataset-Analysis-SQL/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkitRai259%2FMovieLens-Dataset-Analysis-SQL/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":34684673,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-23T02:00:07.161Z","response_time":65,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["postgresql","sql"],"created_at":"2024-11-09T16:18:47.079Z","updated_at":"2026-06-23T09:31:46.031Z","avatar_url":"https://github.com/AnkitRai259.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# MovieLens Dataset for SQL Project\n\n# Summary\nThis project utilizes the MovieLens (ml-32m) dataset, which comprises over 32 million ratings and 2 million tag applications across 87,585 movies. The data was generated by 200,948 users between January 9, 1995, and October 12, 2023, and made publicly available on October 13, 2023. The dataset includes the following files:\n\n* links.csv\n* movies.csv\n* ratings.csv\n* tags.csv\n\n# Dataset Overview\n\n* Ratings: 32,000,204 ratings on a 5-star scale (with half-star increments).\n* Tags: 2,000,072 user-generated tags.\n* Users: All included users have rated at least 20 movies; no demographic data is provided.\nFor further details and access to the dataset, visit https://grouplens.org/.\n\n# Project Structure\nThis project consists of SQL scripts that facilitate data analysis on the MovieLens dataset. The primary components include:\n\n* SQL Queries: A collection of SQL scripts to extract insights from the dataset, such as average ratings, movie popularity, and tag usage.\n  \n# Getting Started\n1. A SQL database (e.g., PostgreSQL, MySQL) set up on your machine.\n2. Create a new database and tables in your machine.\n3. Import Dataset: Use your SQL database tools to import the CSV files into corresponding tables. Ensure the following table structure:\n    * links: Contains columns movieId, imdbId, tmdbId\n    * movies: Contains columns movieId, title, genres\n    * ratings: Contains columns userId, movieId, rating, timestamp\n    * tags: Contains columns userId, movieId, tag, timestamp\n4. Run SQL Queries: Execute the SQL scripts in the queries/ directory using your SQL client to analyze the dataset.\n        \n\nQueries are uploaded in pdf format : \"[Analysis of Movie Lens Dataset.pdf](https://github.com/user-attachments/files/17439980/Analysis.of.Movie.Lens.Dataset.pdf)\"\n\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fankitrai259%2Fmovielens-dataset-analysis-sql","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fankitrai259%2Fmovielens-dataset-analysis-sql","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fankitrai259%2Fmovielens-dataset-analysis-sql/lists"}