https://github.com/minhduc29/leetcode-contest-analytics
A data engineering project to extract, transform, and load LeetCode contest ranking and contest problems data
https://github.com/minhduc29/leetcode-contest-analytics
amazon-redshift amazon-s3 analytics apache-airflow aws aws-glue dag data-engineering data-pipeline elt etl leetcode leetcode-contest pandas
Last synced: about 1 month ago
JSON representation
A data engineering project to extract, transform, and load LeetCode contest ranking and contest problems data
- Host: GitHub
- URL: https://github.com/minhduc29/leetcode-contest-analytics
- Owner: minhduc29
- Created: 2025-01-12T23:55:22.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-01-20T00:59:21.000Z (3 months ago)
- Last Synced: 2025-01-27T05:51:36.699Z (3 months ago)
- Topics: amazon-redshift, amazon-s3, analytics, apache-airflow, aws, aws-glue, dag, data-engineering, data-pipeline, elt, etl, leetcode, leetcode-contest, pandas
- Language: Python
- Homepage:
- Size: 5.66 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# LeetCode Contest Analytics
## Overview
- A data engineering project to extract, transform, and load LeetCode contest ranking data and contest problems data.
- This project was created to analyze data from past LeetCode contests and top contestants, aiming to set the basis
for efficient contest practice and further data analytics from each user profile.
- Data were collected from the top **50,000** users of LeetCode global contest ranking and **2334** problems of
**585** contests.
- Some analytics with visualizations are provided in `analytics/analytics.md`.
- The collected data are stored in the `data/` folder.## Technology Stack
- Orchestration: **Apache Airflow**
- Storage: **Amazon S3**
- ETL jobs: **Pandas**, **AWS Glue**
- Data warehouse & Analytics: **Amazon Redshift**
- For additional libraries, please consult `requirements.txt`.## Potential Ideas
- Machine Learning/Deep Learning models.
- User profile analysis and ranking prediction.
- Further analytics and statistics from other LeetCode data.
- Data pipelines and analytics for other platforms.