https://github.com/zablon-oigo/nba-data-lake
This project automates the creation of a data lake for NBA analytics using AWS services
https://github.com/zablon-oigo/nba-data-lake
aws-athena aws-glue aws-iam aws-s3 boto3-script github-actions python3
Last synced: 7 months ago
JSON representation
This project automates the creation of a data lake for NBA analytics using AWS services
- Host: GitHub
- URL: https://github.com/zablon-oigo/nba-data-lake
- Owner: zablon-oigo
- Created: 2025-01-13T03:37:37.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2025-01-14T12:14:27.000Z (9 months ago)
- Last Synced: 2025-01-14T12:39:31.299Z (9 months ago)
- Topics: aws-athena, aws-glue, aws-iam, aws-s3, boto3-script, github-actions, python3
- Language: Python
- Homepage:
- Size: 7.81 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README

## NBA Data Lake
This project automates the creation of a data lake for NBA API using AWS services. Amazon S3 bucket stores both raw and processed data. Sample NBA data in JSON format is uploaded to the S3 bucket for analysis. AWS Glue database defines an external table, enabling seamless querying of the data through Amazon Athena.
This setup provides an efficient, fast, and cost-effective solution for performing analytics on the NBA dataset stored in the S3 bucket.#### Technologies
- Cloud Provider: AWS
- Core Services: S3, AWS Glue, Amazon Athena
- External API: NBA Game API (SportsData.io)
- Programming Language: Python 3.x
- IAM Security#### Run Locally
Clone the project
```bash
git clone https://github.com/zablon-oigo/nba-data-lake.git
```Go to the project directory
```bash
cd nba-data-lake
```Install dependencies
```bash
pip install -r requirements.txt
```Deploy resources
```bash
python main.py
```#### Output
