https://github.com/aman-dutta/football-kafka-data-engineering-project
An End-To-End Data Pipeline for keywords related to Football
https://github.com/aman-dutta/football-kafka-data-engineering-project
apache-kafka athena aws glue python s3-bucket scraper
Last synced: 3 months ago
JSON representation
An End-To-End Data Pipeline for keywords related to Football
- Host: GitHub
- URL: https://github.com/aman-dutta/football-kafka-data-engineering-project
- Owner: aman-dutta
- Created: 2023-03-01T10:15:29.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-03-01T13:09:37.000Z (about 2 years ago)
- Last Synced: 2025-01-16T02:24:29.223Z (4 months ago)
- Topics: apache-kafka, athena, aws, glue, python, s3-bucket, scraper
- Language: Python
- Homepage:
- Size: 160 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Football Data Kafka - Data Engineering Project
## Introduction
In this project, I executed an End-To-End Data Pipeline of football related keywords using Kafka.## Architecture
## Technology Used
- Python
- Snscrape
- Apache Kafka
- S3 Bucket
- Amazon Web Service(AWS)
- Glue Crawler
- EC2
- Athena## Commands Used:
```
py -3 main.py -----> The sncrape only runs with Python version 3 and above.
bin/zookeeper-server-start.sh config/zookeeper.properties ----> Starting Zookeeper
Do a "sudo nano config/server.properties" - change ADVERTISED_LISTENERS to public ip of the EC2 instance -----> For connecting with the EC2 instance
bin/kafka-topics.sh --create --topic footballTopic --bootstrap-server {Public IP of your EC2 Instance:9092} --replication-factor 1 --partitions 1
bin/kafka-console-producer.sh --topic footballTopic --bootstrap-server {Public IP of your EC2 Instance:9092} -----> Producer
bin/kafka-console-consumer.sh --topic footballTopic --bootstrap-server {Public IP of your EC2 Instance:9092} -----> Consumer
```### Footnote
Special mention to [Darshil Parmar](https://www.youtube.com/results?search_query=darshil+parmar)
Please reach out to me in case of any queries. I would be happy to help! :smiley: