Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/airscholar/realtime-voting-data-engineering
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
apache-kafka apache-spark bigdata postgresql realtime-analytics realtime-election realtime-voting-system streamlit-dashboard
Last synced: 2 months ago
- Host: GitHub
- URL: https://github.com/airscholar/realtime-voting-data-engineering
- Owner: airscholar
- Created: 2023-12-06T23:39:27.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-11T08:01:25.000Z (about 1 year ago)
- Last Synced: 2024-04-18T02:57:15.078Z (9 months ago)
- Topics: apache-kafka, apache-spark, bigdata, postgresql, realtime-analytics, realtime-election, realtime-voting-system, streamlit-dashboard
- Language: Python
- Homepage: https://youtu.be/X-JnC9daQxE
- Size: 2.12 MB
- Stars: 14
- Watchers: 3
- Forks: 11
- Open Issues: 1
Metadata Files:
- Readme: README.md
README
Realtime Election Voting System
===============================

This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit, and it uses Docker Compose to spin up the required services in Docker containers.

## System Architecture
![system_architecture.jpg](images%2Fsystem_architecture.jpg)

## System Flow
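As a rough, self-contained illustration of this flow, the sketch below mimics the three processing stages in plain Python, with in-memory queues standing in for the Kafka topics `voters_topic` and `votes_topic`. It is not the repository's code: the candidate data, the record fields and the function names are all assumptions made up for the example, and the real system uses Kafka, Spark Streaming and Postgres instead.

```python
import random
from collections import Counter, deque

# In-memory stand-ins for the Kafka topics named in this README.
voters_topic = deque()
votes_topic = deque()

# Hypothetical sample data; the real system keeps candidates in Postgres.
CANDIDATES = {"c1": "Party A", "c2": "Party B", "c3": "Party C"}


def produce_voters(n):
    """Stage 1 (role of main.py): publish n voter records."""
    for i in range(n):
        voters_topic.append({"voter_id": f"v{i}"})


def cast_votes(rng):
    """Stage 2 (role of voting.py): turn each voter into exactly one vote."""
    candidate_ids = sorted(CANDIDATES)
    while voters_topic:
        voter = voters_topic.popleft()
        votes_topic.append({**voter, "candidate_id": rng.choice(candidate_ids)})


def aggregate_votes():
    """Stage 3 (role of spark-streaming.py): enrich each vote with its party and count."""
    per_candidate = Counter()
    per_party = Counter()
    while votes_topic:
        vote = votes_topic.popleft()
        per_candidate[vote["candidate_id"]] += 1
        per_party[CANDIDATES[vote["candidate_id"]]] += 1
    return per_candidate, per_party


def run_pipeline(n_voters, seed=0):
    """Run all three stages once and return the aggregated counts."""
    rng = random.Random(seed)
    produce_voters(n_voters)
    cast_votes(rng)
    return aggregate_votes()


if __name__ == "__main__":
    by_candidate, by_party = run_pipeline(1000)
    print(dict(by_candidate))
    print(dict(by_party))
```

Running the module prints per-candidate and per-party counts for 1000 simulated voters; a fixed `seed` keeps the run repeatable.
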
![system_flow.jpg](images%2Fsystem_flow.jpg)

## System Components
- **main.py**: The main Python script. It creates the required tables in Postgres (`candidates`, `voters` and `votes`), creates the Kafka topic, and creates a copy of the `votes` table in the Kafka topic. It also contains the logic to consume the votes from the Kafka topic and produce data to `voters_topic` on Kafka.
- **voting.py**: Consumes the voter information from the Kafka topic `voters_topic`, generates voting data and produces it to `votes_topic` on Kafka.
- **spark-streaming.py**: Consumes the votes from the Kafka topic `votes_topic`, enriches them with data from Postgres, aggregates the votes and produces the results to specific topics on Kafka.
- **streamlit-app.py**: Consumes the aggregated voting data from the Kafka topic as well as from Postgres and displays the voting data in realtime using Streamlit.

## Setting up the System
This Docker Compose file allows you to easily spin up Zookeeper, Kafka and Postgres applications in Docker containers.

### Prerequisites
- Python 3.9 or above installed on your machine
- Docker Compose installed on your machine
- Docker installed on your machine

### Steps to Run
1. Clone this repository.
2. Navigate to the root containing the Docker Compose file.
3. Run the following command:

```bash
docker-compose up -d
```
This command will start Zookeeper, Kafka and Postgres containers in detached mode (`-d` flag). Kafka will be accessible at `localhost:9092` and Postgres at `localhost:5432`.

##### Additional Configuration
If you need to modify Zookeeper configurations or change the exposed port, you can update the `docker-compose.yml` file according to your requirements.

### Running the App
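Before running the scripts, it can help to confirm that the containers are accepting connections. The following is a minimal sketch using only the Python standard library; the helper names `is_port_open` and `check_services` are hypothetical, and the ports are the defaults stated above (`localhost:9092` for Kafka, `localhost:5432` for Postgres). A successful TCP connection only shows that the port is listening, not that the service has finished starting.

```python
import socket


def is_port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False


# Ports taken from this README; adjust them if docker-compose.yml was changed.
SERVICES = {"Kafka": ("localhost", 9092), "Postgres": ("localhost", 5432)}


def check_services(services=SERVICES):
    """Map each service name to whether its port is currently reachable."""
    return {name: is_port_open(host, port) for name, (host, port) in services.items()}


if __name__ == "__main__":
    for name, up in check_services().items():
        print(f"{name}: {'reachable' if up else 'not reachable'}")
```

If a service is reported as not reachable, `docker-compose ps` shows whether its container is running.
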
1. Install the required Python packages using the following command:

```bash
pip install -r requirements.txt
```

2. Create the required tables in Postgres and generate voter information on the Kafka topic:

```bash
python main.py
```

3. Consume the voter information from the Kafka topic, generate voting data and produce it to the Kafka topic:

```bash
python voting.py
```

4. Consume the voting data from the Kafka topic, enrich it with data from Postgres and produce the results to specific topics on Kafka:

```bash
python spark-streaming.py
```

5. Run the Streamlit app:

```bash
streamlit run streamlit-app.py
```

## Screenshots
### Candidates and Parties information
![candidates_and_party.png](images/candidates_and_party.png)
### Voters
![voters.png](images%2Fvoters.png)

### Voting
![voting.png](images%2Fvoting.png)

### Dashboard
![dashboard_image.png](images%2Fdashboard_image.png)

## Video
[![Realtime Voting System Data Engineering](https://img.youtube.com/vi/X-JnC9daQxE/0.jpg)](https://youtu.be/X-JnC9daQxE)