https://github.com/pdoup/ddp-task-docker
Distributed Data Processing - Spring '22
https://github.com/pdoup/ddp-task-docker
apache-ignite bloom-filter docker joins redis
Last synced: 2 months ago
JSON representation
Distributed Data Processing - Spring '22
- Host: GitHub
- URL: https://github.com/pdoup/ddp-task-docker
- Owner: pdoup
- Created: 2022-05-11T09:51:40.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-07-05T17:05:28.000Z (almost 4 years ago)
- Last Synced: 2025-06-30T02:05:00.408Z (12 months ago)
- Topics: apache-ignite, bloom-filter, docker, joins, redis
- Language: Java
- Homepage:
- Size: 107 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# ddp-task
#### Implement 3 join algorithms on 2 key-value store databases ([Redis](https://redis.io/ "Redis Homepage") & [Apache Ignite](https://ignite.apache.org/ "Ignite's Homepage")) in a containerized enviroment
1. Double-Pipelined Hash Join
2. Semi Join
3. Two-Way Join Processing Using Bloom Filter Intersection
---
### TODO
- [X] Implement pipelined hash-join
- [X] Add log level for verbose printing
- [X] Implement semi-joins
- [X] Implement Bloom filter joins
- [X] Incorporate my implementation of Bloom Filter
- [X] Add functionality to specify number of characters to store as key
- [X] Added comments