An open API service indexing awesome lists of open source software.

https://github.com/pdoup/ddp-task-docker

Distributed Data Processing - Spring '22
https://github.com/pdoup/ddp-task-docker

apache-ignite bloom-filter docker joins redis

Last synced: 2 months ago
JSON representation

Distributed Data Processing - Spring '22

Awesome Lists containing this project

README

          

# ddp-task

#### Implement 3 join algorithms on 2 key-value store databases ([Redis](https://redis.io/ "Redis Homepage") & [Apache Ignite](https://ignite.apache.org/ "Ignite's Homepage")) in a containerized enviroment
1. Double-Pipelined Hash Join
2. Semi Join
3. Two-Way Join Processing Using Bloom Filter Intersection
---
### TODO

- [X] Implement pipelined hash-join
- [X] Add log level for verbose printing
- [X] Implement semi-joins
- [X] Implement Bloom filter joins
- [X] Incorporate my implementation of Bloom Filter
- [X] Add functionality to specify number of characters to store as key
- [X] Added comments