https://github.com/apssouza22/big-data-pipeline-lambda-arch
A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets(Lambda Architecture)
https://github.com/apssouza22/big-data-pipeline-lambda-arch
bigdata cassandra-database hdfs kafka lambda-architecture spark
Last synced: 18 days ago
JSON representation
A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets(Lambda Architecture)
- Host: GitHub
- URL: https://github.com/apssouza22/big-data-pipeline-lambda-arch
- Owner: apssouza22
- License: apache-2.0
- Created: 2018-11-27T00:40:54.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2025-09-07T07:32:27.000Z (5 months ago)
- Last Synced: 2025-09-07T09:20:49.873Z (5 months ago)
- Topics: bigdata, cassandra-database, hdfs, kafka, lambda-architecture, spark
- Language: Java
- Homepage:
- Size: 1.84 MB
- Stars: 180
- Watchers: 9
- Forks: 83
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE