Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with bigdataprocessing
A curated list of projects in awesome lists tagged with bigdataprocessing .
https://github.com/divithraju/divith-raju-immigration-data-engineering
A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)
apachespark bigdata bigdataprocessing bigdataproject capstone-project datacleaning dataengineering datalake datamodeling datapipeline dataprocessing dataschema dataset datawherehouse pandas sql
Last synced: 31 Dec 2024
https://github.com/aksh-patel1/big-data-processing_parallelize-k-means
Implemented the parallelized version of k-means clustering algorithm in Spark and assess its efficiency using a real-world dataset.
apache-spark aws-s3 bigdata bigdataprocessing docker k-means-clustering parallel-kmeans parallel-kmeans-clustering pyspark python
Last synced: 03 Jan 2025