Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with bigdataprocessing

A curated list of projects in awesome lists tagged with bigdataprocessing .

https://github.com/divithraju/divith-raju-immigration-data-engineering

A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)

apachespark bigdata bigdataprocessing bigdataproject capstone-project datacleaning dataengineering datalake datamodeling datapipeline dataprocessing dataschema dataset datawherehouse pandas sql

Last synced: 31 Dec 2024

https://github.com/aksh-patel1/big-data-processing_parallelize-k-means

Implemented the parallelized version of k-means clustering algorithm in Spark and assess its efficiency using a real-world dataset.

apache-spark aws-s3 bigdata bigdataprocessing docker k-means-clustering parallel-kmeans parallel-kmeans-clustering pyspark python

Last synced: 03 Jan 2025