https://github.com/malekmfs/mst-project
https://github.com/malekmfs/mst-project
Last synced: about 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/malekmfs/mst-project
- Owner: MalekMFS
- Created: 2020-09-23T07:16:36.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2021-10-18T07:44:10.000Z (over 3 years ago)
- Last Synced: 2025-02-01T14:11:08.461Z (4 months ago)
- Language: Scala
- Size: 64.5 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
This repo contains implementations for my Master's project, "Comparing and Improving The Minimum Spanning Tree Algorithms in MapReduce" (In Persian), at the CE Algorithms Lab of the Sharif University of Technology.
# Abstract
In recent decades, we have faced the enormous growth of data and graph volumes. This requires modern ways of computation and storage systems and algorithms.
MapReduce is a known way of processing Big Data in a Parallel and primarily Distributed setting. Theoretical models (e.g., Massively Parallel Computation) for Algorithms using this paradigm commonly evaluate the number of rounds and needed communication.
We study the Minimum Spanning Tree (MST) as a fundamental graph problem. This problem in MapReduce is harder for sparse graphs. We introduce an algorithm that performs well comparing previous studies, especially for sparse graphs.We present an empirical study by implementing some algorithms using MapReduce, Apache Spark, and Scala; and experimenting in a distributed setting that we configured to compare them and find important input parameters. In this experiment, we use various graphs with up to a hundred million edges/vertices. Our algorithm showed improvements in the number of rounds and running time for most of the experiments.