An open API service indexing awesome lists of open source software.

https://github.com/vigneshss-07/bigdata_technologies

This repo contains all technical knowledge and implementation of big data technologies.
https://github.com/vigneshss-07/bigdata_technologies

big-data hadoop hadoop-hdfs hbase hive hive-metastore kafka mapreduce-python pyspark spark sparksql

Last synced: 4 months ago
JSON representation

This repo contains all technical knowledge and implementation of big data technologies.

Awesome Lists containing this project

README

        

### Bigdata_Technologies

Started with Hadoop explaining HDFS and its evolution.

# Difference between Apache Hadoop Vs Apache Spark

* https://www.ibm.com/cloud/blog/hadoop-vs-spark
* https://towardsdatascience.com/big-data-analytics-apache-spark-vs-apache-hadoop-7cb77a7a9424

***Big Data ecosystem***

1. https://github.com/dgadiraju/itversity-books/tree/master/Data%20Engineering%20Bootcamp/40%20Big%20Data%20ecosystem%20-%20Overview