Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


https://github.com/mrpowers/data-scrapbook

A collection of images and captions to explain core data concepts




# data-scrapbook

This repo contains a collection of images that explain core data computing concepts and tools, such as Apache Spark, file formats, Delta Lake, and associated libraries.

Images with descriptive captions are a quick way to learn about data computing. See these pages to learn more:

* Spark
* Delta Lake

PySpark:

* quinn - TODO
* chispa
* ceja - TODO
* mack
* farsante
* unicron - TODO

Scala Spark:

* spark-sbt.g8 - TODO
* bebe - TODO
* spark-daria
* spark-fast-tests

Pandas:

* beavis - TODO