https://github.com/tupol/spark-utils-demos
Demos for the tupol/spark-utils project together with a storyline
- Host: GitHub
- URL: https://github.com/tupol/spark-utils-demos
- Owner: tupol
- License: mit
- Created: 2019-01-25T16:39:13.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2019-08-31T06:20:42.000Z (over 5 years ago)
- Last Synced: 2024-11-16T13:41:42.430Z (2 months ago)
- Topics: configuration, demo, framework, scala, spark
- Language: Scala
- Size: 29.3 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# Creating Simple Configurable Spark Applications in Scala
## Motivation
All of us can recall the first days of picking up a new technology and running the first “*Hello, World!*” or “*Count words*”
applications that get us started with a new language or a platform.

Up to a certain point of exploring and creating demos or prototypes everything is nice, but when it comes to creating
*production ready* configurable applications we all have a hard time and actually start thinking about the operational
use of our applications and how they will be deployed to a production system. This is the moment when a few lines of
beautiful code tend to get cluttered by a lot of configuration, wiring and setup.

With Apache Spark it gets even more complicated, because we also need to set up the Spark context and the input
sources and outputs.

It would be really nice to have a simple framework that keeps our Spark code clean and uncluttered.

## Audience
Developers getting started with [Apache Spark](https://spark.apache.org/) application development
in [Scala](https://www.scala-lang.org/).

Some basic Scala and Apache Spark knowledge is crucial to make sense of this presentation.
This article is not meant as a Scala or an Apache Spark tutorial.
## spark-utils
[`spark-utils`](https://github.com/tupol/spark-utils) is a simple framework, developed
over a few years of writing Spark applications, that has so far helped me start new projects and create
applications quickly and relatively easily.

The main ideas behind building a new Spark application are logic, configuration and execution.

[Full article is available as a Wiki page here.](https://github.com/tupol/spark-utils-demos/wiki)
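
To make the split between logic, configuration and execution a bit more concrete, here is a minimal sketch in plain Scala. It deliberately avoids Spark and does not use the `spark-utils` API; every name in it (`WordCountContext`, `WordCount`, `WordCountApp`, the `minLength` setting) is a hypothetical one chosen for illustration. In a real Spark application the logic would typically also receive a `SparkSession`, and the settings would come from a configuration file rather than a `Map`.

```scala
import scala.util.{Failure, Success, Try}

// Hypothetical names throughout; this is NOT the spark-utils API.

// Configuration: a typed context built and validated from raw key/value settings.
final case class WordCountContext(minLength: Int)

object WordCountContext {
  def fromSettings(settings: Map[String, String]): Try[WordCountContext] =
    Try(settings.getOrElse("minLength", "1").toInt).flatMap { n =>
      if (n >= 1) Success(WordCountContext(n))
      else Failure(new IllegalArgumentException(s"minLength must be >= 1, got $n"))
    }
}

// Logic: a pure function that does not know where its settings or input come from.
object WordCount {
  def run(lines: Seq[String], context: WordCountContext): Map[String, Int] =
    lines
      .flatMap(_.toLowerCase.split("\\W+"))
      .filter(_.length >= context.minLength)
      .groupBy(identity)
      .map { case (word, occurrences) => word -> occurrences.size }
}

// Execution: wires configuration and logic together and reports failures.
object WordCountApp {
  def execute(settings: Map[String, String], lines: Seq[String]): Try[Map[String, Int]] =
    WordCountContext.fromSettings(settings).map(context => WordCount.run(lines, context))

  def main(args: Array[String]): Unit = {
    val settings = Map("minLength" -> "3")
    val lines    = Seq("Hello, World!", "Hello Spark")
    execute(settings, lines) match {
      case Success(counts) => counts.toSeq.sortBy(_._1).foreach(println)
      case Failure(error)  => Console.err.println(s"Job failed: ${error.getMessage}")
    }
  }
}
```

Keeping the logic free of configuration parsing means it can be tested with a hand-built context, while invalid settings surface as a `Failure` before any work starts.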