An open API service indexing awesome lists of open source software.

https://github.com/newfront/docker-spark-base

Creates a customizable base image for working with Apache Spark
https://github.com/newfront/docker-spark-base

Last synced: 4 months ago
JSON representation

Creates a customizable base image for working with Apache Spark

Awesome Lists containing this project

README

          

# docker-spark-base
Creates a customizable base image for working with Apache Spark

## Build Phases
### Download Source Phase
- alpine linux - no-cache
* step 1. Downloads the tar from the official tagged spark release in github
* step 2. untar and clean up

### Maven Building and Packaging Phase
- mvn + jdk11
* step 1. mvn package phase (takes a while, but compiles all the spark packages cleanly)

### Final Image Phase
- openjdk:11-jre-slim
This is the final Spark image. It uses the debian slim buster linux image.

~~~
export SPARK_VERSION=3.2.0
export SPARK_USER=500
docker build . \
--build-arg spark_version=${SPARK_VERSION} \
--build-arg spark_user=${SPARK_USER} \
--tag `whoami`/docker-spark-base:${SPARK_VERSION}
~~~