https://github.com/neuw84/datahack-spark
Vagrant box for the Spark Datahack module
- Host: GitHub
- URL: https://github.com/neuw84/datahack-spark
- Owner: Neuw84
- License: apache-2.0
- Created: 2017-12-13T08:59:04.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2017-12-15T07:31:54.000Z (over 8 years ago)
- Last Synced: 2025-06-18T11:52:13.477Z (about 1 year ago)
- Topics: ansible, infrastructure-as-code, spark, vagrant, zeppelin
- Language: Shell
- Homepage: http://www.datahack.es
- Size: 49.8 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# datahack-spark
Vagrant box for the Spark [Datahack](https://www.datahack.es) module. This box uses Ansible as a local provisioner.
Contains:
---
* Apache Zeppelin 0.7.3
* Apache Spark 2.2.1
* Python (scikit-learn, pandas, NumPy, Matplotlib, etc.)
Please use this with at least **8 GB** of RAM and **2** processors on the host machine.
The default user/password for Apache Zeppelin is ```admin/admin```.
## Requisites
* VirtualBox 5.0+
* Vagrant 2.0+
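
The minimum versions listed above can be checked from a terminal before starting. The following is only a rough sketch and not part of this repository: the helper names `version_ge` and `check_tool` are hypothetical, and it assumes a Bash-compatible shell with a `sort` that supports `-V` (GNU coreutils, Git Bash on Windows).

```shell
#!/usr/bin/env bash
# Sketch: compare the installed Vagrant and VirtualBox versions
# against the minimums listed in this README.

# version_ge A B: succeeds when version A >= version B.
# Assumption: `sort -V` (version sort) is available.
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# check_tool NAME FOUND MINIMUM: prints a one-line status for a tool.
check_tool() {
  local name="$1" found="$2" minimum="$3"
  if [ -z "$found" ]; then
    echo "$name: not found (need $minimum+)"
  elif version_ge "$found" "$minimum"; then
    echo "$name: $found OK"
  else
    echo "$name: $found is too old (need $minimum+)"
  fi
}

# `vagrant --version` prints a line such as "Vagrant 2.0.1"; keep the second field.
vagrant_version="$(vagrant --version 2>/dev/null | awk '{print $2}')"
# `VBoxManage --version` prints a string such as "5.2.4r119785"; drop the revision suffix.
vbox_version="$(VBoxManage --version 2>/dev/null | sed 's/r.*//')"

check_tool "Vagrant" "$vagrant_version" "2.0"
check_tool "VirtualBox" "$vbox_version" "5.0"
```

If either tool is missing from the `PATH`, the script reports it as not found instead of failing.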
## Usage
* Install Vagrant. Vagrant will install VirtualBox for you if it isn't already installed.
* ```git clone``` the repository (**Windows users, before cloning:** ```git config --global core.autocrlf input```) or download the [latest](https://github.com/Neuw84/datahack-spark/releases) release.
* Run ```vagrant up```, then wait a few minutes while the box is provisioned.
* Open: