Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jimthompson5802/datascience_containers
Personal docker images for various data science software stacks
https://github.com/jimthompson5802/datascience_containers
data-science docker h2oai jupyter-notebook kubernetes python rstudio-servers spark
Last synced: 27 days ago
JSON representation
Personal docker images for various data science software stacks
- Host: GitHub
- URL: https://github.com/jimthompson5802/datascience_containers
- Owner: jimthompson5802
- License: mit
- Created: 2018-10-23T02:57:20.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2020-02-25T01:04:12.000Z (almost 5 years ago)
- Last Synced: 2024-12-21T12:44:15.300Z (about 1 month ago)
- Topics: data-science, docker, h2oai, jupyter-notebook, kubernetes, python, rstudio-servers, spark
- Language: Shell
- Homepage:
- Size: 627 KB
- Stars: 1
- Watchers: 3
- Forks: 1
- Open Issues: 16
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Data Science Software Stack Docker Prototype
Docker images providing the following data science software stacks for peronsal use:
* Anaconda Python with Jupyter Notebook
* Rstudio Server
* Apache Spark (Stand-alone)
* Tensorflow (cpu and gpu versions)
* h2o
* xgboost
* lightgbmSee [wiki](https://github.com/jimthompson5802/datascience_containers/wiki) for additional information.
Built docker images can be found at [dockerhub.com](https://hub.docker.com/u/dsimages/)
This work inspired by these talks:
* [Docker for Data Scientists](https://www.youtube.com/watch?v=GOW6yQpxOIg)
* [Capital One Analytic Garage on Docker](https://www.youtube.com/watch?v=ogDa2-A1y9U)## System Requirements:
* MacOS
* [Docker for Mac](https://store.docker.com/editions/community/docker-ce-desktop-mac) 18.06.1 ce (requires kubernetes enabled)
* Chrome Browser## Directories:
* `bin` scripts to start and stop containers
* `h2oai` h2o Flow server
* `jpynb` Anaconda Python with additional packages: h2o, xgboost, lightgbm
* `pyspnb` Ancaonda Python with stand-alone Spark
* `rstudio` Rstudio Server (Community Edition)
* `tfcpu` Tensorflow with Python 3 and Jupyter Notebook (cpu)
* `tfgpu` Tensorflow with Python 3 and Jupyter Notebook (gpu)