https://github.com/eyadsibai/machine-learning-docker-image
Data Science/Machine Learning Docker Image for CPU
- Host: GitHub
- URL: https://github.com/eyadsibai/machine-learning-docker-image
- Owner: eyadsibai
- Created: 2017-01-06T14:26:49.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2019-12-13T14:24:32.000Z (over 6 years ago)
- Last Synced: 2025-03-30T14:51:10.281Z (12 months ago)
- Topics: data-science, docker, docker-image, google-cloud, machine-learning
- Language: Dockerfile
- Homepage:
- Size: 146 KB
- Stars: 13
- Watchers: 5
- Forks: 3
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
Machine Learning/Data Science Platform (Docker Image)
=====================
Requirements
------------
- docker
- docker-machine (to deploy to Google Cloud)
How to use (in Google Cloud)
----------------------------
```bash
docker-machine create docker-dsp -d google \
  --google-project={project_id} \
  --google-machine-type n1-highmem-8 \
  --google-disk-size "10" \
  --google-disk-type "pd-standard" \
  --google-preemptible \
  --google-machine-image ubuntu-os-cloud/global/images/family/ubuntu-1404-lts \
  --google-scopes "https://www.googleapis.com/auth/cloud-platform"
```
```bash
eval $(docker-machine env docker-dsp)
```
```bash
# Map host port 8080 to the container's Jupyter port 8888
docker run -d -p 8080:8888 -e "PROJECT_ID={project_id}" eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''
```
- get the ip address of the machine
```bash
docker-machine ip docker-dsp
```
- open the site at http://{ip}:8080, where {ip} is the address printed by `docker-machine ip docker-dsp`
- to stop the machine (and start it again later)
```bash
# Stop the instance when it is not in use
docker-machine stop docker-dsp
# Start it again when needed
docker-machine start docker-dsp
```
- to delete the instance
```bash
docker-machine rm docker-dsp
```
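The address used in the "open the site" step can be composed in the shell. This is a minimal sketch: `jupyter_url` is a hypothetical helper name, and the IP passed at the end is a stand-in from the documentation range, not a real machine.

```bash
# Hypothetical helper: compose the Jupyter URL from a machine IP and a host port.
# The port defaults to 8080, the one used in the "open the site" step.
jupyter_url() {
  printf 'http://%s:%s\n' "$1" "${2:-8080}"
}

# With docker-machine available, the IP would come from:
#   jupyter_url "$(docker-machine ip docker-dsp)"
# Stand-in IP so the snippet runs without a machine:
jupyter_url "203.0.113.10"
```

Passing a second argument overrides the port, for example `jupyter_url "$ip" 8888`.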
### Note
While the machine is stopped, you pay only for the disk attached to it. On Google Cloud, a 10 GB disk costs roughly $0.40/month.
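The monthly figure is simple arithmetic: disk size times the price per GB per month. The sketch below assumes a rate of $0.04 per GB-month for a standard persistent disk. That rate is an assumption consistent with the estimate above, not a number from this README, so check current pricing for your region. `monthly_disk_cost` is a hypothetical helper name.

```bash
# Hypothetical helper: monthly cost of a persistent disk in USD.
# $1 = disk size in GB, $2 = price in USD per GB per month (assumed rate).
monthly_disk_cost() {
  awk -v gb="$1" -v rate="$2" 'BEGIN { printf "%.2f\n", gb * rate }'
}

# 10 GB at an assumed 0.04 USD per GB-month
monthly_disk_cost 10 0.04
```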
How to use (Locally)
--------------------
```bash
# Replace {local_dir} with the absolute path of the host directory to mount
docker run -d -p 8888:8888 -v {local_dir}:/home/jovyan/work eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''
```
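In `-v host:container`, the part before the colon is a host directory that appears inside the container at the path after the colon. The sketch below only prints the command for a given host directory (a dry run), so it can be inspected before being run. `dsp_run_command` is a hypothetical helper, and defaulting to the current directory is an assumption rather than something the README specifies.

```bash
# Hypothetical helper: print (not run) the local docker command for a host directory.
# $1 = host directory to mount at /home/jovyan/work; defaults to the current directory.
dsp_run_command() {
  host_dir="${1:-$PWD}"
  printf "docker run -d -p 8888:8888 -v \"%s\":/home/jovyan/work eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''\n" "$host_dir"
}

# Example with a stand-in directory
dsp_run_command /tmp/notebooks
```

Once the container is running, Jupyter Lab should be reachable at http://localhost:8888, and files saved under `work` land in the mounted host directory.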
TODO
----
- access local files (whether running locally or on a Google Cloud machine)
Gear included
-------------
- Python 3.6
- AWS packages and tools
- data analysis and data manipulation/data storage: pandas, fastparquet
- distributed/out-of-core/data workflow: numba, luigi, airflow
- multipurpose machine learning libraries: scikit-learn, orange3
- NLP: nltk
- network analysis: networkx
- visualization: matplotlib, seaborn, holoviews, ggplot, bokeh