An open API service indexing awesome lists of open source software.

https://github.com/sim51/docker-data-science-training


https://github.com/sim51/docker-data-science-training

Last synced: about 2 months ago
JSON representation

Awesome Lists containing this project

README

        

= Docker environment for the data science training

== Launch the environment

=== On all environment

==== Step 1 : Download neo4j plugins

First you need to download the latest version of those plugins, and put them inside the `plugins` directory :

* APOC : https://github.com/neo4j-contrib/neo4j-apoc-procedures/releases/
* Graph Algos : https://github.com/neo4j-contrib/neo4j-graph-algorithms/releases/

For example, the current when I write this file are :

* https://github.com/neo4j-contrib/neo4j-graph-algorithms/releases/download/3.5.3.1/graph-algorithms-algo-3.5.3.1.jar
* https://github.com/neo4j-contrib/neo4j-apoc-procedures/releases/download/3.5.0.2/apoc-3.5.0.2-all.jar

==== Step 2 : Clone the training repository

In this directory, just clone the training repository : `git clone https://github.com/mneedham/data-science-training`

==== Step 3 : Launch docker

You just have to run `docker-compose up`

=== On linux

On linux you can directly run the script `run.sh`, it will download everything you need and also run the `docker-compose up` command for you.

== Use the environment

Your Neo4j database is available on your laptop at this url http://localhost:7474/browser with the following login/password -> neo4j / admin
But inside other container (like the jupyter one) your should use `bolt://data-science-training-neo4j`.

Jupyter is available at the following url http://localhost:8888/
To login, you will need to search the token in the docker logs something. You should see something similar to that : `[I 17:39:57.299 NotebookApp] http://(9d6f11a3c1a1 or 127.0.0.1):8888/?token=04833158d59cd1ac094602f9b097822a904fa285607e1c59`