Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/pingsutw/2019-minicourse-submarine
DESIGN AND IMPLEMENTATION OF A MACHINE LEARNING PLATFORM
https://github.com/pingsutw/2019-minicourse-submarine
Last synced: 3 months ago
JSON representation
DESIGN AND IMPLEMENTATION OF A MACHINE LEARNING PLATFORM
- Host: GitHub
- URL: https://github.com/pingsutw/2019-minicourse-submarine
- Owner: pingsutw
- License: mit
- Created: 2019-12-18T01:00:49.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2023-05-01T13:48:40.000Z (almost 2 years ago)
- Last Synced: 2024-11-08T09:03:11.315Z (3 months ago)
- Language: Jupyter Notebook
- Size: 3.18 MB
- Stars: 12
- Watchers: 2
- Forks: 3
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# 2019-minicourse-submarine
DESIGN AND IMPLEMENTATION OF A MACHINE LEARNING PLATFORM2019-minicourse-submarine [slide](https://docs.google.com/presentation/d/1KdOmE7ErS5SAeTr_YURkSXUwUPlNzA7RJchDUiN7F2U/edit?usp=sharing), [doc](https://hackmd.io/@pingsutw/H11HN5Z1U)
data:image/s3,"s3://crabby-images/b1cb1/b1cb12d8eb642609a71f1b25f87548371c2dd5e8" alt=""
### What is Apache Submarine?
Apache Submarine is a unified AI platform which allows engineers and data scientists to run Machine Learning and Deep Learning workload in distributed cluster.
Goals of Submarine:
- It allows jobs easy access data/models in HDFS and other storages.
- Can launch services to serve TensorFlow/PyTorch models.
- Support run distributed TensorFlow jobs with simple configs.
- Support run user-specified Docker images.
- Support specify GPU and other resources.
- Support launch TensorBoard for training jobs if user specified.
- Support customized DNS name for roles (like TensorBoard.$user.$domain:6006)### Prerequisites
- Maven 3.3 or later ( 3.6.2 is known to fail, see SUBMARINE-273 )
- JDK 1.8### Install mini-submarine
```shell=
git clone https://github.com/apache/submarine.git
cd submarine
mvn clean install package -DskipTests
cd dev-support/mini-submarine
./build_mini-submarine.sh
```### Pull from dockerhub without maven and java
```shell=
docker pull hadoopsubmarine/mini-submarine:0.3.0-SNAPSHOT
```### Run mini-submarine
```shell=
docker run -it -h submarine-dev --net=bridge --privileged -P local/mini-submarine:0.3.0-SNAPSHOT /bin/bash# In the container, use root user to bootstrap hdfs and yarn
/tmp/hadoop-config/bootstrap.shsu yarn
# Run distributed training on hadoop
cd && cd submarine && ./run_submarine_mnist_tony.sh
```### ML code in a real-world ML system is a lot smaller than the infrastructure
data:image/s3,"s3://crabby-images/81e9c/81e9c15c0655510c89c442d208396e4a0b7c8701" alt=""### Deep learning use cases in the real world.
data:image/s3,"s3://crabby-images/62854/6285434be1cd1adf8ffee53e716457ae5f0dc7af" alt=""
data:image/s3,"s3://crabby-images/c4b07/c4b07168f5ad9f20915c45af023fa4138d6d000e" alt="Machine Learning Platform"
## About this tutorial
After this tutorial, you will know :
**[Apache Submarine](https://github.com/apache/submarine)** - is Cloud Native Machine Learning Platform
**[Apache airflow](https://github.com/apache/airflow)** -
a platform to programmatically author, schedule, and monitor workflows.
**[kaggle](https://www.kaggle.com/)** - an online
community of data scientists and machine learners. Kaggle allows users to
find and publish data sets, explore and build models in a web-based
data-science environment, work with other data scientists and machine
learning engineers, and enter competitions to solve data science challenges.**[jupyter notebook](https://jupyter.org/)** - an open-source web application that allows you to create
and share documents that contain live code, equations, visualizations and narrative text**[mlflow](https://mlflow.org/)** - An open source platform for the machine learning lifecycle
## Prerequisites
- Ubuntu >= 16.04
- Docker
- Docker-compose
- memory >= 5G### Installation
```shell script
sudo apt install docker-compose # install docker-compose
sudo apt-get install docker.io # install docker
service docker status
```### Install Docker Desktop on Mac
[docker doc](https://docs.docker.com/docker-for-mac/install/)
### Join Kaggle Competition
[House Prices: Advanced Regression Techniques](https://www.kaggle.com/c/house-prices-advanced-regression-techniques)### Setting kaggle user name and API key in kaggle.json
[create a kaggle's public key](https://www.kaggle.com/docs/api)
```shell script
cd airflow
vim kaggle.json
# {"username":"", "key":""}
```
### Build
```shell script
sudo docker-compose build
```### Usage
```shell script
sudo docker-compose -f docker-compose.yml up
```### UI Links
- **mlflow** : [localhost:5000](localhost:5000)
- **jupyter notebook** : [localhost:7000](localhost:7000)
- **airflow** : [localhost:8080](localhost:8080)#### 1. Turn on airflow DAG
data:image/s3,"s3://crabby-images/63b40/63b40bac8889f0e2a95852c9552b563409183694" alt=""#### 2. Trigger DAG
data:image/s3,"s3://crabby-images/f94e2/f94e2d4fa3d6fecb4b7ce93597e2aa34f23d8868" alt=""#### 3. Open data_visualization.ipynb and start visualizing data
data:image/s3,"s3://crabby-images/c5d74/c5d74b23ea88016f6f69fe4bc5a0362c1eceef1d" alt=""data:image/s3,"s3://crabby-images/c954e/c954e9ee422646f4d7eb0018bf3c93c16ab06c89" alt=""
#### 4. mlflow compare ml experiments
data:image/s3,"s3://crabby-images/9593a/9593a7e55424f502f40062924e6b20627479b4a9" alt=""#### 5. Try to optimize ML model
```python
# open ./dags/src/training.py and tune parameters
params = {
"colsample_bytree": 0.4603,
"gamma": 0.0468,
"learning_rate": 0.05,
"max_depth": 20,
"min_child_weight": 2,
"n_estimators": 2200,
"reg_alpha": 0.4640,
"reg_lambda": 0.8571,
"subsample": 0.5213,
"random_state": 7,
"nthread": -1
}
```#### 6. Kaggle Leaderboard
[Leaderboard](https://www.kaggle.com/c/house-prices-advanced-regression-techniques/leaderboard)
data:image/s3,"s3://crabby-images/07230/07230ee1010de617a7547fbbb5b03567dd44747b" alt=""