https://github.com/sql-machine-learning/zeppelin-sqlflow
https://github.com/sql-machine-learning/zeppelin-sqlflow
Last synced: 5 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/sql-machine-learning/zeppelin-sqlflow
- Owner: sql-machine-learning
- License: apache-2.0
- Created: 2021-05-26T06:40:26.000Z (over 4 years ago)
- Default Branch: develop
- Last Pushed: 2021-06-25T05:35:03.000Z (over 4 years ago)
- Last Synced: 2025-04-30T16:44:14.614Z (8 months ago)
- Language: Java
- Size: 19.5 KB
- Stars: 10
- Watchers: 2
- Forks: 3
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# zeppelin-sqlflow
## Project Description
Integrate Ant Group open source machine learning tool [SQLFlow](http://sqlflow.org/sqlflow) into the web-based notebook tool [Apache Zeppelin](http://zeppelin.apache.org/) to expand the interpreter language supported by Zeppelin.
## Motivation
Integrate SQLFlow into Apache Zeppelin:
- Rich SQLFlow usage scenarios. Users can also use SQLFlow in this notebook environment;
- Expanded the interpreter language supported by Zeppelin;
- SQLFlow can be used in conjunction with other languages or data processors on this platform to fully demonstrate its convenience in AI applications.
## Quick Overview
Here are examples for training a TensorFlow [DNNClassifier](https://www.tensorflow.org/api_docs/python/tf/estimator/DNNClassifier) model using sample data Iris.train, and running prediction using the trained model in Zeppelin environment.

## How to install zeppelin-sqlflow
The installation is divided into the following steps:
- Prepare the interpreter project code jar package `zeppelin-sqlflow-0.9.0.jar` and the configuration file `interpreter-setting.json`;
- Switch to the installation directory of Zeppelin and create the `interpreter/sqlflow` subdirectory:
```bash
mkdir ~/zeppelin-0.9.0-SNAPSHOT/interpreter/sqlflow
```
- Copy the jar package and configuration file in step 1 to the directory created in step 2:
```bash
# This is the path on my machine, for reference only.
cp ~/zeppelin-sqlflow-0.9.0.jar ~/zeppelin-0.9.0-SNAPSHOT/interpreter/sqlflow
cp ~/interpreter-setting.json ~/zeppelin-0.9.0-SNAPSHOT/interpreter/sqlflow
```
- Start zeppelin
```bash
cd ~/zeppelin-0.9.0-SNAPSHOT/bin/
./zeppelin-daemon.sh start
```
- Refer to the manual [OperationGuide](https://github.com/sql-machine-learning/zeppelin-sqlflow/blob/develop/doc/OperationGuide.md) for specific use steps.
## Developer introduction

The project was initiated by the foundation platform development team of the IT department in PCCC. It aims to strengthen the function expansion of the Zeppelin in data exploration and model training,which provides data scientists with more comprehensive and rich data processing tools.