# DEPRECATED

**The RFC is closed and this has been merged into [TVM](https://github.com/dmlc/tvm/tree/master/rust).**

# TVM Runtime Frontend Support

This crate provides an idiomatic Rust API for the [TVM](https://github.com/dmlc/tvm) runtime frontend as part of the ~~[ongoing RFC](https://github.com/dmlc/tvm/issues/1601)~~. Currently this requires **Nightly Rust**.

Check out the [docs](https://ehsanmok.github.io/tvm_frontend/tvm_frontend/index.html).

## What Does This Crate Offer?

Here is the typical workflow:

1. Train your **Deep Learning** model using any major framework such as [PyTorch](https://pytorch.org/), [Apache MXNet](https://mxnet.incubator.apache.org/) or [TensorFlow](https://www.tensorflow.org/)
2. Use **TVM** to build optimized model artifacts for a supported context such as CPU, GPU, OpenCL, Vulkan, VPI, ROCm, etc.
3. Deploy your models using **Rust** :heart:

### Example: Deploy Image Classification from a Pretrained ResNet18 on ImageNet 1k

Please check out [examples/resnet](https://github.com/ehsanmok/tvm-rust/tree/master/examples/resnet) for the complete end-to-end example.

Here's a Python snippet for downloading and building a pretrained ResNet18 via MXNet and TVM:

```python
import os

import nnvm.compiler
import nnvm.frontend
import tvm
from tvm.contrib import cc
from mxnet.gluon.model_zoo.vision import get_model

# illustrative values; see examples/resnet for the full script
opt_level = 3
target = "llvm"
data_shape = (1, 3, 224, 224)
target_dir = "."

block = get_model('resnet18_v1', pretrained=True)

sym, params = nnvm.frontend.from_mxnet(block)
# add the softmax layer for prediction
net = nnvm.sym.softmax(sym)
# compile the model
with nnvm.compiler.build_config(opt_level=opt_level):
    graph, lib, params = nnvm.compiler.build(
        net, target, shape={"data": data_shape}, params=params)
# save the model artifacts
lib.save(os.path.join(target_dir, "deploy_lib.o"))
cc.create_shared(os.path.join(target_dir, "deploy_lib.so"),
                 [os.path.join(target_dir, "deploy_lib.o")])

with open(os.path.join(target_dir, "deploy_graph.json"), "w") as fo:
    fo.write(graph.json())
with open(os.path.join(target_dir, "deploy_param.params"), "wb") as fo:
    fo.write(nnvm.compiler.save_param_dict(params))
```

Now we need to load these artifacts to create and run the *Graph Runtime*, which classifies our input cat image

![cat](https://github.com/dmlc/mxnet.js/blob/master/data/cat.png?raw=true)

as demonstrated in the following Rust snippet

```rust
let graph = fs::read_to_string("deploy_graph.json")?;
// load the compiled module
let lib = Module::load(&Path::new("deploy_lib.so"))?;
// inference context (CPU here)
let ctx = TVMContext::cpu(0);
// get the global TVM graph runtime function
let runtime_create_fn = Function::get_function("tvm.graph_runtime.create", true).unwrap();

let runtime_create_fn_ret = call_packed!(
    runtime_create_fn,
    &graph,
    &lib,
    &ctx.device_type,
    &ctx.device_id
)?;
// get the graph runtime module
let graph_runtime_module = runtime_create_fn_ret.to_module();
// get the registered `load_params` function from the runtime module
let load_param_fn = graph_runtime_module
    .get_function("load_params", false)
    .unwrap();
// parse the parameters and convert them to a TVMByteArray
let params: Vec<u8> = fs::read("deploy_param.params")?;
let barr = TVMByteArray::from(&params);
// load the parameters
call_packed!(load_param_fn, &barr)?;
// get the `set_input` function
let set_input_fn = graph_runtime_module
    .get_function("set_input", false)
    .unwrap();
// `input` is the preprocessed cat image as an NDArray (see examples/resnet)
call_packed!(set_input_fn, "data", &input)?;
// get the `run` function from the runtime module
let run_fn = graph_runtime_module.get_function("run", false).unwrap();
// execute the run function. Note that it takes no argument
call_packed!(run_fn,)?;
// prepare to get the output
let output_shape = &mut [1, 1000];
let output = empty(output_shape, TVMContext::cpu(0), TVMType::from("float"));
// get the `get_output` function from the runtime module
let get_output_fn = graph_runtime_module
    .get_function("get_output", false)
    .unwrap();
// execute the get_output function
call_packed!(get_output_fn, &0, &output)?;
// flatten the output into a Vec<f32>
let output = output.to_vec::<f32>()?;
```
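From here, the prediction is simply the index of the largest of the 1,000 scores in `output`. A minimal sketch of that last step (the full version, including mapping the index to a human-readable label, lives in [examples/resnet](https://github.com/ehsanmok/tvm-rust/tree/master/examples/resnet)):

```rust
// a minimal sketch: pick the class index with the highest score
let (best_idx, best_score) = output
    .iter()
    .enumerate()
    .max_by(|a, b| a.1.partial_cmp(b.1).unwrap())
    .unwrap();
println!("predicted class index {} with score {}", best_idx, best_score);
```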

## Installation

Please follow the TVM [installation instructions](https://docs.tvm.ai/install/index.html), `export TVM_HOME=/path/to/tvm` and add `libtvm_runtime` to your `LD_LIBRARY_PATH`.

*Note:* To run the end-to-end examples and tests, `tvm`, `nnvm` and `topi` need to be added to your `PYTHONPATH`; this happens automatically if they are installed individually, for example into an Anaconda environment.

## Supported TVM Functionalities

### Use TVM to Generate Shared Library

One can use the following Python snippet to generate `add_gpu.so`, which adds two vectors on the GPU.

```python
import os
import tvm
from tvm.contrib import cc

def test_add(target_dir):
    if not tvm.module.enabled("cuda"):
        print(f"skip {__file__} because cuda is not enabled...")
        return
    n = tvm.var("n")
    A = tvm.placeholder((n,), name='A')
    B = tvm.placeholder((n,), name='B')
    C = tvm.compute(A.shape, lambda i: A[i] + B[i], name="C")
    s = tvm.create_schedule(C.op)
    bx, tx = s[C].split(C.op.axis[0], factor=64)
    s[C].bind(bx, tvm.thread_axis("blockIdx.x"))
    s[C].bind(tx, tvm.thread_axis("threadIdx.x"))
    fadd_cuda = tvm.build(s, [A, B, C], "cuda", target_host="llvm", name="myadd")

    fadd_cuda.save(os.path.join(target_dir, "add_gpu.o"))
    fadd_cuda.imported_modules[0].save(os.path.join(target_dir, "add_gpu.ptx"))
    cc.create_shared(os.path.join(target_dir, "add_gpu.so"),
                     [os.path.join(target_dir, "add_gpu.o")])

if __name__ == "__main__":
    import sys
    if len(sys.argv) != 2:
        sys.exit(-1)
    test_add(sys.argv[1])
```

### Run the Generated Shared Library

The following code snippet demonstrates how to load and test the generated shared library (`add_gpu.so`) in Rust.

```rust
extern crate tvm_frontend as tvm;

use std::path::Path;

use tvm::*;

fn main() {
    let shape = &mut [2];
    let mut data = vec![3f32, 4.0];
    // allocate input and output NDArrays on the GPU
    let mut arr = empty(shape, TVMContext::gpu(0), TVMType::from("float"));
    arr.copy_from_buffer(data.as_mut_slice());
    let mut ret = empty(shape, TVMContext::gpu(0), TVMType::from("float"));
    // load the shared library and import its PTX module
    let path = Path::new("add_gpu.so");
    let ptx = Path::new("add_gpu.ptx");
    let mut fadd = Module::load(path).unwrap();
    let fadd_dep = Module::load(ptx).unwrap();
    assert!(fadd.enabled("gpu"));
    fadd.import_module(fadd_dep);
    fadd.entry_func();
    // call the module's entry function with two inputs and one output
    function::Builder::from(&mut fadd)
        .arg(&arr)
        .arg(&arr)
        .set_output(&mut ret)
        .invoke()
        .unwrap();

    assert_eq!(ret.to_vec::<f32>().unwrap(), vec![6f32, 8.0]);
}
```

**Note:** `rustc` must be instructed to link against the generated `add_gpu.so` at runtime, for example by emitting
`cargo:rustc-link-search=native=add_gpu` from a build script.

See the custom `build.rs` in the tests and examples for more details.
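For illustration, a minimal `build.rs` along these lines might look like the following (a sketch only, assuming the generated `add_gpu.so` sits in an `add_gpu/` directory next to the crate; the real build scripts in the tests and examples do more, such as invoking the Python build step):

```rust
// build.rs (minimal sketch; see the tests/examples for the real build scripts)
fn main() {
    // assumption: add_gpu.so has already been generated into ./add_gpu
    println!("cargo:rustc-link-search=native=add_gpu");
}
```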

### Convert and Register a Rust Function as a TVM Packed Function

One can use the `register_global_func!` macro to convert and register a Rust
function of type `fn(&[TVMArgValue]) -> Result<TVMRetValue>` as a global TVM **packed function**, as follows:

```rust
#[macro_use]
extern crate tvm_frontend as tvm;
extern crate ndarray;

use std::convert::TryFrom;

use ndarray::ArrayD;
use tvm::*;

fn main() {
    register_global_func! {
        fn sum(args: &[TVMArgValue]) -> Result<TVMRetValue> {
            let mut ret = 0f32;
            let shape = &mut [2];
            for arg in args.iter() {
                // copy each argument into a fresh CPU NDArray
                let e = empty(shape, TVMContext::cpu(0), TVMType::from("float"));
                let arr = arg.to_ndarray().copy_to_ndarray(e).unwrap();
                // convert to an `ndarray::ArrayD` and accumulate the element sum
                let rnd: ArrayD<f32> = ArrayD::try_from(&arr).unwrap();
                ret += rnd.scalar_sum();
            }
            let ret_val = TVMRetValue::from(&ret);
            Ok(ret_val)
        }
    }

    let shape = &mut [2];
    let mut data = vec![3f32, 4.0];
    let mut arr = empty(shape, TVMContext::cpu(0), TVMType::from("float"));
    arr.copy_from_buffer(data.as_mut_slice());
    // look up the registered `sum` function and pass the same array twice
    let mut registered = function::Builder::default();
    registered
        .get_function("sum", true)
        .arg(&arr)
        .arg(&arr);

    assert_eq!(registered.invoke().unwrap().to_float(), 14f64);
}
```