
# blace.ai inference library & c++ model hub

[![Production Deployments](https://img.shields.io/badge/Production%20Deployments-10.000+-blue)](#) [![C++](https://img.shields.io/badge/C++-%2300599C.svg?logo=c%2B%2B&logoColor=white)](#) [![macOS](https://img.shields.io/badge/macOS-000000?logo=apple&logoColor=F0F0F0)](#) [![Ubuntu](https://img.shields.io/badge/Ubuntu-E95420?logo=ubuntu&logoColor=white)](#) [![Windows](https://custom-icon-badges.demolab.com/badge/Windows-0078D6?logo=windows11)](#)

Welcome to blace.ai, a high-performance C++ meta-inference library originally developed as the core framework behind [Blace Plugins'](https://blaceplugins.com/) AI-driven video editing tools.
It abstracts away differences in operating system, AI framework (PyTorch and ONNX), and hardware accelerator. With a unified API and minimal setup, you can easily integrate AI models into your native, Python-free applications.

Products using blace.ai are already trusted in production and have been used by more than **10,000 customers worldwide**.

📦 [**download sdk**](https://github.com/blace-ai/blace-ai/releases) | 🌐 [**blace.ai website**](https://blace.ai) | 📖 [**documentation**](https://blace-ai.github.io/blace-ai/) | 🧦 [**c++ model hub**](https://github.com/blace-ai/hub) | 💬 [**discord channel**](https://discord.com/channels/1202176342603616277/1318605344586338404)

## Overview
![Screenshot](img/overview.svg)


## Features
- **Cross-Platform:** Write C++ code for AI model inference once and deploy to all major operating systems (Windows, macOS Intel/Silicon, and Linux).
- **AI-Framework-Agnostic:** blace.ai supports a growing list of frameworks and providers, including TorchScript (CUDA and MPS) and ONNX (CPU and DirectML), out of the box.
- **IPC & Process Isolation:** blace.ai can run workloads in separate isolated processes connected via high-performance inter-process communication ([IPC](https://blace-ai.github.io/blace-ai/ipc.html)). This architecture brings several advantages:
  - Crash Containment: AI backends are complex and can fail; isolation ensures such failures only stop the affected process, not your main application.
  - Memory Isolation: GPU and CPU memory usage is separated, preventing leaks from impacting the rest of the system.
  - Lightweight Device Support: IoT and other resource-constrained devices can use the IPC version of the library and offload heavy computations to a more powerful remote system, enabling high-performance AI inference even on minimal hardware.
- **Self-Contained:** The library is fully self-contained and ships with all necessary dependencies out of the box, making integration effortless and eliminating the hassle of managing third-party packages. Furthermore, it has **no Python dependencies**.
- **Performant computation graphs:** We provide a set of operators for writing computation graphs for **multi-model inference**. The diagram below shows the structure of a simple computation graph that runs the Segment-Anything encoder and decoder separately, automatically caching the intermediate result (dark orange node); a code sketch of such a graph follows this list:


![Screenshot](img/dag_example_2.svg)


- **Serializable:** Built-in serialization allows you to **save and transfer
computation graphs**, making it easier to integrate with distributed systems and enabling seamless scaling across multiple nodes or environments.
- **Smart models:** Models created with the [**Model Wizard**](https://blace-ai.github.io/blace-ai/model_wizard_creation.html) or
coming from the [Hub](https://www.blace.ai/hub/) store all needed configuration and
metadata about inputs and outputs. This eliminates the need to worry about proper model instantiation or tensor input formatting, such as memory order and sizes, streamlining the integration process.
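
To make the multi-model graph idea concrete, here is a minimal sketch of a two-model encoder/decoder chain using the same operators as the Quick Start below. The `sam_encoder_model`/`sam_decoder_model` headers and handles are hypothetical placeholders (real names come from the model hub), and the comment about intermediate caching restates the behavior described in the feature list above:

```cpp
#include "blace_ai.h"

// Hypothetical model headers; real names come from the model hub.
#include "sam_encoder_model.h"
#include "sam_decoder_model.h"

void runTwoModelGraph(std::string input_file) {
  workload_management::BlaceWorld blace;

  // load the image and feed it to the encoder
  auto image_mem = raw_memory_from_file(input_file);
  auto image = CONSTRUCT_OP(blace::ops::FromRawMemoryOp(image_mem));

  ml_core::InferenceArgsCollection infer_args;
  infer_args.inference_args.backends = {ml_core::TORCHSCRIPT_CPU_FP32};

  // the encoder output is an intermediate node; per the feature description
  // above, it is cached, so re-evaluating the graph skips the encoder
  auto embedding = CONSTRUCT_OP(ops::InferenceOp(
      sam_encoder_model, {image}, infer_args, 0,
      util::getPathToExe().string()));

  // the decoder consumes the cached embedding
  auto mask = CONSTRUCT_OP(ops::InferenceOp(
      sam_decoder_model, {embedding}, infer_args, 0,
      util::getPathToExe().string()));

  computation_graph::GraphEvaluator evaluator(mask);
  auto result = evaluator.evaluateToRawMemory();
  assert(result.first == ml_core::ReturnCode::OK);
}
```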

## Hardware Support
Since the unified model format abstracts away the underlying AI framework, blace.ai provides a deployment path for accelerated inference on most host/hardware combinations.

| Host | ONNX CPU | TorchScript CPU | 🏎️ TorchScript MPS | 🏎️ TorchScript CUDA | 🏎️ ONNX DirectML |
|-----------------------------------|----------|-----------------|---------------------|----------------------|-------------------|
| Ubuntu | ✅ | ✅ | - | - | - |
| Ubuntu (NVIDIA card) | ✅ | ✅ | - | ✅ | - |
| macOS (Intel) | ✅ | ✅ | - | - | - |
| macOS (Silicon) | ✅ | ✅ | ✅ | - | - |
| Windows (DirectML-supported card) | ✅ | ✅ | - | - | ✅ |
| Windows (NVIDIA card) | ✅ | ✅ | - | ✅ | ✅ |
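
In code, a row of this table corresponds to the backend list you pass with your inference arguments, as in the Quick Start below. Our reading of that example (an assumption, not documented wording) is that the library uses the first backend in the list that the current host supports, so one ordered list can cover every row:

```cpp
// Backend priority list, as used in the Quick Start example below.
// Assumption: the library falls back to the first entry the host supports.
ml_core::InferenceArgsCollection infer_args;
infer_args.inference_args.backends = {
    ml_core::TORCHSCRIPT_CUDA_FP16,  // Ubuntu/Windows with an NVIDIA card
    ml_core::TORCHSCRIPT_MPS_FP16,   // macOS on Apple Silicon
    ml_core::ONNX_DML_FP32,          // Windows with a DirectML-capable card
    ml_core::TORCHSCRIPT_CPU_FP32};  // CPU path available on every host
```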

## Model Hub
[**blace.ai hub**](https://www.blace.ai/hub/) is a growing collection of AI models compatible with the inference SDK. All models are carefully optimized, continuously tested on the supported backends, and can be integrated into C++ applications with a few lines of code.

![Screenshot](img/hub.png)

To request integration of a specific model, please [open a ticket](https://github.com/blace-ai/blace-ai/issues/new).

## Quick Start

Follow the [**Quickstart Guide**](https://blace-ai.github.io/blace-ai/getting_started.html) to run your first model within a few minutes.

Integrating AI models into your software should be simple. With blace.ai, you can run AI model inference with just a few lines of code, across **Windows, Linux, and macOS**:

```cpp
#include "blace_ai.h"

// include the models you want to use
#include "depth_anything_v2_v8_small_v3_ALL_export_version_v17.h"

void runDepthModel(std::string input_file, std::string output_file) {
  // have this present globally
  workload_management::BlaceWorld blace;

  // load image into op
  auto image_mem = raw_memory_from_file(input_file);
  auto world_tensor_orig = CONSTRUCT_OP(blace::ops::FromRawMemoryOp(image_mem));

  // interpolate to smaller size
  auto interpolated = CONSTRUCT_OP(ops::Interpolate2DOp(
      world_tensor_orig, 700, 1288, ml_core::BICUBIC, false, true));

  // construct model inference arguments for cross-platform compatibility
  ml_core::InferenceArgsCollection infer_args;
  infer_args.inference_args.backends = {
      ml_core::TORCHSCRIPT_CUDA_FP16, ml_core::TORCHSCRIPT_MPS_FP16,
      ml_core::ONNX_DML_FP32, ml_core::TORCHSCRIPT_CPU_FP32};

  // construct inference operation
  auto infer_op = CONSTRUCT_OP(ops::InferenceOp(
      depth_anything_v2_v8_small_v3_ALL_export_version_v17, {interpolated},
      infer_args, 0, util::getPathToExe().string()));

  // normalize depth to zero-one range
  auto result_depth = CONSTRUCT_OP(ops::NormalizeToZeroOneOP(infer_op));

  // save result to image file
  result_depth = CONSTRUCT_OP(ops::SaveImageOp(result_depth, output_file));

  // construct (IPC) evaluator and evaluate to raw memory object
#ifndef BLACE_AI_USE_IPC
  computation_graph::GraphEvaluator evaluator(result_depth);
  auto eval_result = evaluator.evaluateToRawMemory();
#else
  ipc::IpcEvaluator ipc_evaluator(result_depth);
  auto eval_result = ipc_evaluator.evaluateToRawMemory();
#endif

  assert(eval_result.first == ml_core::ReturnCode::OK);
}
```
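
For completeness, here is a minimal sketch of a caller for the function above; the file names are placeholder paths:

```cpp
int main() {
  // placeholder paths: any readable image file works as input
  runDepthModel("input.png", "depth.png");
  return 0;
}
```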

## Installation
Integrate blace.ai into your CMake project with just two lines.

```cmake
include("../cmake/FindBlace.cmake")
# Replace your_target with your executable or library target.
target_link_libraries(your_target PRIVATE 3rdparty::BlaceAI)
```
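
For orientation, here is a minimal sketch of a complete `CMakeLists.txt` around those two lines. The project and target names are placeholders, and the relative path to `FindBlace.cmake` is an assumption that depends on where the unpacked SDK lives:

```cmake
cmake_minimum_required(VERSION 3.16)
project(blace_demo LANGUAGES CXX)

add_executable(blace_demo main.cpp)

# Adjust this path to the FindBlace.cmake shipped inside the SDK download.
include("../cmake/FindBlace.cmake")
target_link_libraries(blace_demo PRIVATE 3rdparty::BlaceAI)
```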

## Roadmap
- For feature requests, please contact us via mail or open an issue here.

## Origin
Originally developed as the internal framework for [Blace Plugins'](https://blaceplugins.com/) AI-driven video editing tools, blace.ai has proven its reliability in production environments. Recognizing its potential to benefit a broader audience, we are thrilled to release blace.ai to the public.

## Feedback & Bug Reports
Please [open a ticket](https://github.com/blace-ai/blace-ai/issues/new) or reach out to contact [at] blace.ai