https://github.com/lostbean/bb_policy

Last synced: 15 days ago
JSON representation

Host: GitHub
URL: https://github.com/lostbean/bb_policy
Owner: lostbean
License: other
Created: 2026-06-02T21:23:49.000Z (23 days ago)
Default Branch: main
Last Pushed: 2026-06-02T21:32:20.000Z (23 days ago)
Last Synced: 2026-06-02T23:17:15.561Z (23 days ago)
Language: Elixir
Size: 141 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.txt
- Agents: AGENTS.md

Awesome Lists containing this project

README

          

# BB.Policy

[![CI](https://github.com/lostbean/bb_policy/actions/workflows/ci.yml/badge.svg)](https://github.com/lostbean/bb_policy/actions/workflows/ci.yml)

[![Hex version](https://img.shields.io/hexpm/v/bb_policy.svg)](https://hex.pm/packages/bb_policy)

Learned policies for [Beam Bots](https://github.com/beam-bots/bb). `bb_policy`

lets robots execute neural-network behaviours that map observations to actions —

train a policy from demonstrations or simulation, export it to ONNX, and deploy

it on real hardware with full safety-system integration.

A policy is a function `π: observation → action`. Given what the robot perceives

(joint positions, velocities, camera frames, forces), the policy outputs what it

should do (target positions, velocities, gripper commands). Inference runs on the

BEAM, in the same runtime as control — so a crashed or slow policy can't take the

robot down with it.

## Status

🚧 **Early scaffold.** The `BB.Policy` behaviour and project conventions are in

place; implementations are landing in phases. See

[`PROJECT_PLAN.md`](https://github.com/lostbean/bb_policy/blob/main/PROJECT_PLAN.md)

for the roadmap and the design decisions behind it.

## Installation

```elixir

def deps do

  [

    {:bb_policy, "~> 0.1"},

    # ONNX inference is optional — add ortex when you deploy a real model:

    {:ortex, "~> 0.1"}

  ]

end

```

## Usage

```elixir

{:ok, result} =

  BB.Policy.run(MyRobot, BB.Policy.ONNX, %{task: :pick_mug},

    policy_opts: [

      model: "priv/models/pick_mug.onnx",

      normalizer: "priv/models/pick_mug.json",

      observation_keys: [:joint_positions, :joint_velocities, :gripper],

      action_keys: [:target_positions, :target_gripper]

    ],

    rate_hz: 20,

    timeout: :timer.seconds(30)

  )

```

## How it fits the framework

| Concern | Where it lives |

|---------|----------------|

| Map observation → action | `BB.Policy` behaviour |

| Fixed-rate control loop | `BB.Policy.Runner` |

| Input/output scaling | `BB.Policy.Normalizer` |

| ONNX model loading & inference | `BB.Policy.ONNX` (via [Ortex](https://github.com/elixir-nx/ortex)) |

| Safety | `BB.Safety` — the runner only applies commands while armed |

| Observability | `[:bb, :policy, …]` telemetry events |

## Documentation

Generated docs live at . Architecture, decisions,

and the phased roadmap are in

[`PROJECT_PLAN.md`](https://github.com/lostbean/bb_policy/blob/main/PROJECT_PLAN.md);

contributor conventions are in

[`AGENTS.md`](https://github.com/lostbean/bb_policy/blob/main/AGENTS.md).

## Licence

Apache-2.0. See [`LICENSES/`](https://github.com/lostbean/bb_policy/tree/main/LICENSES)

and the SPDX headers on each file ([REUSE](https://reuse.software/)-compliant).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/lostbean/bb_policy

Awesome Lists containing this project

README