Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/admk/torchmb
Model batching for PyTorch. With the help of torch.fx and einops.
- Host: GitHub
- URL: https://github.com/admk/torchmb
- Owner: admk
- Created: 2021-08-07T13:23:14.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2024-08-03T12:51:38.000Z (5 months ago)
- Last Synced: 2024-12-04T15:17:05.986Z (about 1 month ago)
- Language: Python
- Homepage:
- Size: 70.3 KB
- Stars: 4
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Metadata Files:
- Readme: README.md
README
# PyTorch Model Batcher
## Installation
```bash
pip install "git+https://github.com/admk/torchmb.git#egg=torchmb"
```
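If the installation succeeded, the package should import cleanly; a quick check:
```bash
python -c "import torchmb"
```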
## Usage
### Model Batching
Common layers are supported. To use, simply instantiate a PyTorch module
and use `torchmb.BatchModule(module, batch)`
to generate a batch of identical models:
```python
from torchmb import BatchModule

model_batch_size = 100
batch_model = BatchModule(LeNet(), batch=model_batch_size)
```
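The snippet assumes a `LeNet` model class is already defined. A minimal LeNet-style stand-in, sized for MNIST-like `1x28x28` inputs, might look like:
```python
import torch.nn as nn


class LeNet(nn.Module):
    """A small LeNet-style CNN, used here only as a stand-in model."""

    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, 5), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(6, 16, 5), nn.ReLU(), nn.MaxPool2d(2))
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * 4 * 4, 120), nn.ReLU(),
            nn.Linear(120, 84), nn.ReLU(),
            nn.Linear(84, num_classes))

    def forward(self, x):
        return self.classifier(self.features(x))
```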
### Forward Passes
For forward passes,
prepare your batch input data `batch_input`
with shape `(model_batch_size, image_batch_size, ...)`,
and use `batch_model` by calling it:
```python
batch_output = batch_model(batch_input)
```
This computes a `batch_output`
with shape `(model_batch_size, image_batch_size, ...)`.
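For instance, reusing `batch_model` from the snippet above with illustrative MNIST-like shapes, a single image batch can be shared by all models by broadcasting it along a new leading model dimension:
```python
import torch

model_batch_size, image_batch_size = 100, 32
images = torch.randn(image_batch_size, 1, 28, 28)  # one ordinary image batch
# Prepend the model dimension so every replica sees the same images:
batch_input = images.expand(model_batch_size, *images.shape)
batch_output = batch_model(batch_input)  # (100, 32, num_classes)
```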
### Batch Utility Functions
The `torchmb` package also provides batch utility functions
for common top-K and loss functions.
To compute the cross-entropy loss,
prepare a batch of targets
with shape `(model_batch_size, image_batch_size)`,
and use:
```python
from torch import nn
from torchmb import batch_loss

loss_func = nn.functional.cross_entropy
losses = batch_loss(
    batch_inputs, batch_targets, model_batch_size, loss_func, 'mean')
```
This computes a batch of loss values `losses`
with shape `(model_batch_size,)`.
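For instance, continuing the snippets above, a single label batch shared by all models can be broadcast in the same way as the inputs (here `batch_output`, the stacked model outputs, plays the role of `batch_inputs`):
```python
import torch
from torch import nn
from torchmb import batch_loss

targets = torch.randint(0, 10, (image_batch_size,))  # one ordinary label batch
batch_targets = targets.expand(model_batch_size, image_batch_size)
losses = batch_loss(
    batch_output, batch_targets, model_batch_size,
    nn.functional.cross_entropy, 'mean')  # shape: (model_batch_size,)
```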
Similarly, for top-K accuracy evaluation, use:
```python
from torchmb import batch_topk

accs = batch_topk(batch_inputs, batch_targets, model_batch_size, (1, 5))
```
where `accs` is a batch of top-1 and top-5 accuracies
with shape `(2, model_batch_size)`,
and the rows respectively list top-1 and top-5 values.
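The two rows unpack directly; for example, to average each accuracy over the model batch:
```python
top1_accs, top5_accs = accs  # each of length model_batch_size
print(top1_accs.mean(), top5_accs.mean())
```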
### Backward Passes
Batched modules and batch utility functions
are fully compatible with automatic differentiation.
To invoke backpropagation on the batched losses,
simply use, for instance:
```python
losses.sum().backward()
```
The gradients for all batched models
will be computed independently in batch.
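Putting the pieces together, one full training step for the whole model batch might look like the following sketch, reusing names from the snippets above and assuming `BatchModule` exposes its replicated parameters through the usual `nn.Module` interface:
```python
import torch
from torch import nn
from torchmb import batch_loss

# One optimiser drives all replicated parameters at once:
optimiser = torch.optim.SGD(batch_model.parameters(), lr=0.01, momentum=0.9)

optimiser.zero_grad()
batch_output = batch_model(batch_input)  # (models, images, classes)
losses = batch_loss(
    batch_output, batch_targets, model_batch_size,
    nn.functional.cross_entropy, 'mean')  # one loss per model
losses.sum().backward()  # independent per-model gradients
optimiser.step()  # plain SGD preserves model isolation (see Caveats)
```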
### Extending the Model Batcher
If your custom module/functional
is parameter-free and performs isolated computation
for each image,
you don't need to do anything,
because we merge the `model_batch_size` dimension
into `image_batch_size` of the module input by default.
To support custom modules
(for instance, `MyModule`) in torchmb,
implement your `MyBatchModule` class
by inheriting from `AbstractBatchModule`
and register it with `register_batch_module`:
```python
from torch import Tensor
from torchmb import AbstractBatchModule, register_batch_module


class MyBatchModule(AbstractBatchModule):
    base_class = MyModule

    @classmethod
    def from_module(cls, module: MyModule, batch: int) -> 'MyBatchModule':
        return cls(...)

    def __init__(self, batch: int, ...):
        super().__init__(batch)
        ...

    def forward(self, batch_inputs: Tensor) -> Tensor:
        ...


register_batch_module(MyModule)
```
Note that in the `forward` method,
the first dimension of `batch_inputs`
arranges all data values
in the dictionary order of `(image_batch_size, model_batch_size)`,
and the return value is also expected
to follow the same data order.
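A sketch of this convention using `einops` (on which `torchmb` builds), assuming the model batch size is available on the module, written `self.batch` here for illustration:
```python
from einops import rearrange

def forward(self, batch_inputs: Tensor) -> Tensor:
    # The flat leading dimension enumerates (image, model) pairs in
    # dictionary order, so the model index varies fastest.
    x = rearrange(batch_inputs, '(i m) ... -> i m ...', m=self.batch)
    x = ...  # per-model computation on the unflattened view
    # Restore the same flat layout before returning:
    return rearrange(x, 'i m ... -> (i m) ...')
```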
## Caveats
To ensure isolated training in batched models,
we performed extensive testing in `tests/test_(functional|layers).py`.
However, it is important to note that
to prevent information leakage,
the user is expected to be aware
of how their algorithms can affect model isolation
in forward and backward passes.
For example,
the SGD optimizer (even with momentum or Nesterov)
does not leak information,
but `AdamW` violates the constraint.

Platform-dependent behaviour, floating-point rounding errors,
and the choice of algorithms used by CuDNN
can all affect the accuracy of the outputs.
Sometimes there may be a non-negligible difference
between the batch outputs and non-batch results.
This is generally not an issue,
because in either case it is very difficult to predict
how such errors are introduced by the implementation,
and the user has very little control over them.
In any case,
we do not assume liability for unintended behaviours,
nor do we provide any warranties.