
# Bitcodes - PyTorch

A new vector quantization method with binary codes, in PyTorch.

```bash
pip install bitcodes-pytorch
```
[![PyPI - Python Version](https://img.shields.io/pypi/v/bitcodes-pytorch?style=flat&colorA=black&colorB=black)](https://pypi.org/project/bitcodes-pytorch/)

## Usage

```python
import torch
from bitcodes_pytorch import Bitcodes

bitcodes = Bitcodes(
    features=8,       # Number of features per vector
    num_bits=4,       # Number of bits per vector
    temperature=10,   # Gumbel softmax training temperature
)

# Set to eval during inference to make quantization deterministic
bitcodes.eval()

x = torch.randn(1, 6, 8)
# Computes y, the quantized version of x, and the bitcodes
y, bits = bitcodes(x)

"""
y.shape = torch.Size([1, 6, 8])

bits = tensor([[
    [0, 0, 0, 0],
    [1, 0, 1, 1],
    [1, 0, 0, 1],
    [1, 0, 0, 0],
    [0, 1, 1, 1],
    [0, 0, 1, 0]
]])
"""
```

### Dequantize
```python
y_decoded = bitcodes.from_bits(bits)

assert torch.allclose(y, y_decoded) # Assert passes in eval mode!
```
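
As a usage sketch (drawing on the conversion helpers shown in the next section), the bits can be stored as plain integers and still decode back to the same vectors:

```python
from bitcodes_pytorch import to_decimal, to_bits

# Hedged round trip: keep only the decimal indices, rebuild the bit matrix,
# then decode; this just composes calls shown elsewhere in this README.
indices = to_decimal(bits)
y_restored = bitcodes.from_bits(to_bits(indices, num_bits=4))
assert torch.allclose(y, y_restored)  # holds in eval mode
```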

### Utils: Decimal-Binary Conversion
```python
from bitcodes_pytorch import to_decimal, to_bits

indices = to_decimal(bits)
# tensor([[ 0, 11, 9, 8, 7, 2]])

bits = to_bits(indices, num_bits=4)

"""
bits = tensor([[
    [0, 0, 0, 0],
    [1, 0, 1, 1],
    [1, 0, 0, 1],
    [1, 0, 0, 0],
    [0, 1, 1, 1],
    [0, 0, 1, 0]
]])
"""
```
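
Judging from the outputs above, the conversion reads the first bit as the most significant (e.g. `[1, 0, 1, 1]` maps to 11). A plain-Python sanity check of that reading:

```python
# Not library code: MSB-first positional value of a bit row.
# [1, 0, 1, 1] -> 1*8 + 0*4 + 1*2 + 1*1 = 11, matching `indices` above.
row = [1, 0, 1, 1]
index = sum(bit << (len(row) - 1 - pos) for pos, bit in enumerate(row))
assert index == 11
```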

## Explanation

Current vector quantization methods (e.g. [VQ-VAE](https://arxiv.org/abs/1711.00937#), [RQ-VAE](https://arxiv.org/abs/2203.01941)) use either a single large codebook or multiple smaller codebooks applied as residuals. Residuals allow an exponential increase in the number of possible combinations while keeping the total number of codebook items small, since many codebook elements are overlapped. If we let $C$ be the codebook size and $R$ the number of residuals, we get a theoretical maximum of $C^R$ combinations, assuming all residuals share the same codebook size. The total number of codebook elements, which is proportional to the parameter count, is instead $C \cdot R$. It therefore makes sense to keep $C$ as small as possible to limit the parameter count, while increasing $R$ to exploit the exponential growth in combinations.
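
For concreteness, a quick back-of-the-envelope comparison (the values are illustrative, not taken from any particular model):

```python
# Residual codebooks pay a linear parameter cost for an exponential
# number of combinations.
C, R = 2, 16
print(C ** R)  # 65536 possible quantized outputs
print(C * R)   # 32 codebook vectors actually stored
```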

Here we use $C=2$, making the code binary, while $R=$`num_bits` can be freely chosen. The residuals are overlapped to produce the output instead of each one quantizing the difference left by the previous; this removes the sequential residual loop and lets us quantize with large $R$ in parallel.
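
As a rough sketch of the idea (an assumption made for illustration, not the library's actual internals): with $C=2$, each bit can be read as selecting one of two learned vectors for its slot, and the selections are summed in parallel:

```python
import torch

# Hypothetical sketch, not bitcodes-pytorch internals: each of the num_bits
# slots has its own 2-entry codebook, a bit picks one entry per slot, and
# the picks are summed ("overlapped") with no sequential residual loop.
num_bits, features = 4, 8
codebook = torch.randn(num_bits, 2, features)   # two candidates per bit slot

bits = torch.randint(0, 2, (6, num_bits))       # one bitcode per input vector
picks = codebook[torch.arange(num_bits), bits]  # (6, num_bits, features)
y = picks.sum(dim=1)                            # (6, features) quantized output
```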

Another nice property of bitcodes is that the bit matrix can be converted to integers in different ways after training (e.g. turning one or two rows at a time into decimal indices), as sketched below.
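
For example, a hedged sketch of the two-rows-at-a-time variant (this assumes `to_decimal` accepts an arbitrary trailing bit dimension, which is not verified here):

```python
# Assumption: to_decimal works on any trailing bit dimension. Merging pairs
# of adjacent 4-bit rows yields one 8-bit integer per pair.
bits_merged = bits.reshape(1, -1, 8)    # [1, 3, 8]: two 4-bit rows per code
indices_8bit = to_decimal(bits_merged)  # three integers in [0, 255]
```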