Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/tky823/bitlinear158compression


https://github.com/tky823/bitlinear158compression

Last synced: 17 days ago
JSON representation

Awesome Lists containing this project

README

        

# BitLinear158Compression

In this library, we compare following compression models for inference by BitLinear158:

- `BitLinear158`: Ternary weight is managed by `float32` (naive implementation)
- `BitLinear158Int8Inference`: Ternary weight is managed by `int8`
- `BitLinear158Uint2Inference`: Ternary weight is managed by `uint2` defined in `torchao`
- `BitLinear158CppInference`: Ternary weight is managed by `int8` and forward pass is implemented by C++/CUDA. When weight is on CUDA, CUDA kernel is used under the hood.

## Installation

```sh
pip install .
```

## Test

```sh
python tests/test_bitlinear158_inference.py
```

## NOTE
There is room for improvement of computational speed by setting additional compling arguments in `setup.py`.