https://github.com/tsingmicro-toolchain/OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
- Host: GitHub
- URL: https://github.com/tsingmicro-toolchain/OnnxSlim
- Owner: tsingmicro-toolchain
- License: mit
- Created: 2023-08-22T03:57:59.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-05-16T12:29:56.000Z (6 months ago)
- Last Synced: 2024-10-21T15:29:42.091Z (22 days ago)
- Language: Python
- Homepage:
- Size: 787 KB
- Stars: 142
- Watchers: 4
- Forks: 9
- Open Issues: 8
Metadata Files:
- Readme: README.md
- License: LICENSE
# OnnxSlim
OnnxSlim helps you slim your ONNX model: fewer operators, the same accuracy, and better inference speed.
- 🚀 OnnxSlim has been merged into [mnn-llm](https://github.com/wangzhaode/mnn-llm), increasing performance by 5%
- 🚀 Ranked 1st in the [AICAS 2024 LLM inference optimization challenge](https://tianchi.aliyun.com/competition/entrance/532170/customize440) held by Arm and T-head

# Installation
## Using Prebuilt
```bash
pip install onnxslim
```
## Build From Source
```
pip install .
```

# How to use
```
onnxslim your_onnx_model slimmed_onnx_model
```

For more usage, see `onnxslim -h` or refer to our [examples](./examples).
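To check that a slimmed model really has fewer operators, the command above can be driven from a script and the operator counts compared before and after. The following is a minimal sketch, not part of OnnxSlim itself: the helper names (`build_slim_command`, `count_ops`, `removed_ops`, `slim_and_report`) are hypothetical, and it assumes the `onnxslim` command is on your `PATH` and the `onnx` Python package is installed.

```python
"""Sketch: run the onnxslim CLI from Python and compare operator counts."""
import subprocess
from collections import Counter


def build_slim_command(input_model, output_model):
    """Return the CLI call shown above: onnxslim <input> <output>."""
    return ["onnxslim", str(input_model), str(output_model)]


def count_ops(model_path):
    """Count nodes per operator type in the top-level graph."""
    import onnx  # assumption: the onnx package is installed; imported lazily

    model = onnx.load(str(model_path))
    return Counter(node.op_type for node in model.graph.node)


def removed_ops(before, after):
    """Return {op_type: nodes removed} for operator types whose count dropped."""
    return {
        op: n - after.get(op, 0)
        for op, n in before.items()
        if n > after.get(op, 0)
    }


def slim_and_report(input_model, output_model):
    """Slim a model with the CLI, then report which operators were removed."""
    # check=True raises CalledProcessError if onnxslim exits with an error.
    subprocess.run(build_slim_command(input_model, output_model), check=True)
    return removed_ops(count_ops(input_model), count_ops(output_model))
```

Calling `slim_and_report("your_onnx_model", "slimmed_onnx_model")` would return a dictionary mapping each operator type to the number of nodes that disappeared. Note that `count_ops` only looks at the top-level graph, so nodes inside control-flow subgraphs are not counted.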
# References
> * [onnx-graphsurgeon](https://github.com/NVIDIA/TensorRT/tree/main/tools/onnx-graphsurgeon)
> * [Polygraphy](https://github.com/NVIDIA/TensorRT/tree/main/tools/Polygraphy/polygraphy)
> * [onnx-simplifier](https://github.com/daquexian/onnx-simplifier)
> * [tabulate](https://github.com/astanin/python-tabulate)
> * [onnxruntime](https://github.com/microsoft/onnxruntime)

# Contact
Discord: https://discord.gg/nRw2Fd3VUS
QQ Group: 873569894