An open API service indexing awesome lists of open source software.

https://github.com/neuralmagic/quant_kernel_benchmarks

Benchmarking code for running quantized kernels from vLLM and other libraries
https://github.com/neuralmagic/quant_kernel_benchmarks

Last synced: 12 months ago
JSON representation

Benchmarking code for running quantized kernels from vLLM and other libraries

Awesome Lists containing this project

README

          

# Example Usage

Run the benchmark (generates a .pkl file with the results)

```
python benchmark_kernels.py --act-type bfloat16 --kernels torch_fp16,machete,fbgemm_i4,marlin,gemlite model_bench
```

Plot the results

```
python plot/plot_normalized_runtime.py .pkl --highlight machete
```