https://github.com/calcuis/gguf-quantizor

quantizor for gguf (bf/f16)
https://github.com/calcuis/gguf-quantizor

cutter gguf quantizor

Last synced: 4 months ago
JSON representation

quantizor for gguf (bf/f16)

Host: GitHub
URL: https://github.com/calcuis/gguf-quantizor
Owner: calcuis
License: mit
Created: 2025-01-19T22:17:23.000Z (9 months ago)
Default Branch: main
Last Pushed: 2025-05-25T20:15:09.000Z (5 months ago)
Last Synced: 2025-06-26T02:09:26.339Z (4 months ago)
Topics: cutter, gguf, quantizor
Language: C++
Homepage: https://pypi.org/project/gguf-cutter
Size: 281 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

### cutter/quantizor for gguf

#### install it via pip/pip3
```
pip install gguf-cutter
```
#### download the cutter by (if no py command; use python/python3 instead)
```
py -m gguf_cutter
```
### how to use it
tag: q2_k, q3_k_s, q3_k_m, q3_k_l, q4_k_s, q4_k_m, q5_k_s, q5_k_m, q6_k, q5_0, q5_1, q4_0, q4_1, q8_0
```
.\quantizor.exe [input_path] [output_path] [tag]
```
#### example:
if you want to cut a `f16` gguf into `q4_k_m`, you should execute the `command` above in that way (put your f16 gguf file in the current directory with `quantizor.exe` together)
```
.\quantizor.exe your-gguf-f16.gguf output-gguf-q4_k_m.gguf q4_k_m
```
then, after completing the process, the quantized `q_4_k_m` will be saved in the current directory

#
### additional chapter: make your own executable quantizor/cutter
#### compile the executable file (i.e.,.exe) for your customized machine or specifed os if the .exe above doesn't work
git clone llama.cpp:
```
git clone https://github.com/ggml-org/llama.cpp
```

apply the custom patch:
```
cd llama.cpp
git checkout tags/b4387
git apply ..\quantizor.patch
```

compile the llama-quantize for your machine:
```
mkdir build
cd build
cmake ..
cmake --build . --config Debug -j10 --target llama-quantize
```

quantize your file:
```
cd bin
.\llama-quantize.exe your-gguf-f16.gguf output-gguf-q4_k_m.gguf q4_k_m
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/calcuis/gguf-quantizor

Awesome Lists containing this project

README