Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/xest/hf2ollama
https://github.com/xest/hf2ollama
gguf huggingface llama llm ollama quantization
Last synced: 3 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/xest/hf2ollama
- Owner: xest
- License: apache-2.0
- Created: 2024-08-26T02:27:44.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-09-10T09:48:37.000Z (4 months ago)
- Last Synced: 2024-09-18T18:19:11.463Z (4 months ago)
- Topics: gguf, huggingface, llama, llm, ollama, quantization
- Language: Python
- Homepage:
- Size: 59.6 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# hf2ollama
This project was initiated with the goal of converting and quantizing the EXAONE3 model for use in Ollama. Currently, other models are also being included for performance comparison purposes.Flowchart
---------
```mermaid
graph LR
subgraph HF to Quantized GGUF
A[(Hugging Face)]-->B[safetensors]-->C[llamafied]-->D[gguf]-->E[quantized]
end
subgraph Ollama
E-. ollama create .->I[(Ollama)]
end
subgraph UI
J[CLI]<-. ollama run .->I
K[open-webui]<-. api .->I
L[LibreChat]<-. api .->I
end
```Environment Variables
---------------------
* HF_MODEL_ID : Hugging Face Model ID (default: LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct)
* QUANTIZE_METHOD : Quantize method (default: Q4_K_M)
* OLLAMA_MODEL : Ollama model name (default: LGAI-EXAONE_EXAONE-3.0-7.8B-Instruct)
* OLLAMA_HOST : Ollama host url (default: http://host.docker.internal:11434)Targets
-------
* make build : Build a docker image for building
* make download : Download the official model from Hugging Face
* make llamafy : Llamafy the official model
* make convert : Convert the Llamafied model to a gguf model
* make quantize : Quantize the gguf model
* make create : Create a model that can be used in Ollama
* make run : Run Ollama CLI
* make server : Build a docker image for serverDevelopment Targets
-------------------
* make readme : Update README.md
* make shell : Run a shell of the docker image for build
* make ollama-shell : Run a shell of the Ollama docker image
* make ollama-pull : Pull Ollama models
* make clean : Delete the docker images for building
* make distclean : Delete the docker images for building and all the files generatedTested Models
-------------
* https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
* https://huggingface.co/MLP-KTLim/llama-3-Korean-Bllossom-8B
* https://huggingface.co/yanolja/EEVE-Korean-Instruct-10.8B-v1.0
* https://huggingface.co/rtzr/ko-gemma-2-9b-it
* https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1References
----------
* https://huggingface.co/maywell/EXAONE-3.0-7.8B-Instruct-Llamafied
* https://github.com/ggerganov/llama.cpp
* https://github.com/ollama/ollama
* https://github.com/open-webui/open-webui
* https://github.com/danny-avila/LibreChat