https://github.com/continuedev/ggml-server-example

An example of running local models with GGML

Host: GitHub
URL: https://github.com/continuedev/ggml-server-example
Owner: continuedev
Created: 2023-07-18T19:26:59.000Z (almost 3 years ago)
Default Branch: main
Last Pushed: 2023-08-10T00:47:25.000Z (over 2 years ago)
Last Synced: 2025-01-09T11:32:05.395Z (over 1 year ago)
Size: 5.86 KB
Stars: 39
Watchers: 1
Forks: 6
Open Issues: 3
Metadata Files:
- Readme: README.md

README

# Step-by-step: run local models with GGML (~5 min + download time for model weights)

### Setup Python environment

1. Clone this repository: `git clone https://github.com/continuedev/ggml-server-example`

2. Move into the folder: `cd ggml-server-example`

3. Create a virtual environment: `python3 -m venv env`

4. Activate the virtual environment: `source env/bin/activate` on macOS or Linux, `env\Scripts\activate.bat` on Windows, or `source env/bin/activate.fish` if you use the fish shell

5. Install required packages: `pip install -r requirements.txt`
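
Before moving on, it can help to confirm that the virtual environment is active and that the server package can be imported. The following is a minimal sketch and not part of this repository. It assumes `requirements.txt` installs the `llama_cpp` module (the serve step invokes `llama_cpp.server`), and the helper names `in_virtualenv`, `missing_modules`, and `report` are hypothetical.

```python
import importlib.util
import sys


def in_virtualenv() -> bool:
    """Return True when Python is running inside a virtual environment."""
    # Inside a venv, sys.prefix points at the environment, while
    # sys.base_prefix still points at the base interpreter.
    return sys.prefix != sys.base_prefix


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def report(names=("llama_cpp",)):
    """Print a short summary of the environment state."""
    # Assumption: llama_cpp is the module provided by requirements.txt.
    print("virtual environment active:", in_virtualenv())
    missing = missing_modules(names)
    print("missing modules:", ", ".join(missing) if missing else "none")
```

Calling `report()` from the activated environment should print whether the venv is active and list any module that still needs installing.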

### Download a model

6. Download a model to the `models/` folder

   - Here is a convenient source of models that can be downloaded: https://huggingface.co/TheBloke

   - For example, download 4-bit quantized WizardLM-7B from here (we recommend this model): https://huggingface.co/TheBloke/wizardLM-7B-GGML/blob/main/wizardLM-7B.ggmlv3.q4_0.bin
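
The link above points at the Hugging Face web page for the file. As a rough sketch of fetching the weights from a script, the code below swaps `/blob/` for `/resolve/` in the URL (the form Hugging Face uses for raw file downloads, to the best of our knowledge) and streams the result into `models/`. The function names `to_download_url` and `download_model` are hypothetical and not part of this repository.

```python
import shutil
import urllib.request
from pathlib import Path

BLOB_URL = (
    "https://huggingface.co/TheBloke/wizardLM-7B-GGML/"
    "blob/main/wizardLM-7B.ggmlv3.q4_0.bin"
)


def to_download_url(blob_url: str) -> str:
    """Turn a web-page link (/blob/) into a raw-file link (/resolve/)."""
    # Assumption: Hugging Face serves raw files under /resolve/.
    return blob_url.replace("/blob/", "/resolve/", 1)


def download_model(blob_url: str, models_dir: str = "models") -> Path:
    """Stream the file into `models_dir` and return the local path."""
    target_dir = Path(models_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / blob_url.rsplit("/", 1)[-1]
    # Copy in chunks so the large weights file is never held in memory.
    with urllib.request.urlopen(to_download_url(blob_url)) as response:
        with open(target, "wb") as out:
            shutil.copyfileobj(response, out)
    return target


# Example call (starts a large download):
# download_model(BLOB_URL)
```

With the defaults, the file would land at `models/wizardLM-7B.ggmlv3.q4_0.bin`, which is the path the serve step expects.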

### Serve the model

7. Run the server with `python3 -m llama_cpp.server --model models/wizardLM-7B.ggmlv3.q4_0.bin`
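
Once the server is running, a quick request can confirm that the model responds. This is a minimal sketch under two assumptions: that the server listens on `localhost:8000` (its default, as far as we know) and that it exposes an OpenAI-style `/v1/completions` endpoint. The helper names `build_completion_request` and `complete` are hypothetical.

```python
import json
import urllib.request

# Assumption: the server uses its default address.
BASE_URL = "http://localhost:8000"


def build_completion_request(prompt, max_tokens=64, base_url=BASE_URL):
    """Build a POST request for the OpenAI-style completions endpoint."""
    body = json.dumps({"prompt": prompt, "max_tokens": max_tokens}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/v1/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the prompt to the running server and return the generated text."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    # Assumption: the response follows the OpenAI completions shape,
    # with the generated text in choices[0]["text"].
    return payload["choices"][0]["text"]


# Example call (requires the server from step 7 to be running):
# print(complete("Write a haiku about local models.", max_tokens=48))
```

If the request fails to connect, check the address and port printed in the server's startup output and pass them as `base_url`.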

### Use with Continue

8. To set this as your default model in Continue, open `~/.continue/config.json`, either manually or with the `/config` slash command in Continue. Then import the `GGML` class (`from continuedev.src.continuedev.libs.llm.ggml import GGML`), set `"default_model": "default=GGML(max_context_length=2048)"`, reload your VS Code window, and you're good to go!

---

## Any questions?

Happy to help. Email us at hi@continue.dev.
