https://github.com/mudler/go-ggllm.cpp

Golang bindings for ggllm.cpp
https://github.com/mudler/go-ggllm.cpp

Last synced: 8 months ago
JSON representation

Golang bindings for ggllm.cpp

Host: GitHub
URL: https://github.com/mudler/go-ggllm.cpp
Owner: mudler
License: mit
Created: 2023-07-08T14:49:28.000Z (over 2 years ago)
Default Branch: master
Last Pushed: 2023-07-09T22:30:56.000Z (over 2 years ago)
Last Synced: 2024-12-12T08:36:30.295Z (about 1 year ago)
Language: C++
Homepage:
Size: 42 KB
Stars: 5
Watchers: 2
Forks: 5
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# [![Go Reference](https://pkg.go.dev/badge/github.com/go-skynet/go-ggllm.cpp.svg)](https://pkg.go.dev/github.com/go-skynet/go-ggllm.cpp) go-ggllm.cpp

[ggllm.cpp](https://github.com/cmp-nct/ggllm.cpp) golang bindings.

The go-ggllm.cpp bindings are high level, as such most of the work is kept into the C/C++ code to avoid any extra computational cost, be more performant and lastly ease out maintenance, while keeping the usage as simple as possible.

Check out [this](https://about.sourcegraph.com/blog/go/gophercon-2018-adventures-in-cgo-performance) and [this](https://www.cockroachlabs.com/blog/the-cost-and-complexity-of-cgo/) write-ups which summarize the impact of a low-level interface which calls C functions from Go.

If you are looking for an high-level OpenAI compatible API, check out [here](https://github.com/go-skynet/LocalAI).

## Usage

Note: This repository uses git submodules to keep track of [ggllm.cpp](https://github.com/cmp-nct/ggllm.cpp).

Clone the repository locally:

```bash
git clone --recurse-submodules https://github.com/mudler/go-ggllm.cpp
```

To build the bindings locally, run:

```
cd go-ggllm.cpp
make libgglm.a
```

Now you can run the example with:

```
LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/model/path/here" -t 14
```

## Acceleration

### OpenBLAS

To build and run with OpenBLAS, for example:

```
BUILD_TYPE=openblas make libgglm.a
CGO_LDFLAGS="-lopenblas" LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/model/path/here" -t 14
```

### CuBLAS

To build with CuBLAS:

```
BUILD_TYPE=cublas make libgglm.a
CGO_LDFLAGS="-lcublas -lcudart -L/usr/local/cuda/lib64/" LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/model/path/here" -t 14
```

### OpenCL

```
BUILD_TYPE=clblas CLBLAS_DIR=... make libgglm.a
CGO_LDFLAGS="-lOpenCL -lclblast -L/usr/local/lib64/" LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/model/path/here" -t 14
```

You should see something like this from the output when using the GPU:

```
ggml_opencl: selecting platform: 'Intel(R) OpenCL HD Graphics'
ggml_opencl: selecting device: 'Intel(R) Graphics [0x46a6]'
ggml_opencl: device FP16 support: true
```

## GPU offloading

### Metal (Apple Silicon)

```
BUILD_TYPE=metal make libgglm.a
CGO_LDFLAGS="-framework Foundation -framework Metal -framework MetalKit -framework MetalPerformanceShaders" LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go build ./examples/main.go
cp build/bin/ggml-metal.metal .
./main -m "/model/path/here" -t 1 -ngl 1
```

Enjoy!

The documentation is available [here](https://pkg.go.dev/github.com/mudler/go-ggllm.cpp) and the full example code is [here](https://github.com/mudler/go-ggllm.cpp/blob/master/examples/main.go).

## License

MIT

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mudler/go-ggllm.cpp

Awesome Lists containing this project

README