https://github.com/mjrusso/wingman

Emacs package for LLM-assisted code/text completion
https://github.com/mjrusso/wingman

copilot developer-tool emacs emacs-package llama llm

Last synced: 8 months ago
JSON representation

Emacs package for LLM-assisted code/text completion

Host: GitHub
URL: https://github.com/mjrusso/wingman
Owner: mjrusso
License: gpl-3.0
Created: 2025-06-26T02:59:37.000Z (9 months ago)
Default Branch: main
Last Pushed: 2025-07-03T23:35:38.000Z (8 months ago)
Last Synced: 2025-07-04T00:27:09.943Z (8 months ago)
Topics: copilot, developer-tool, emacs, emacs-package, llama, llm
Language: Emacs Lisp
Homepage:
Size: 1.94 MB
Stars: 23
Watchers: 1
Forks: 1
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          #  wingman.el - LLM-assisted text completion

Wingman enables GitHub Copilot-style inline completions within Emacs, powered by a local (or remote) `llama.cpp` server.

_This package is a direct port of the excellent [llama.vim plugin](https://github.com/ggml-org/llama.vim) (see [technical design details](https://github.com/ggml-org/llama.cpp/pull/9787) for more background on how this works)._

![logo](./assets/logo.png)

### Features

* **Inline "Ghost Text" Completions:** Suggestions appear directly in your buffer as you type.

* **Local First:** Works entirely with a self-hosted `llama.cpp` server, ensuring your code stays on your machine.

* **Asynchronous by Default:** Never blocks your editing while waiting for a completion.

* **Response Caching:** Repeated requests for the same context are answered instantly from an in-memory cache.

* **Project-Aware Context:** Uses a ring buffer of text chunks from recently used files (scoped to the [current project](https://www.gnu.org/software/emacs/manual/html_node/emacs/Projects.html)) to provide more relevant suggestions.

## Prerequisites

This package requires a running [llama.cpp](https://github.com/ggml-org/llama.cpp) server instance, accessible at `wingman-llama-endpoint`.

## Installation

### With `use-package` and `straight.el`

```emacs-lisp

(use-package wingman

  :straight (:type git :host github :repo "mjrusso/wingman"))

```

### Manual Installation

Clone this repository:

```bash

git clone https://github.com/mjrusso/wingman.git ~/.emacs.d/lisp/wingman

```

Add then add the directory to your Emacs `load-path` in your `init.el`:

```emacs-lisp

(add-to-list 'load-path "~/.emacs.d/lisp/wingman")

(require wingman)

```

## Configuration

The behaviour of Wingman can be customized through its customization group (`M-x customize-group RET wingman RET`), or by setting variables in your `init.el`.

A minimal configuration:

```emacs-lisp

(use-package wingman

  :straight (:type git :host github :repo "mjrusso/wingman")

  ;; Enable wingman-mode in all programming modes

  :hook (prog-mode . wingman-mode))

```

An example of a more advanced configuration:

```emacs-lisp

(use-package wingman

  :straight (:type git :host github :repo "mjrusso/wingman")

  :ensure t

  :defer t

  :init

  (setq wingman-prefix-key (kbd "C-c w"))

  :hook (prog-mode . wingman-mode)

  :config

  (setq wingman-log-level 4)

  (setq wingman-ring-n-chunks 16)

  ;; default llama.cpp server port is 8012

  (setq wingman-llama-endpoint "http://127.0.0.1:8080/infill")

  ;; assumes use of Modus Themes; substitute with preferred color scheme

  (set-face-attribute 'wingman-overlay-face nil

                      :foreground  (modus-themes-get-color-value 'red-warmer)

                      :background  (modus-themes-get-color-value 'bg-inactive))

  ;; don't provide completions in files that typically contain secrets

  (add-to-list 'wingman-disable-predicates

               (lambda ()

                 (or (derived-mode-p 'envrc-file-mode)

                     (derived-mode-p 'direnv-envrc-mode)

                     (when buffer-file-name

                       (let ((fname (file-name-nondirectory buffer-file-name)))

                         (or (string-equal ".env" fname)

                             (string-equal ".envrc" fname)

                             (string-prefix-p ".localrc" fname)))))))

  :bind

  (:map wingman-mode-prefix-map

   ("TAB" . wingman-fim-inline)

   :map wingman-mode-completion-transient-map

   ("TAB" . wingman-accept-full)

   ("S-TAB" . wingman-accept-line)

   ("M-S-TAB" . wingman-accept-word)))

```

## Usage

1. **Enable the mode:** `wingman-mode` is a buffer-local minor mode. You can enable it with `M-x wingman-mode`, or globally in all applicable buffers with `global-wingman-mode`. (Alternatively, you may enable `wingman-mode` automatically via the hook as shown in the configuration examples above.)

2. **Get Completions:**

   * If `wingman-auto-fim` is `t` (the default), completions will appear automatically as you type.

   * To manually request a completion, use `wingman-fim-inline`. By default this is bound to `C-c w TAB` in the `wingman-mode-map`, and may be customized via `wingman-prefix-key` and `wingman-mode-prefix-map`.

3. **Accept a Completion:**

   * **Full:** Press the "accept full" key (default: ``) to insert the entire suggestion.

   * **Line:** Press the "accept line" key (default: `S-TAB`) to insert only the first line of the suggestion.

   * **Word:** Press the "accept word" key (default: `M-S-TAB`) to insert only the first word.

4. **Dismiss a Completion:**

   * Keep typing, or

   * Move the cursor, or

   * Press the trigger key again

## Appendix

### llama.cpp setup

#### Mac OS

```bash

brew install llama.cpp

```

#### Windows

```bash

winget install llama.cpp

```

#### Other OS

Install from your preferred package manager, build from source, or use the [latest binaries](https://github.com/ggml-org/llama.cpp/releases).

### llama.cpp settings

[Recommended settings](https://github.com/ggml-org/llama.vim/blob/master/README.md#llamacpp-settings), depending on the amount of available VRAM:

- More than 16GB VRAM:

  ```bash

  llama-server --fim-qwen-7b-default

  ```

- Less than 16GB VRAM:

  ```bash

  llama-server --fim-qwen-3b-default

  ```

- Less than 8GB VRAM:

  ```bash

  llama-server --fim-qwen-1.5b-default

  ```

Note that a [FIM ("fill-in-the-middle")-compatible model](https://huggingface.co/collections/ggml-org/llamavim-6720fece33898ac10544ecf9) is required.

## Acknowledgements

This project is a direct port of [llama.vim](https://github.com/ggml-org/llama.vim) to Emacs.

Also see: [copilot.el](https://github.com/copilot-emacs/copilot.el) and [emacs-copilot](https://github.com/jart/emacs-copilot) for alternative approaches and inspiration.

## License

This project is licensed under the GPL-3.0-or-later License. See the `LICENSE` file for details.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mjrusso/wingman

Awesome Lists containing this project

README