https://github.com/huggingface/llm-ls

LSP server leveraging LLMs for code completion (and more?)
https://github.com/huggingface/llm-ls

ai code-generation huggingface ide llamacpp llm lsp lsp-server openai self-hosted

Last synced: 6 months ago
JSON representation

LSP server leveraging LLMs for code completion (and more?)

Host: GitHub
URL: https://github.com/huggingface/llm-ls
Owner: huggingface
License: apache-2.0
Created: 2023-08-10T20:45:27.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-09-26T10:08:15.000Z (about 1 year ago)
Last Synced: 2025-05-16T18:06:43.785Z (6 months ago)
Topics: ai, code-generation, huggingface, ide, llamacpp, llm, lsp, lsp-server, openai, self-hosted
Language: Rust
Homepage:
Size: 343 KB
Stars: 771
Watchers: 21
Forks: 61
Open Issues: 29
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# llm-ls

> [!IMPORTANT]
> This is currently a work in progress, expect things to be broken!

**llm-ls** is a LSP server leveraging LLMs to make your development experience smoother and more efficient.

The goal of llm-ls is to provide a common platform for IDE extensions to be build on. llm-ls takes care of the heavy lifting with regards to interacting with LLMs so that extension code can be as lightweight as possible.

## Features

### Prompt

Uses the current file as context to generate the prompt. Can use "fill in the middle" or not depending on your needs.

It also makes sure that you are within the context window of the model by tokenizing the prompt.

### Telemetry

Gathers information about requests and completions that can enable retraining.

Note that **llm-ls** does not export any data anywhere (other than setting a user agent when querying the model API), everything is stored in a log file (`~/.cache/llm_ls/llm-ls.log`) if you set the log level to `info`.

### Completion

**llm-ls** parses the AST of the code to determine if completions should be multi line, single line or empty (no completion).

### Multiple backends

**llm-ls** is compatible with Hugging Face's [Inference API](https://huggingface.co/docs/api-inference/en/index), Hugging Face's [text-generation-inference](https://github.com/huggingface/text-generation-inference), [ollama](https://github.com/ollama/ollama) and OpenAI compatible APIs, like the [python llama.cpp server bindings](https://github.com/abetlen/llama-cpp-python?tab=readme-ov-file#openai-compatible-web-server).

## Compatible extensions

- [x] [llm.nvim](https://github.com/huggingface/llm.nvim)
- [x] [llm-vscode](https://github.com/huggingface/llm-vscode)
- [x] [llm-intellij](https://github.com/huggingface/llm-intellij)
- [ ] [jupytercoder](https://github.com/bigcode-project/jupytercoder)

## Roadmap

- support getting context from multiple files in the workspace
- add `suffix_percent` setting that determines the ratio of # of tokens for the prefix vs the suffix in the prompt
- add context window fill percent or change context_window to `max_tokens`
- filter bad suggestions (repetitive, same as below, etc)
- oltp traces ?

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/huggingface/llm-ls

Awesome Lists containing this project

README