
Lean Copilot: LLMs as Copilots for Theorem Proving in Lean
==========================================================

Lean Copilot allows large language models (LLMs) to be used natively in Lean for proof automation, e.g., suggesting tactics/premises and searching for proofs. You can use our built-in models from [LeanDojo](https://leandojo.org/) or bring your own models that run either locally (with or without GPUs) or in the cloud.

## Table of Contents

1. [Requirements](#requirements)
1. [Using Lean Copilot in Your Project](#using-lean-copilot-in-your-project)
   1. [Adding Lean Copilot as a Dependency](#adding-lean-copilot-as-a-dependency)
   1. [Getting Started with Lean Copilot](#getting-started-with-lean-copilot)
      1. [Tactic Suggestion](#tactic-suggestion)
      1. [Proof Search](#proof-search)
      1. [Premise Selection](#premise-selection)
1. [Advanced Usage](#advanced-usage)
   1. [Tactic APIs](#tactic-apis)
   1. [Model APIs](#model-apis)
   1. [Bring Your Own Model](#bring-your-own-model)
1. [Caveats](#caveats)
1. [Getting in Touch](#getting-in-touch)
1. [Acknowledgements](#acknowledgements)
1. [Citation](#citation)

## Requirements

* Supported platforms: Linux, macOS, Windows and [Windows WSL](https://learn.microsoft.com/en-us/windows/wsl/install).
* [Git LFS](https://git-lfs.com/).
* Optional (recommended if you have a [CUDA-enabled GPU](https://developer.nvidia.com/cuda-gpus)): CUDA and [cuDNN](https://developer.nvidia.com/cudnn).
* Required for building Lean Copilot itself (rather than a downstream package): CMake >= 3.7 and a C++17 compatible compiler.

## Using Lean Copilot in Your Project

:warning: Your project must use a Lean version of at least `lean4:v4.3.0-rc2`.

### Adding Lean Copilot as a Dependency

1. Add the package configuration option `moreLinkArgs := #["-L./.lake/packages/LeanCopilot/.lake/build/lib", "-lctranslate2"]` to lakefile.lean. For example,

```lean
package «my-package» {
  moreLinkArgs := #[
    "-L./.lake/packages/LeanCopilot/.lake/build/lib",
    "-lctranslate2"
  ]
}
```

Alternatively, if your project uses lakefile.toml, it should include:

```toml
moreLinkArgs = ["-L./.lake/packages/LeanCopilot/.lake/build/lib", "-lctranslate2"]
```

2. Add the following line to lakefile.lean, including the quotation marks:

```lean
require LeanCopilot from git "https://github.com/lean-dojo/LeanCopilot.git" @ "LEAN_COPILOT_VERSION"
```

For stable Lean versions (e.g., `v4.21.0`), set `LEAN_COPILOT_VERSION` to that version. For the latest unstable Lean versions (e.g., `v4.22.0-rc2`), set `LEAN_COPILOT_VERSION` to `main`. In either case, make sure the version is compatible with your other dependencies, such as mathlib. If your project uses lakefile.toml instead of lakefile.lean, it should include:

```toml
[[require]]
name = "LeanCopilot"
git = "https://github.com/lean-dojo/LeanCopilot.git"
rev = "LEAN_COPILOT_VERSION"
```

3. If you are using native Windows, add `/.lake/packages/LeanCopilot/.lake/build/lib` to your `Path` variable in Advanced System Settings > Environment Variables... > System variables.

4. Run `lake update LeanCopilot`.

5. Run `lake exe LeanCopilot/download` to download the built-in models from Hugging Face to `~/.cache/lean_copilot/`. *Alternatively*, you can download the models manually from Hugging Face:

* [ct2-leandojo-lean4-tacgen-byt5-small](https://huggingface.co/kaiyuy/ct2-leandojo-lean4-tacgen-byt5-small)
* [ct2-leandojo-lean4-retriever-byt5-small](https://huggingface.co/kaiyuy/ct2-leandojo-lean4-retriever-byt5-small)
* [premise-embeddings-leandojo-lean4-retriever-byt5-small](https://huggingface.co/kaiyuy/premise-embeddings-leandojo-lean4-retriever-byt5-small)
* [ct2-byt5-small](https://huggingface.co/kaiyuy/ct2-byt5-small)

6. Run `lake build`.

[Here](https://github.com/yangky11/lean4-example/blob/LeanCopilot-demo) is an example of a Lean package depending on Lean Copilot. If you have problems building the project, our [Dockerfile](./Dockerfile), [build.sh](scripts/build.sh) or [build_example.sh](scripts/build_example.sh) may be helpful.

### Getting Started with Lean Copilot

#### Tactic Suggestion

After `import LeanCopilot`, you can use the tactic `suggest_tactics` to generate tactic suggestions. You can click on any of the suggested tactics to use it in the proof.

*(demo: `suggest_tactics`)*
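For example, a minimal sketch of how this looks in a file (the theorem is arbitrary, and the actual suggestions depend on the model):

```lean
import LeanCopilot

-- Place `suggest_tactics` where a tactic would normally go; Lean Copilot
-- lists candidate next tactics in the infoview, and clicking one inserts it.
example (a b : Nat) : a + b = b + a := by
  suggest_tactics
```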

You can provide a prefix (e.g., `simp`) to constrain the generated tactics:

*(demo: `suggest_tactics` with a `simp` prefix)*
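A minimal sketch, assuming the prefix is passed as a string literal (see [TacticSuggestion.lean](LeanCopilotTests/TacticSuggestion.lean) for the exact syntax):

```lean
-- Only tactics starting with `simp` will be suggested.
example (a b : Nat) : a + b = b + a := by
  suggest_tactics "simp"
```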

#### Proof Search

The tactic `search_proof` combines LLM-generated tactics with [aesop](https://github.com/leanprover-community/aesop) to search for multi-tactic proofs. When a proof is found, you can click on it to insert it into the editor.

*(demo: `search_proof`)*
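A minimal sketch (the theorem is arbitrary; whether a proof is found depends on the model and the search budget):

```lean
-- `search_proof` combines LLM-generated tactics with aesop's search.
-- If a complete proof is found, click it to replace this tactic call.
example (a b : Nat) : a + b = b + a := by
  search_proof
```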

#### Premise Selection

The `select_premises` tactic retrieves a list of potentially useful premises. Currently, it uses the retriever in [LeanDojo](https://leandojo.org/) to select premises from a fixed snapshot of Lean and [mathlib4](https://github.com/leanprover-community/mathlib4/tree/3ce43c18f614b76e161f911b75a3e1ef641620ff).

![select_premises](https://github.com/lean-dojo/LeanCopilot/assets/114432581/2817663c-ba98-4a47-9ae9-5b8680b6265a)
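A minimal sketch (the theorem is arbitrary; `select_premises` only displays retrieved premises, so the proof itself still needs to be completed):

```lean
-- `select_premises` prints a list of potentially useful premises for the goal.
example (n : Nat) : Nat.gcd n n = n := by
  select_premises
  sorry  -- the retrieved premises can then guide the actual proof
```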

#### Running LLMs

You can also run inference with arbitrary LLMs directly in Lean, which can be used to build customized proof automation or other LLM-based applications (not limited to theorem proving). Models can run either locally or remotely (see [Bring Your Own Model](#bring-your-own-model)).

*(demo: running LLM inference in Lean)*
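A minimal sketch of what this can look like. The names `Builtin.generator` and `Builtin.encoder` below are assumed names for the built-in models; see [ModelAPIs.lean](LeanCopilotTests/ModelAPIs.lean) for the authoritative API.

```lean
import LeanCopilot
open LeanCopilot

-- Assumed names for the built-in models (check ModelAPIs.lean).
-- `generate` returns candidate outputs with scores; "" means no target prefix.
#eval TextToText.generate Builtin.generator "n : Nat\n⊢ Nat.gcd n n = n" ""

-- Encoders map a string to a vector embedding; here we only print its size.
#eval do
  let v ← TextToVec.encode Builtin.encoder "n : Nat\n⊢ Nat.gcd n n = n"
  return v.size
```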

## Advanced Usage

**This section is only for advanced users who would like to change the default behavior of `suggest_tactics`, `search_proof`, or `select_premises`, e.g., to use different models or hyperparameters.**

### Tactic APIs

* Examples in [TacticSuggestion.lean](LeanCopilotTests/TacticSuggestion.lean) showcase how to configure `suggest_tactics`, e.g., to use different models or generate different numbers of tactics.
* Examples in [ProofSearch.lean](LeanCopilotTests/ProofSearch.lean) showcase how to configure `search_proof` using options provided by [aesop](https://github.com/leanprover-community/aesop).
* Examples in [PremiseSelection.lean](LeanCopilotTests/PremiseSelection.lean) showcase how to set the number of retrieved premises for `select_premises`.

### Model APIs

**Examples in [ModelAPIs.lean](LeanCopilotTests/ModelAPIs.lean) showcase how to run the inference of different models and configure their parameters (temperature, beam size, etc.).**

Lean Copilot supports two kinds of models: generators and encoders. Generators must implement the `TextToText` interface:

```lean
class TextToText (τ : Type) where
  generate (model : τ) (input : String) (targetPrefix : String) : IO $ Array (String × Float)
```

* `input` is the input string.
* `targetPrefix` is used to constrain the generator's output. `""` means no constraint.
* `generate` should return an array of `String × Float`. Each `String` is an output from the model, and `Float` is the corresponding score.

We provide three types of generators:

* [`NativeGenerator`](LeanCopilot/Models/Native.lean) runs locally powered by [CTranslate2](https://github.com/OpenNMT/CTranslate2) and is linked to Lean using Foreign Function Interface (FFI).
* [`ExternalGenerator`](LeanCopilot/Models/External.lean) is hosted either locally or remotely. See [Bring Your Own Model](#bring-your-own-model) for details.
* [`GenericGenerator`](LeanCopilot/Models/Generic.lean) can be anything that implements the `generate` function in the `TextToText` typeclass (see the toy sketch below).
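As a toy illustration of the `GenericGenerator` route, the sketch below defines a hypothetical `EchoGenerator` (not part of Lean Copilot) that implements `TextToText` by returning canned candidates, filtered by the requested prefix:

```lean
import LeanCopilot
open LeanCopilot

-- Hypothetical type, for illustration only: a "model" that returns a fixed
-- list of candidate tactics with scores.
structure EchoGenerator where
  candidates : Array (String × Float) := #[("simp", 0.5), ("rfl", 0.3), ("omega", 0.2)]

instance : TextToText EchoGenerator where
  generate model _input targetPrefix :=
    -- Respect `targetPrefix`: keep only candidates that start with it.
    pure (model.candidates.filter fun (s, _) => targetPrefix.isPrefixOf s)

-- Returns #[("simp", 0.5)], since only "simp" starts with "si".
#eval TextToText.generate ({} : EchoGenerator) "⊢ True" "si"
```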

Encoders must implement `TextToVec`:

```lean
class TextToVec (τ : Type) where
  encode : τ → String → IO FloatArray
```

* `input` is the input string.
* `encode` should return a vector embedding produced by the model.

Similar to generators, we have `NativeEncoder`, `ExternalEncoder`, and `GenericEncoder`.
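Analogously, a toy encoder (hypothetical, for illustration only) just needs to implement `encode`; a real encoder would return an embedding computed by a model rather than the raw byte values used here:

```lean
import LeanCopilot
open LeanCopilot

-- Hypothetical type: "embeds" a string as the float values of its UTF-8 bytes.
structure ByteEncoder where
  scale : Float := 1.0

instance : TextToVec ByteEncoder where
  encode model input :=
    pure <| FloatArray.mk ((input.toUTF8.toList.map fun b => model.scale * b.toNat.toFloat).toArray)

#eval do
  let v ← TextToVec.encode ({} : ByteEncoder) "n : Nat\n⊢ Nat.gcd n n = n"
  return v.size  -- dimension of the (toy) embedding
```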

### Bring Your Own Model

In principle, it is possible to run any model using Lean Copilot through `ExternalGenerator` or `ExternalEncoder` (examples in [ModelAPIs.lean](LeanCopilotTests/ModelAPIs.lean)). To use a model, you need to wrap it properly to expose the APIs in [external_model_api.yaml](./external_model_api.yaml). As an example, we provide a [Python API server](./python) and use it to run a few models.

## Caveats

* Lean may occasionally crash when restarting or editing a file. Restarting the file again should fix the problem.
* `select_premises` always retrieves the original form of a premise. For example, `add_left_comm` is generated from the theorem below via `@[to_additive]`, so `select_premises` retrieves `mul_left_comm` instead of `add_left_comm`.

```lean
@[to_additive]
theorem mul_left_comm : ∀ a b c : G, a * (b * c) = b * (a * c)
```

* In some cases, `search_proof` produces an erroneous proof with error messages like `fail to show termination for ...`. A temporary workaround is changing the theorem's name before applying `search_proof`. You can change it back after `search_proof` completes.

## Getting in Touch

* For general questions and discussions, please use [GitHub Discussions](https://github.com/lean-dojo/LeanCopilot/discussions).
* To report a potential bug, please open an issue. In the issue, please include your OS information, the exact steps to reproduce the error on **the latest stable version of Lean Copilot**, and complete logs, preferably in debug mode. **Important: if your issue cannot be reproduced easily, it is unlikely to receive help.**
* Feature requests and contributions are extremely welcome. Please feel free to start a [discussion](https://github.com/lean-dojo/LeanCopilot/discussions) or open a [pull request](https://github.com/lean-dojo/LeanCopilot/pulls).

## Acknowledgements

* We thank Scott Morrison for suggestions on simplifying Lean Copilot's installation and Mac Malone for helping implement it. Both Scott and Mac work for the [Lean FRO](https://lean-fro.org/).
* We thank Jannis Limperg for supporting our LLM-generated tactics in Aesop.

## Citation

If you find our work useful, please consider citing [our paper](https://arxiv.org/abs/2404.12534):

```BibTeX
@misc{song2025leancopilotlargelanguage,
  title={Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean},
  author={Peiyang Song and Kaiyu Yang and Anima Anandkumar},
  year={2025},
  eprint={2404.12534},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  url={https://arxiv.org/abs/2404.12534},
}
```