https://github.com/kempnerinstitute/llm_uncertainty

Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
https://github.com/kempnerinstitute/llm_uncertainty

Last synced: 10 months ago
JSON representation

Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"

Host: GitHub
URL: https://github.com/kempnerinstitute/llm_uncertainty
Owner: KempnerInstitute
License: apache-2.0
Created: 2024-02-02T00:51:16.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-04-15T19:28:58.000Z (about 2 years ago)
Last Synced: 2025-03-26T02:42:40.963Z (about 1 year ago)
Language: Python
Homepage:
Size: 24.7 MB
Stars: 7
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Distinguishing the Knowable from the Unknowable with Language Models

Code for the preprint "Distinguishing the Knowable from the Unknowable with Language Models," submitted to ICML 2024.

**Abstract**: We study the feasibility of identifying *epistemic* uncertainty (reflecting a lack of knowledge), as opposed to *aleatoric* uncertainty (reflecting entropy in the underlying distribution), in the outputs of large language models (LLMs) over free-form text. In the absence of ground-truth probabilities, we explore a setting where, in order to (approximately) disentangle a given LLM's uncertainty, a significantly larger model stands in as a proxy for the ground truth. We show that small linear probes trained on the embeddings of frozen, pretrained models accurately predict when larger models will be confident at the token level and that probes trained on one text domain generalize to others. Going further, we propose a fully unsupervised method that achieves non-trivial accuracy on the same task. Taken together, we interpret these results as evidence that LLMs naturally contain internal representations of different types of uncertainty that could potentially be leveraged to devise more informative indicators of model confidence in diverse practical settings.

Correspondence to: Gustaf Ahdritz(gahdritz@g.harvard.edu), Tian Qin(tqin@g.harvard.edu), Ben Edelman(bedelman@g.harvard.edu).

## Instructions

### Obtaining model weights

Our scripts support (with some gaps) three models: LLaMA, Pythia, and Llama 2.

* LLaMA: Follow the instructions in the original `lit-llama` repository [here](https://github.com/Lightning-AI/lit-llama/blob/main/howto/download_weights.md).
* Pythia: These should download automatically from Huggingface upon first use
* Llama 2: Follow the instructions in the `lit-gpt` repository [here](https://github.com/Lightning-AI/lit-gpt/blob/main/tutorials/download_llama_2.md).

### Supervised experiments

First, precompute model embeddings with `precompute_logits.py`. Embeddings from different layers can be selected with the `return_after_layer_n` flag. We include our Wikipedia dataset (new Wikipedia articles after the LLaMA training cutoff) under `data/`.

Next, generate a dataset filter for a specific entropy band using `create_dataset_filter.py` (or `create_dataset_filter_no_gap.py` for a gapless filter).

Finally, train probes using `train_head.py.`

### Unsupervised experiments

ICLT is implemented in `repetition.py` and `generate_from_repetition.py`.

The precomputed model embeddings and the dataset filters are created same as above (using `precompute_logits.py` and `create_dataset_filter.py`). You can use `repetition.py` to run the repetiton experiment with sample script as below:

```
python repetition.py /path/to/data/set/filter/folder/ \
0 # indicate filter shard number \
/path/to/prompt.json \
llama # specify model \
7B # specify small model size \
--experiment_name default # specify the name of experiment
```
The repetition results will be saved under ```/path/to/data/set/filter/folder/experiment_name/```. To process the results for classificaiton task, run `generate_from_repetition.py`:

```
python generate_from_repetition.py \
/path/to/data/set/filter/folder/experiment_name \
llama # specify model \
7B # specify small model size
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/kempnerinstitute/llm_uncertainty

Awesome Lists containing this project

README