https://github.com/gsarti/pecore

Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑

document-level-nmt explainable-ai feature-attribution interpretability language-model large-language-models machine-translation

Host: GitHub
URL: https://github.com/gsarti/pecore
Owner: gsarti
License: apache-2.0
Created: 2023-08-04T14:17:07.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2024-04-18T08:34:08.000Z (about 1 year ago)
Last Synced: 2025-04-20T04:34:17.575Z (28 days ago)
Topics: document-level-nmt, explainable-ai, feature-attribution, interpretability, language-model, large-language-models, machine-translation
Language: Jupyter Notebook
Homepage: https://huggingface.co/spaces/gsarti/pecore
Size: 295 MB
Stars: 14
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# Quantifying the Plausibility of Context Reliance in Neural Machine Translation

[Gabriele Sarti](https://gsarti.com) • [Grzegorz Chrupała](https://grzegorz.chrupala.me/) • [Malvina Nissim](https://malvinanissim.github.io/) • [Arianna Bisazza](https://www.cs.rug.nl/~bisazza/)

PECoRe two-step process
PECoRe examples

> **Abstract:** Establishing whether language models can use contextual information in a human-plausible way is important to ensure their safe adoption in real-world settings. However, the questions of when and which parts of the context affect model generations are typically tackled separately, and current plausibility evaluations are practically limited to a handful of artificial benchmarks. To address this, we introduce Plausibility Evaluation of Context Reliance (PECoRe), an end-to-end interpretability framework designed to quantify context usage in language models’ generations. Our approach leverages model internals to (i) contrastively identify context-sensitive target tokens in generated texts and (ii) link them to contextual cues justifying their prediction. We use PECoRe to quantify the plausibility of context-aware machine translation models, comparing model rationales with human annotations across several discourse-level phenomena. Finally, we apply our method to unannotated generations to identify context-mediated predictions and highlight instances of (im)plausible context usage in model translations.

This repository contains scripts and notebooks associated with the paper ["Quantifying the Plausibility of Context Reliance in Neural Machine Translation"](https://openreview.net/forum?id=XTHfNGI3zT). If you use any of the following contents in your work, we kindly ask you to cite our paper:

```bibtex
@inproceedings{sarti-etal-2023-quantifying,
title = "Quantifying the Plausibility of Context Reliance in Neural Machine Translation",
author = "Sarti, Gabriele and
Chrupa{\l}a, Grzegorz and
Nissim, Malvina and
Bisazza, Arianna",
booktitle = "The Twelfth International Conference on Learning Representations (ICLR 2024)",
month = may,
year = "2024",
address = "Vienna, Austria",
publisher = "OpenReview",
url = "https://openreview.net/forum?id=XTHfNGI3zT"
}
```

### Using PECoRe

> [!TIP]
> ✨ You can try PECoRe from our online demo on [Hugging Face Spaces](https://huggingface.co/spaces/gsarti/pecore).

While this repository implements the functions used in the experimental evaluation of the paper, a new CLI implementation of PECoRe is available through the [Inseq interpretability library](https://github.com/inseq-team/inseq). We strongly recommend that researchers adopt that implementation, as it is more robust and generalizable: it supports all decoder-only and encoder-decoder models from the Hugging Face Transformers library for input and output context dependence detection and attribution. Refer to the `inseq attribute-context` section in the Inseq README for more details.

### Artifacts

All artifacts for the paper, including fine-tuned models and training/evaluation datasets, are available in the [PECoRe HuggingFace Collection](https://huggingface.co/collections/gsarti/pecore-iclr-2024-65edab42e28439e21b612c2e). An online demo is available on [Hugging Face Spaces](https://huggingface.co/spaces/gsarti/pecore).

### Train a Context-Aware NMT Model

Context-aware NMT models are trained using the `train_context_aware_mt_model.py` script, a modification of the original [`run_translation_no_trainer.py`](https://github.com/huggingface/transformers/blob/main/examples/pytorch/translation/run_translation_no_trainer.py) example. It adds the following arguments for contextual model training:

- `context_size`: The number of context sentences to use for training. The default value is 0 (sentence-level training).

- `sample_context`: If set, the size of the context for every example is sampled from a uniform distribution between 0 and `context_size` (inclusive). If not passed and `context_size` is greater than 0, the context size is always equal to `context_size`.

- `context_word_dropout`: Probability between 0 and 1 of dropping a word from the context. The default value is 0 (no dropout).

- `use_target_context`: If set, the context is also included in the translated text for the training loss. In that case, the output format for an input `src_ctx src` becomes `tgt_ctx tgt`. Otherwise the output format is `tgt` (only `src` is translated).
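
The interaction between these options can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code in `train_context_aware_mt_model.py`: the helper name `build_example`, the `<brk>` separator default, keeping the context sentences closest to the current one, and applying word dropout only to the source-side context are all hypothetical choices made for the example.

```python
import random


def build_example(src_ctx, src, tgt_ctx, tgt, context_size=0, sample_context=False,
                  context_word_dropout=0.0, use_target_context=False,
                  sep="<brk>", rng=None):
    """Hypothetical helper: build one (source, target) training pair.

    src_ctx / tgt_ctx are lists of preceding sentences, src / tgt the current ones.
    `sep` is an assumed context separator, not necessarily the one used by the script.
    """
    rng = rng or random.Random()
    # Context size is either fixed or sampled uniformly in [0, context_size].
    n = rng.randint(0, context_size) if sample_context else context_size
    if n == 0:
        # Sentence-level example: no context on either side.
        return src, tgt
    # Drop each context word independently with probability context_word_dropout.
    dropped = [
        " ".join(w for w in sent.split() if rng.random() >= context_word_dropout)
        for sent in src_ctx[-n:]
    ]
    source = " ".join(part for part in (*dropped, sep, src) if part)
    if use_target_context:
        # The target context is also part of the training target.
        target = " ".join([*tgt_ctx[-n:], sep, tgt])
    else:
        target = tgt
    return source, target


source, target = build_example(
    ["Z.", "A.", "B."], "C.", ["z.", "a.", "b."], "c.",
    context_size=2, use_target_context=True,
)
print(source)
print(target)
```

With `sample_context=True`, repeated calls yield examples with anywhere from 0 to `context_size` context sentences, so a single model sees both sentence-level and context-augmented inputs during training.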

Example usage

Here is an example of fine-tuning an mBART 1-to-50 model on the context-augmented IWSLT17 dataset with up to 4 context sentences and a 10% context word dropout:

```shell
accelerate launch scripts/train_context_aware_mt_model.py \
--model_name_or_path facebook/mbart-large-50-one-to-many-mmt \
--source_lang en_XX \
--target_lang fr_XX \
--dataset_name gsarti/iwslt2017_context \
--dataset_config_name iwslt2017-en-fr \
--output_dir outputs/models/iwslt17-mbart50-1toM-ctx4-cwd1-en-fr \
--num_beams 5 \
--max_source_length 512 \
--max_target_length 128 \
--num_train_epochs 20 \
--gradient_accumulation_steps 4 \
--per_device_train_batch_size 8 \
--num_warmup_steps 500 \
--learning_rate 3e-4 \
--checkpointing_steps epoch \
--with_tracking \
--report_to tensorboard \
--context_size 4 \
--sample_context \
--context_word_dropout 0.1
```

Here is an example of continuing the fine-tuning of a context-aware En->Fr OpusMT model on the training portion of SCAT with up to 4 context sentences and a 10% context word dropout:

```shell
accelerate launch scripts/train_context_aware_mt_model.py \
--model_name_or_path context-mt/iwslt17-marian-big-ctx4-cwd1-en-fr \
--dataset_name inseq/scat \
--dataset_config_name sentences \
--output_dir outputs/models/scat-marian-big-ctx4-cwd1-en-fr \
--num_beams 5 \
--max_source_length 512 \
--max_target_length 128 \
--num_train_epochs 2 \
--gradient_accumulation_steps 2 \
--per_device_train_batch_size 8 \
--num_warmup_steps 0 \
--learning_rate 5e-5 \
--checkpointing_steps 1000 \
--logging_steps 200 \
--with_tracking \
--report_to tensorboard \
--context_size 4 \
--sample_context \
--context_word_dropout 0.1
```

## Using the PECoRe CLI

The PECoRe CLI is a command-line interface for running the PECoRe steps on a given model and dataset. The CLI is implemented in the `pecore/cli.py` script and can be used as `pecore-viz` upon installing the package with `pip install -e .`. The current implementation supports the identification of context-sensitive targets (CTI) and the imputation of contextual cues (CCI) for all encoder-decoder models supported by the [Inseq](https://github.com/inseq-team/inseq) framework, including models with language prefix tags (mBART-50, NLLB, M2M100) and models trained with special context tags (e.g. the collection of models found in the [context-mt](https://huggingface.co/context-mt) organization on the HF Hub). The CLI can be used to run the PECoRe steps on a given model and example as follows:

```shell
pecore-viz \
--model_name context-mt/scat-marian-small-ctx4-cwd1-en-fr \
--attributions_aggregate_fns sum \
--model_use_ctx_break \
--impute_with_contextless_output \
--force_context_aware_output_prefix \
--input "Did I mention we stole a cow? A beautiful animal, truly. We brought it to the stable and kept it there for ages. Sadly, we could not foresee it would disappear."
```

The example above produces the following output, correctly highlighting the dependence of the pronoun "il" on the nouns "cow" and "animal" in the context.

```shell
Context with contextual cues (std λ=1.00) followed by output sentence
with context-sensitive target spans (std λ=1.00):

Input context: Did I mention we stole a cow? A beautiful animal, truly. We brought it to the stable and kept it there for ages.
Input current: Sadly, we could not foresee it would disappear.
Context-aware output: Malheureusement, nous n'avons pas pu prévoir qu'il disparaîtrait.
Using '<brk>' to separate context and current inputs.

#1. (CTI |kl_divergence| > 0.14, CCI |saliency| > 0.71)
Contextless output: Malheureusement, nous n'avons pas pu prévoir qu'il disparaîtrait.
Current output: Malheureusement, nous n'avons pas pu prévoir qu'il(0.412) disparaîtrait.
Input context: Did I mention we stole a cow(1.524)? A beautiful animal(1.472), truly. We brought it to the stable and kept it
there for ages.
```
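
The thresholds reported in the output can be read with the following sketch of the CTI selection step. It is an illustration under assumptions rather than the PECoRe implementation: the function names are hypothetical, the KL direction (contextual versus contextless distribution) and the P-CXMI sign are assumed, and "std λ" is interpreted here as selecting tokens whose score exceeds the mean by λ standard deviations.

```python
import math
from statistics import mean, pstdev


def kl_divergence(p_ctx, p_noctx):
    """KL(P_ctx || P_noctx) for two next-token distributions over the same vocabulary.

    Assumes p_noctx is non-zero wherever p_ctx is non-zero.
    """
    return sum(p * math.log(p / q) for p, q in zip(p_ctx, p_noctx) if p > 0)


def pcxmi(p_ctx_token, p_noctx_token):
    """Pointwise score: log-ratio of the generated token's probability with/without context."""
    return math.log(p_ctx_token / p_noctx_token)


def select_above_threshold(scores, lam=1.0):
    """Return indices of scores above mean + lam * std, together with the threshold."""
    threshold = mean(scores) + lam * pstdev(scores)
    selected = [i for i, s in enumerate(scores) if s > threshold]
    return selected, threshold


# Toy per-token scores for a five-token output: only one token stands out.
token_scores = [0.01, 0.02, 0.01, 0.41, 0.02]
selected, threshold = select_above_threshold(token_scores, lam=1.0)
print(selected, round(threshold, 3))
```

A larger λ makes the selection stricter, flagging fewer tokens as context-sensitive; the same thresholding idea can be applied to attribution scores over context tokens in the CCI step.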

When using the CLI with a regular model, an additional step is needed to specify the position of the context break in the model's generation if an output is not forced by the user. Here is an example using the regular mBART-50 model from the HF Hub:

```shell
pecore-viz \
--model_name facebook/mbart-large-50-one-to-many-mmt \
--input_lang eng --output_lang fra --model_type mbart50-1toM \
--impute_with_contextless_output \
--force_context_aware_output_prefix \
--input "Did I mention we stole a cow? A beautiful animal, truly. We brought it to the stable and kept it there for ages. Sadly, we could not foresee it would disappear."
```

The user will be prompted with the following message:

```shell
The following output was generate by the model: J’ai mentionné que nous avons volé une vache, c’est vraiment un beau animal, que nous avons emmené à l’élevage et que nous l’avons gardée pendant des époques. Malheureusement, nous n’avons pas pu prévoir qu’elle disparaîtrait.
Rewrite it here by adding '<brk>' wherever appropriate to mark context break:
```

The user can then rewrite the output by adding `<brk>` wherever appropriate to mark the context break:

```shell
J’ai mentionné que nous avons volé une vache, c’est vraiment un beau animal, que nous avons emmené à l’élevage et que nous l’avons gardée pendant des époques. <brk> Malheureusement, nous n’avons pas pu prévoir qu’elle disparaîtrait.
```

The final output will be:

```shell
Context with contextual cues (std λ=1.00) followed by output sentence
with context-sensitive target spans (std λ=1.00):

Input context: Did I mention we stole a cow? A beautiful animal, truly. We brought it to the stable and kept it there for ages.
Input current: Sadly, we could not foresee it would disappear.
Output context: J’ai mentionné que nous avons volé une vache, c’est vraiment un beau animal, que nous avons emmené à l’élevage et que nous l’avons gardée pendant
des époques.
Context-aware output: J’ai mentionné que nous avons volé une vache, c’est vraiment un beau animal, que nous avons emmené à l’élevage et que nous l’avons gardée
pendant des époques. Malheureusement, nous n’avons pas pu prévoir qu’elle disparaîtrait.
Using language tags for model type 'mbart50-1toM' (eng -> fra).

#1. (CTI |kl_divergence| > 1.08, CCI |saliency| > 0.00)
Contextless output: Malheureusement, nous n'avons pas pu prévoir sa disparition.
Current output: Malheureusement, nous n’(3.505)avons pas pu prévoir qu’elle disparaîtrait.
Input context: Did I mention we stole a cow? A beautiful animal, truly. We brought it to the stable(0.002) and kept it there for ages.
Output context: J’(0.004)ai mentionné que nous avons volé une vache, c’(0.002)est vraiment un beau animal, que nous avons emmené à l’(0.003)élevage et que nous
l’(0.007)avons gardée pendant des époques.
```

In this case, the model generates the curved apostrophe `’` rather than the straight one `'` found in the contextless output. It does so to match the style of the output context, where that character appears several times (these occurrences are identified as contextual cues by PECoRe).

### Customizing Attribution Method

In this example, we use the attention weight of head 8 in layer 5 for attributing context dependence. This head was found empirically to align well with human intuition.

```shell
pecore-viz \
--model_name context-mt/scat-marian-small-ctx4-cwd1-en-fr \
--attributions_aggregate_fns mean mean \
--model_use_ctx_break \
--impute_with_contextless_output \
--force_context_aware_output_prefix \
--input "Did I mention we stole a cow? A beautiful animal, truly. We brought it to the stable and kept it there for ages. Sadly, we could not foresee it would disappear." \
--attribution_method attention \
--select_attributions_idx 7 4
```

## Reproducing the Paper Results

### Translate with a Context-Aware NMT Model

```shell
python scripts/translate.py \
--model_type mbart50-1toM \
--model_id mbart50-1toM-scat \
--model_name context-mt/scat-mbart50-1toM-ctx4-cwd1-en-fr \
--context_size 4 \
--dataset scat \
--context_word_dropout 1

python scripts/translate.py \
--model_type marian-big \
--model_id marian-big-scat-target \
--model_name context-mt/scat-marian-big-target-ctx4-cwd0-en-fr \
--context_size 4 \
--dataset disc_eval_mt \
--context_word_dropout 0 \
--dataset_config anaphora

python scripts/translate.py \
--model_type marian-big \
--model_id marian-big-scat \
--model_name context-mt/scat-marian-big-ctx4-cwd1-en-fr \
--context_size 4 \
--dataset disc_eval_mt \
--context_word_dropout 1 \
--dataset_config anaphora

python scripts/translate.py \
--model_type mbart50-1toM \
--model_id mbart50-1toM-scat \
--model_name context-mt/scat-mbart50-1toM-ctx4-cwd1-en-fr \
--context_size 0 \
--dataset disc_eval_mt \
--context_word_dropout 0 \
--dataset_config lexical-choice
```

### Evaluate a Context-Aware NMT Model

```shell
python scripts/evaluate_mt_outputs.py \
--filepath outputs/translations/ctx/scat-marian-small-scat-target.txt \
--model_id marian-small-scat-target \
--dataset scat \
--src_lang eng \
--tgt_lang fra \
--metrics bleu comet accuracy flip \
--has_target_context \
--max_idx 250

python scripts/evaluate_mt_outputs.py \
--filepath outputs/translations/ctx/disc_eval_mt-anaphora-marian-small-scat-target.txt \
--model_id marian-small-scat-target \
--dataset disc_eval_mt \
--src_lang eng \
--tgt_lang fra \
--metrics bleu comet accuracy flip \
--has_target_context \
--max_idx 250

python scripts/evaluate_mt_outputs.py \
--filepath outputs/translations/ctx/scat-mbart50-1toM-scat.txt \
--model_id mbart50-1toM-scat \
--dataset scat \
--src_lang eng \
--tgt_lang fra \
--metrics bleu comet accuracy
```

### Create examples for running PECoRe steps

```shell
python scripts/generate_examples.py \
--dataset scat \
--model_name context-mt/scat-marian-small-target-ctx4-cwd0-en-fr \
--src_lang eng \
--tgt_lang fra \
--model_id marian-small-scat-target \
--model_type marian-small \
--has_context \
--has_contrast \
--has_target_context

python scripts/generate_examples.py \
--dataset scat \
--model_name context-mt/scat-mbart50-1toM-target-ctx4-cwd0-en-fr \
--src_lang eng \
--tgt_lang fra \
--model_id mbart50-1toM-scat-target \
--model_type mbart50-1toM \
--has_context \
--has_target_context \
--has_contrast

python scripts/generate_examples.py \
--dataset disc_eval_mt \
--dataset_config anaphora \
--model_name context-mt/scat-marian-small-ctx4-cwd1-en-fr \
--src_lang eng \
--tgt_lang fra \
--model_id marian-small-scat \
--model_type marian-small \
--has_context \
--has_contrast

python scripts/generate_examples.py \
--dataset scat \
--model_name Helsinki-NLP/opus-mt-en-fr \
--src_lang eng \
--tgt_lang fra \
--model_id marian-small \
--model_type marian-small \
--has_contrast
```

### PECoRe Step 1: Context-sensitive Target Identification (CTI)

```shell
python scripts/tag_cti_metrics.py \
--examples_path outputs/processed_examples/scat-marian-small-scat.tsv \
--model_name context-mt/scat-marian-small-ctx4-cwd1-en-fr \
--model_type marian-small

python scripts/tag_cti_metrics.py \
--examples_path outputs/processed_examples/scat-marian-big-scat.tsv \
--model_name context-mt/scat-marian-big-ctx4-cwd1-en-fr \
--model_type marian-big

python scripts/tag_cti_metrics.py \
--examples_path outputs/processed_examples/scat-mbart50-1toM-scat.tsv \
--model_name context-mt/scat-mbart50-1toM-ctx4-cwd1-en-fr \
--model_type mbart50-1toM

python scripts/tag_cti_metrics.py \
--examples_path outputs/processed_examples/scat-marian-small-scat-target.tsv \
--model_name context-mt/scat-marian-small-target-ctx4-cwd0-en-fr \
--model_type marian-small

python scripts/tag_cti_metrics.py \
--examples_path outputs/processed_examples/scat-marian-big-scat-target.tsv \
--model_name context-mt/scat-marian-big-target-ctx4-cwd0-en-fr \
--model_type marian-big

python scripts/tag_cti_metrics.py \
--examples_path outputs/processed_examples/scat-mbart50-1toM-scat-target.tsv \
--model_name context-mt/scat-mbart50-1toM-target-ctx4-cwd0-en-fr \
--model_type mbart50-1toM
```

### PECoRe Step 2: Contextual Cues Imputation (CCI)

```shell
python scripts/tag_cci_metrics.py \
--examples_path outputs/processed_examples/scat-marian-small-scat.tsv \
--model_name context-mt/scat-marian-small-ctx4-cwd1-en-fr \
--model_type marian-small

python scripts/tag_cci_metrics.py \
--examples_path outputs/processed_examples/scat-mbart50-1toM-scat-target.tsv \
--model_name context-mt/scat-mbart50-1toM-target-ctx4-cwd0-en-fr \
--model_type mbart50-1toM
```

### Evaluate PECoRe Metrics

```shell
python scripts/evaluate_tagged_metrics.py \
--scores_path outputs/scores/scat-marian-small-scat-cti.tsv \
--eval_mode cti \
--use_trained_model

python scripts/evaluate_tagged_metrics.py \
--scores_path outputs/scores/scat-marian-small-scat-cti.tsv \
--eval_mode cti \
--average_example_scores \
--metrics random pcxmi kl_divergence \
--save_preds

python scripts/evaluate_tagged_metrics.py \
--scores_path outputs/scores/scat-marian-small-scat-cci.tsv \
--eval_mode cci \
--example_target_column is_supporting_context \
--average_example_scores \
--metrics random saliency_contrast_prob_diff attention_default attention_best

python scripts/evaluate_tagged_metrics.py \
--scores_path outputs/scores/scat-marian-small-scat-target-cti.tsv \
--eval_mode cti \
--average_example_scores \
--metrics random pcxmi kl_divergence \
--save_preds
```
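
As a rough intuition for what comparing tagged scores with human annotations involves, the sketch below thresholds per-token scores and computes an F1 against binary gold labels, alongside a random-score baseline. It is a hypothetical illustration: the function names, the fixed threshold, and the choice of F1 are assumptions and do not necessarily match what `evaluate_tagged_metrics.py` computes.

```python
import random


def f1_against_gold(scores, gold, threshold):
    """F1 of the tokens with score > threshold against binary human annotations."""
    pred = [s > threshold for s in scores]
    tp = sum(1 for p, g in zip(pred, gold) if p and g)
    fp = sum(1 for p, g in zip(pred, gold) if p and not g)
    fn = sum(1 for p, g in zip(pred, gold) if not p and g)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def random_baseline(gold, threshold, seed=0):
    """Same evaluation with uniformly random scores, as a sanity-check baseline."""
    rng = random.Random(seed)
    return f1_against_gold([rng.random() for _ in gold], gold, threshold)


# Toy example: one annotated token, two tokens flagged by the metric.
scores = [0.1, 0.9, 0.2, 0.8]
gold = [False, True, False, False]
print(f1_against_gold(scores, gold, threshold=0.5))
print(random_baseline(gold, threshold=0.5))
```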
