Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/heartcored98/transformer_anatomy
Official PyTorch implementation of "Roles and Utilization of Attention Heads in Transformer-based Neural Language Models", ACL 2020
attention-head interpretability interpretable-deep-learning transformer-encoder
- Host: GitHub
- URL: https://github.com/heartcored98/transformer_anatomy
- Owner: heartcored98
- License: mit
- Created: 2019-01-22T15:41:03.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2022-10-25T08:19:11.000Z (about 2 years ago)
- Last Synced: 2024-08-02T20:46:53.032Z (3 months ago)
- Topics: attention-head, interpretability, interpretable-deep-learning, transformer-encoder
- Language: Python
- Homepage:
- Size: 30.8 MB
- Stars: 14
- Watchers: 2
- Forks: 4
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- ATPapers - heartcored98 / Transformer_Anatomy - Toolkit for finding and analyzing important attention heads in transformer-based models (Pretrained Language Model / Repository)
README
## [Transformer Anatomy] Roles and Utilization of Attention Heads in Transformer-based Neural Language Models (ACL 2020)
Official PyTorch implementation of **Transformer Anatomy** | [Paper](https://www.aclweb.org/anthology/2020.acl-main.311/)
Jae-young Jo<sup>1,2</sup>, Sung-hyon Myaeng<sup>1</sup>
<sup>1</sup> KAIST
<sup>2</sup> Dingbro, Inc.

## Abstract
Sentence encoders based on the transformer architecture have shown promising results on various natural language tasks. The main impetus lies in the pre-trained neural language models that capture long-range dependencies among words, owing to multi-head attention that is unique in the architecture. However, little is known for how linguistic properties are processed, represented, and utilized for downstream tasks among hundreds of attention heads inside the pre-trained transformer-based model. For the initial goal of examining the roles of attention heads in handling a set of linguistic features, we conducted a set of experiments with ten probing tasks and three downstream tasks on four pre-trained transformer families (GPT, GPT2, BERT, and ELECTRA). Meaningful insights are shown through the lens of heat map visualization and utilized to propose a relatively simple sentence representation method that takes advantage of most influential attention heads, resulting in additional performance improvements on the downstream tasks.
> [Note] This repository is a work in progress for better reproducibility.
### DEMO #1 - Inspecting Internal Linguistic Information Handling Inside Transformers
![Image](https://github.com/heartcored98/Transformer_Anatomy/blob/master/imgs/showcase1.png?raw=true)
Heatmaps of attention head-wise evaluation on five sentence probing tasks with the pre-trained BERT BASE model. Each column corresponds to one of the following tasks (Length, Depth, BigramShift, CoordinationInversion, and Tense, from the left). For each heatmap, the x-axis shows the index of the attention head and the y-axis shows the layer number (the lower, the closer to the initial input).
The brighter the color, the higher the accuracy for that attention head and hence the more important it is for the task. Note that attention heads in the same layer are ordered by their classification accuracy (i.e., the attention head with the highest accuracy in a layer is at the left-most position). These heatmaps give an intuitive picture of where task-related information is handled inside the sentence encoder; a minimal plotting sketch is given after the observations below.
We can observe the following internal tendencies of **BERT BASE** across various linguistic features.
- Surface- and syntax-related information (Length and Depth) is usually captured by attention heads close to the input layer.
- Word- and clause-order-related information (BigramShift and CoordinationInversion) is best captured by attention heads located in the middle layers.
- Semantics-related information (Tense) is best captured by attention heads close to the output layer.
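For illustration only, here is a minimal sketch of how such a heatmap can be laid out, assuming you already have a matrix of per-head probing accuracies for one task; the accuracy values below are random placeholders, not results from the paper or this repository.

```python
# Minimal heatmap sketch: rows are layers (bottom row = closest to the input),
# columns are attention heads sorted within each layer by probing accuracy
# (best head left-most). The accuracies are random placeholder data.
import numpy as np
import matplotlib.pyplot as plt

n_layers, n_heads = 12, 12                                   # BERT-base geometry
acc = np.random.uniform(0.5, 0.9, size=(n_layers, n_heads))  # placeholder accuracies

sorted_acc = -np.sort(-acc, axis=1)  # sort each layer's heads in descending accuracy

plt.imshow(sorted_acc, origin="lower", cmap="viridis", aspect="auto")
plt.xlabel("attention head (sorted by accuracy)")
plt.ylabel("layer (lower = closer to input)")
plt.colorbar(label="probing accuracy")
plt.title("Head-wise probing accuracy (placeholder data)")
plt.show()
```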
### DEMO #2 - Boosting Downstream Task Performance
##### (Extracting Essential Hidden Sentence Representations)
![Image](https://github.com/heartcored98/Transformer_Anatomy/blob/master/imgs/showcase2_downstream_heatmap.png?raw=true)
Heatmaps of attention head-wise evaluation on the four downstream tasks with the pre-trained BERT BASE model. Each column corresponds to one of the following tasks (SST5, TREC, SICKEntailment, and MRPC, from the left).
We can observe a similar internal tendency of **BERT BASE** for the various kinds of downstream-task-related information.
According to our results, these reconstructed sentence representations outperform the sentence representation taken from the last layer. Furthermore, we show how downstream performance can be improved simply by extracting the best-performing internal hidden representations and using them as the sentence representation.
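As a rough illustration of the idea (not the repository's actual API), the sketch below uses forward hooks on a pre-trained BERT from HuggingFace `transformers` to capture per-head context vectors, mean-pools them over tokens, and concatenates a hand-picked set of (layer, head) pairs into a sentence representation. The selected head indices and the pooling choice are assumptions made for the example; in the paper the heads are chosen by their measured accuracy.

```python
# Sketch (assumptions noted in comments): build a sentence representation from
# selected attention heads of a pre-trained BERT via forward hooks.
import torch
from transformers import BertModel, BertTokenizer

model = BertModel.from_pretrained("bert-base-uncased").eval()
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

num_heads = model.config.num_attention_heads      # 12 for BERT-base
head_dim = model.config.hidden_size // num_heads  # 64 for BERT-base

captured = {}  # layer index -> self-attention output of shape (batch, seq_len, hidden_size)

def make_hook(layer_idx):
    def hook(module, inputs, outputs):
        out = outputs[0] if isinstance(outputs, tuple) else outputs
        captured[layer_idx] = out.detach()  # concatenation of all head context vectors
    return hook

for i, layer in enumerate(model.encoder.layer):
    layer.attention.self.register_forward_hook(make_hook(i))

def head_sentence_vectors(sentence, selected):
    """Concatenate mean-pooled context vectors of the selected (layer, head) pairs."""
    enc = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        model(**enc)
    pieces = []
    for layer_idx, head_idx in selected:
        out = captured[layer_idx]                               # (1, seq, hidden)
        per_head = out.view(out.size(0), out.size(1), num_heads, head_dim)
        pieces.append(per_head[:, :, head_idx, :].mean(dim=1))  # mean over tokens
    return torch.cat(pieces, dim=-1)                            # (1, len(selected) * head_dim)

# Example with three hypothetical high-scoring heads (placeholder indices).
vec = head_sentence_vectors("The cat sat on the mat.", [(9, 3), (10, 7), (11, 0)])
print(vec.shape)  # torch.Size([1, 192])
```

Forward hooks are used here so the pre-trained model does not have to be modified; splitting by `head_dim` recovers per-head vectors because the self-attention output is the concatenation of the head context vectors before the output projection.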
## How to cite
```
@inproceedings{jo-myaeng-2020-roles,
title = "Roles and Utilization of Attention Heads in Transformer-based Neural Language Models",
author = "Jo, Jae-young and
Myaeng, Sung-Hyon",
booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics",
month = jul,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.acl-main.311",
pages = "3404--3417",
abstract = "Sentence encoders based on the transformer architecture have shown promising results on various natural language tasks. The main impetus lies in the pre-trained neural language models that capture long-range dependencies among words, owing to multi-head attention that is unique in the architecture. However, little is known for how linguistic properties are processed, represented, and utilized for downstream tasks among hundreds of attention heads inside the pre-trained transformer-based model. For the initial goal of examining the roles of attention heads in handling a set of linguistic features, we conducted a set of experiments with ten probing tasks and three downstream tasks on four pre-trained transformer families (GPT, GPT2, BERT, and ELECTRA). Meaningful insights are shown through the lens of heat map visualization and utilized to propose a relatively simple sentence representation method that takes advantage of most influential attention heads, resulting in additional performance improvements on the downstream tasks.",
}
```

## Reference
This implementation is largely based on the following projects.
- [huggingface's pytorch-pretrained-BERT](https://github.com/huggingface/pytorch-pretrained-BERT)
- [facebookresearch's SentEval](https://github.com/facebookresearch/SentEval)

**huggingface's pytorch-pretrained-BERT** provides pre-trained transformer encoders for not only BERT but also GPT/GPT2 and Transformer-XL models. The results in the paper were produced using this implementation with **slight changes**.
**facebookresearch's SentEval** provides a toolkit for benchmarking the sentence embeddings of a given sentence encoder on 17 downstream tasks and 10 probing tasks. The benchmark results in the paper were produced using these pre-implemented datasets.
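To make the benchmarking flow concrete, here is a hedged sketch of how such an encoder could be plugged into SentEval. It assumes SentEval and its task data are installed under `./SentEval`, reuses the hypothetical `head_sentence_vectors` helper from the sketch above, and uses placeholder (layer, head) pairs and classifier settings rather than the paper's exact configuration.

```python
# Sketch: benchmark a head-based sentence representation with SentEval.
import numpy as np
import senteval

SELECTED_HEADS = [(9, 3), (10, 7), (11, 0)]  # placeholder (layer, head) pairs

def prepare(params, samples):
    return  # nothing to precompute for a fixed pre-trained encoder

def batcher(params, batch):
    # SentEval passes each sentence as a list of tokens; rejoin them into a string.
    sentences = [" ".join(tokens) if tokens else "." for tokens in batch]
    vecs = [head_sentence_vectors(s, SELECTED_HEADS).squeeze(0).numpy() for s in sentences]
    return np.vstack(vecs)

params = {
    "task_path": "./SentEval/data", "usepytorch": True, "kfold": 5,
    "classifier": {"nhid": 0, "optim": "adam", "batch_size": 64,
                   "tenacity": 5, "epoch_size": 4},
}

se = senteval.engine.SE(params, batcher, prepare)
tasks = ["Length", "BigramShift", "Tense", "SST5", "TREC", "SICKEntailment", "MRPC"]
print(se.eval(tasks))
```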
## TODO
- [X] Clean up directory structure
- [X] Recover layer-wise, head-wise | probing-task, downstream-task
- [X] Recover fine-tuning and solve dependency with legacy code
- [X] Add experiments result access class
- [ ] Update embedding caching algorithm -> pickle to parquet
- [ ] Run multiple trials based on Ray (sharing the cache object)
- [ ] Refactor the Anatomy wrapper class
- [ ] Semantic / Syntactic Visualization