Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sap-samples/emnlp2021-attention-contrastive-learning
Repository supporting an accepted paper at EMNLP 2021 on attention contrastive learning.
https://github.com/sap-samples/emnlp2021-attention-contrastive-learning
ai artificial-intelligence emnlp2021 machine-learning natural-language-processing nlp research-paper sample
Last synced: about 2 months ago
JSON representation
Repository supporting an accepted paper at EMNLP 2021 on attention contrastive learning.
- Host: GitHub
- URL: https://github.com/sap-samples/emnlp2021-attention-contrastive-learning
- Owner: SAP-samples
- License: apache-2.0
- Created: 2021-09-02T15:59:34.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2024-04-02T10:24:08.000Z (9 months ago)
- Last Synced: 2024-04-02T12:24:18.535Z (9 months ago)
- Topics: ai, artificial-intelligence, emnlp2021, machine-learning, natural-language-processing, nlp, research-paper, sample
- Language: Python
- Homepage: https://2021.emnlp.org/
- Size: 101 KB
- Stars: 4
- Watchers: 7
- Forks: 1
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Attention-based Contrastive Learning for Winograd Schemas
[![made-with-python](https://img.shields.io/badge/Made%20with-Python-red.svg)](#python)
[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![REUSE status](https://api.reuse.software/badge/github.com/SAP-samples/emnlp2021-attention-contrastive-learning)](https://api.reuse.software/info/github.com/SAP-samples/emnlp2021-attention-contrastive-learning)#### News
- **11/24/2021: Provided source code**In this repository we will provide the source code for the paper [*Attention-based Contrastive Learning for Winograd Schemas*](https://arxiv.org/abs/2109.05108) to be presented at [EMNLP 2021](https://2021.emnlp.org/). The code is in parts based on the code from [Huggingface Tranformers](https://github.com/huggingface/transformers) and the paper [A Surprisingly Robust Trick for Winograd Schema Challenge](https://github.com/vid-koci/bert-commonsense).
## Abstract
![Schematic Illustration of Attentiomn Contrastive Learning](https://raw.githubusercontent.com/SAP-samples/emnlp2021-attention-contrastive-learning/main/img/attention_contrastive.png)
Self-supervised learning has recently attracted considerable attention in the NLP community for its ability to learn discriminative features using a contrastive objective. This paper investigates whether contrastive learning can be extended to Transfomer attention to tackling the Winograd Schema Challenge. To this end, we propose a novel self-supervised framework, leveraging a *contrastive loss* directly at the level of *self-attention*. Experimental analysis of our attention-based models on multiple datasets demonstrates superior commonsense reasoning capabilities. The proposed approach outperforms all comparable unsupervised approaches while occasionally surpassing supervised ones.
#### Authors:
- [Tassilo Klein](https://tjklein.github.io/)
- [Moin Nabi](https://moinnabi.github.io/)## Requirements
- [Python](https://www.python.org/) (version 3.6 or later)
- [PyTorch](https://pytorch.org/)
- [Huggingface Tranformers](https://github.com/huggingface/transformers)## Download and Installation
1. Install the requiremennts:
```
conda install --yes --file requirements.txt
```or
```
pip install -r requirements.txt
```2. Clone this repository and install dependencies:
```
git clone https://github.com/SAP-samples/emnlp2021-attention-contrastive-learning
cd emnlp2021-attention-contrastive-learning
pip install -r requirements.txt
```3. Create 'data' sub-directory and download files for PDP, WSC challenge, KnowRef, DPR and WinoGrande:
```
mkdir data
wget https://cs.nyu.edu/faculty/davise/papers/WinogradSchemas/PDPChallenge2016.xml
wget https://cs.nyu.edu/faculty/davise/papers/WinogradSchemas/WSCollection.xml
wget https://raw.githubusercontent.com/aemami1/KnowRef/master/Knowref_dataset/knowref_test.json
wget http://www.hlt.utdallas.edu/~vince/data/emnlp12/train.c.txt
wget http://www.hlt.utdallas.edu/~vince/data/emnlp12/test.c.txt
wget https://storage.googleapis.com/ai2-mosaic/public/winogrande/winogrande_1.1.zip
unzip winogrande_1.1.zip
rm winogrande_1.1.zip
cd ..
```4. Train
```
python main-AMEX.py --do_train --do_eval --learning_rate 1.0e-5 --shuffle --ignore_bestacc --loadcachedfeatures --train_batch_size 18 --num_train_epochs 22 --eval_batch_size 10 --max_seq_length 40 --alpha_param 0.05 --beta_param 0.02 --lambda_param 1.0 --warmup_steps 150 --bert_model=roberta-large --cache_dir cache/ --data_dir=data/ --output_dir model_output/ --task_name wscr
```## How to obtain support
[Create an issue](https://github.com/SAP-samples/emnlp2021-attention-contrastive-learning/issues) in this repository if you find a bug or have questions about the content.
For additional support, [ask a question in SAP Community](https://answers.sap.com/questions/ask.html).## Citations
If you use this code in your research,
please cite:```
@inproceedings{klein-nabi-2021-attention-based,
title = "Attention-based Contrastive Learning for {W}inograd Schemas",
author = "Klein, Tassilo and
Nabi, Moin",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
month = nov,
year = "2021",
address = "Punta Cana, Dominican Republic",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2021.findings-emnlp.208",
pages = "2428--2434"
}
```## License
Copyright (c) 2024 SAP SE or an SAP affiliate company. All rights reserved. This project is licensed under the Apache Software License, version 2.0 except as noted otherwise in the [LICENSE](LICENSES/Apache-2.0.txt) file.