https://github.com/yorko/gender-unbiased_bert-based_pronoun_resolution
Source code for the ACL workshop paper and Kaggle competition by Google AI team
https://github.com/yorko/gender-unbiased_bert-based_pronoun_resolution
acl2019 bert bert-model coreference-resolution gender-bias kaggle
Last synced: about 1 month ago
JSON representation
Source code for the ACL workshop paper and Kaggle competition by Google AI team
- Host: GitHub
- URL: https://github.com/yorko/gender-unbiased_bert-based_pronoun_resolution
- Owner: Yorko
- License: apache-2.0
- Created: 2019-05-03T21:02:52.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2022-01-13T10:49:36.000Z (over 3 years ago)
- Last Synced: 2025-02-25T08:06:42.782Z (3 months ago)
- Topics: acl2019, bert, bert-model, coreference-resolution, gender-bias, kaggle
- Language: Jupyter Notebook
- Homepage: https://www.kaggle.com/c/gendered-pronoun-resolution
- Size: 5.97 MB
- Stars: 40
- Watchers: 6
- Forks: 6
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Source code for the paper "Resolving Gendered Ambiguous Pronouns with BERT"
[ArXiv](https://arxiv.org/abs/1906.01161), accepted at the 1st ACL Workshop on Gender Bias for Natural Language Processing at [ACL 2019](http://www.acl2019.org/EN/index.xhtml) in Florence on 2nd August 2019.
#### [Kaggle competition](https://www.kaggle.com/c/gendered-pronoun-resolution)
#### Abstract
Pronoun resolution is part of coreference resolution, the task of pairing an expression to its referring entity. This is an important task for natural language understanding and a necessary component of machine translation systems, chat bots and assistants. Neural machine learning systems perform far from ideally in this task, reaching as low as 73\% F1 scores on modern benchmark datasets. Moreover, they tend to perform better for masculine pronouns than for feminine ones. Thus, the problem is both challenging and important for NLP researchers and practitioners. In this project, we describe our BERT-based approach to solving the problem of gender-balanced pronoun resolution. We are able to reach 92\% F1 score and a much lower gender bias on the benchmark dataset shared by Google AI Language team.#### Navigation
- `fine-tuned_model` contains the code reproducing Ken Krige's 5th place solution
- `frozen_model` contains the code reproducing 22th place solution by team "[ods.ai] five zeros" (Yury Kashnitsky, Matei Ionita, Vladimir Larin, Dennis Logvinenko, and Atanas Atanasov)
- `analysis` folder contains scripts for comparison with results by Google AI Language team reported in [this paper](https://arxiv.org/abs/1810.05201)