https://github.com/bjascob/amr_coref

A python library / model for creating co-references between AMR graph nodes.
https://github.com/bjascob/amr_coref

abstract-meaning-representation amr coreference-resolution python

Last synced: 3 months ago
JSON representation

A python library / model for creating co-references between AMR graph nodes.

Host: GitHub
URL: https://github.com/bjascob/amr_coref
Owner: bjascob
License: mit
Created: 2021-03-04T03:12:30.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2022-12-11T03:45:51.000Z (over 2 years ago)
Last Synced: 2025-03-29T22:11:36.602Z (4 months ago)
Topics: abstract-meaning-representation, amr, coreference-resolution, python
Language: Python
Homepage:
Size: 65.4 KB
Stars: 9
Watchers: 1
Forks: 1
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# amr_coref

**A python library / model for creating co-references between AMR graph nodes.**

## About
amr_coref is a python library and trained model designed to do co-referencing
between [Abstract Meaning Representation](https://amr.isi.edu/) graphs.

The project follows the general approach of the [neuralcoref project](https://github.com/huggingface/neuralcoref)
and it's excellent
[blog on the co-referencing](https://medium.com/huggingface/how-to-train-a-neural-coreference-model-neuralcoref-2-7bb30c1abdfe).
However, the model is trained to do direct co-reference resolution between graph nodes and does not depend on
the sentences the graphs were created from.

The trained model achieves the following scores
```
MUC : R=0.647 P=0.779 F₁=0.706
B³ : R=0.633 P=0.638 F₁=0.630
CEAF_m: R=0.515 P=0.744 F₁=0.609
CEAF_e: R=0.200 P=0.734 F₁=0.306
BLANC : R=0.524 P=0.799 F₁=0.542
CoNLL-2012 average score: 0.548
```

## Project Status
**!! The following papers have GitHub projects/code that are better scoring and may be a preferable solution.**
See the uploaded file in [#1](https://github.com/bjascob/amr_coref/issues/1) for a quick view of scores.
* [VGAE as Cheap Supervision for AMR Coreference Resolution](https://github.com/IreneZihuiLi/VG-AMRCoref)
* [End-to-end AMR Coreference Resolution](https://github.com/Sean-Blank/AMRcoref)

Note that due to the use of multiprocessing, this code may only be compatible with a Debian style OS.
See [#3](https://github.com/bjascob/amr_coref/issues/3) for details on the issue.

## Installation and usage
There is currently no pip installation. To use the library, simply clone the code and use it in place.

The pre-trained model can be downloaded from the assets section in [releases](https://github.com/bjascob/amr_coref/releases).

To use the model create a `data` directory and un-tar the model in it.

The script `40_Run_Inference.py`, is an example of how to use the model.

## Training
If you'd like to train the model from scratch, you'll need a copy of the
[AMR corpus](https://catalog.ldc.upenn.edu/LDC2020T02).
To complete training, run the scripts in order.
- 10_Build_Model_TData.py
- 12_Build_Embeddings.py
- 14_Build_Mention_Tokens.py
- 30_Train_Model.py.

You'll need `amr_annotation_3.0` and `GloVe/glove.6B.50d.txt` in your `data` directory

The first few scripts will create the training data in `data/tdata` and the model training
script will create `data/model`. Training takes less than 4 hours.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/bjascob/amr_coref

Awesome Lists containing this project

README