https://github.com/raspberryice/gen-arg
Code for paper "Document-Level Argument Extraction by Conditional Generation". NAACL 21'
https://github.com/raspberryice/gen-arg
event-extraction information-extraction
Last synced: 4 months ago
JSON representation
Code for paper "Document-Level Argument Extraction by Conditional Generation". NAACL 21'
- Host: GitHub
- URL: https://github.com/raspberryice/gen-arg
- Owner: raspberryice
- License: mit
- Created: 2021-04-12T02:42:19.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2023-02-08T16:48:21.000Z (almost 3 years ago)
- Last Synced: 2023-03-03T11:02:34.964Z (almost 3 years ago)
- Topics: event-extraction, information-extraction
- Language: Python
- Homepage:
- Size: 3.54 MB
- Stars: 91
- Watchers: 6
- Forks: 22
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Argument Extraction by Generation
Code for paper "Document-Level Argument Extraction by Conditional Generation". NAACL 21'
## Dependencies
- pytorch=1.6
- transformers=3.1.0
- pytorch-lightning=1.0.6
- spacy=2.3.2
## Model Checkpoints
Checkpoints trained from this repo are shared for the WikiEvents dataset and the ACE dataset are available at: [s3://gen-arg-data/checkpoints/].
You can download all the contents from the S3 bucket using AWS cli: `aws s3 cp s3://gen-arg-data/checkpoints/ ./ --recursive`
### Model Predictions
The model predictions on WikiEvents is provided in `outputs/wikievents-pointer-pred`.
Running this file through the `scorer.py` function should give you the exact same numbers as Table 5.
## Datasets
- RAMS (Download at [https://nlp.jhu.edu/rams/])
- ACE05 (Access from LDC[https://catalog.ldc.upenn.edu/LDC2006T06] and preprocessing following OneIE[http://blender.cs.illinois.edu/software/oneie/])
- WikiEvents (Available here [s3://gen-arg-data/wikievents/])
You can download the data through the AWS cli or AWS console.
Alternatively, you can download individual files by
- `wget https://gen-arg-data.s3.us-east-2.amazonaws.com/wikievents/data/.jsonl` for split={train, dev,test}.
- `wget https://gen-arg-data.s3.us-east-2.amazonaws.com/wikievents/data/coref/.jsonlines` for split={train, dev, test}.
Additional processed test files for RAMS can be downloaded by
- `wget https://gen-arg-data.s3.us-east-2.amazonaws.com/RAMS/test_head_coref.jsonlines`
- `wget https://gen-arg-data.s3.us-east-2.amazonaws.com/RAMS/test_head.jsonlines`