Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mswellhao/PacSum

Unsupervised Extractive Summarization based on Position-Augmented Centrality
https://github.com/mswellhao/PacSum

Last synced: about 2 months ago
JSON representation

Unsupervised Extractive Summarization based on Position-Augmented Centrality

Awesome Lists containing this project

README

        

# PacSum

This code is for paper [Sentence Centrality Revisited for Unsupervised Summarization](https://arxiv.org/pdf/1906.03508.pdf) ACL 2019

Some codes are borrowed from [pytorch_pretrained_bert](https://github.com/huggingface/pytorch-transformers) and [gensim](https://github.com/RaRe-Technologies/gensim)

-------
### Dependencies
Python3.6, pytorch >= 1.0, numpy, gensim, pyrouge

-------
### Data used in the paper:

Download https://drive.google.com/open?id=1gNKWkZG4dVr5XrOeQBVicy1fdnpH2d5l

### Bert models fine-tuned using the approach in the paper:

Download https://drive.google.com/file/d/1wbMlLmnbD_0j7Qs8YY8cSCh935WKKdsP/view?usp=sharing

### Tuning the hyperparamters and test the performance using TfIdf or BERT representation
```
python run.py --rep tfidf --mode tune --tune_data_file path/to/validation/data --test_data_file path/to/test/data
```
```
python run.py --rep bert --mode tune --tune_data_file path/to/validation/data --test_data_file path/to/test/data --bert_model_file path/to/model --bert_config_file path/to/config --bert_vocab_file path/to/vocab
```