Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mswellhao/PacSum
Unsupervised Extractive Summarization based on Position-Augmented Centrality
https://github.com/mswellhao/PacSum
Last synced: about 1 month ago
JSON representation
Unsupervised Extractive Summarization based on Position-Augmented Centrality
- Host: GitHub
- URL: https://github.com/mswellhao/PacSum
- Owner: mswellhao
- Created: 2019-05-31T10:51:57.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2021-09-06T09:21:36.000Z (over 3 years ago)
- Last Synced: 2024-08-02T10:27:20.583Z (4 months ago)
- Language: Python
- Size: 18.6 KB
- Stars: 124
- Watchers: 4
- Forks: 27
- Open Issues: 12
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-Multi-Document-Summarization - mswellhao/PacSum
README
# PacSum
This code is for paper [Sentence Centrality Revisited for Unsupervised Summarization](https://arxiv.org/pdf/1906.03508.pdf) ACL 2019
Some codes are borrowed from [pytorch_pretrained_bert](https://github.com/huggingface/pytorch-transformers) and [gensim](https://github.com/RaRe-Technologies/gensim)
-------
### Dependencies
Python3.6, pytorch >= 1.0, numpy, gensim, pyrouge-------
### Data used in the paper:Download https://drive.google.com/open?id=1gNKWkZG4dVr5XrOeQBVicy1fdnpH2d5l
### Bert models fine-tuned using the approach in the paper:
Download https://drive.google.com/file/d/1wbMlLmnbD_0j7Qs8YY8cSCh935WKKdsP/view?usp=sharing
### Tuning the hyperparamters and test the performance using TfIdf or BERT representation
```
python run.py --rep tfidf --mode tune --tune_data_file path/to/validation/data --test_data_file path/to/test/data
```
```
python run.py --rep bert --mode tune --tune_data_file path/to/validation/data --test_data_file path/to/test/data --bert_model_file path/to/model --bert_config_file path/to/config --bert_vocab_file path/to/vocab
```