https://github.com/nayeon7lee/bert-summarization

Last synced: 3 months ago
JSON representation

Host: GitHub
URL: https://github.com/nayeon7lee/bert-summarization
Owner: nayeon7lee
Created: 2019-04-12T14:41:48.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2022-12-08T04:58:34.000Z (over 2 years ago)
Last Synced: 2024-11-02T23:32:43.402Z (8 months ago)
Language: Python
Size: 51.8 KB
Stars: 122
Watchers: 8
Forks: 32
Open Issues: 17
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-bert - nayeon7lee/bert-summarization - Based Natural Language Generation for Text Summarization', Paper: https://arxiv.org/pdf/1902.09243.pdf (BERT Text Summarization Task:)

README

## Implementation of 'Pretraining-Based Natural Language Generation for Text Summarization'

Paper: https://arxiv.org/pdf/1902.09243.pdf

### Versions
* python 2.7
* PyTorch: 1.0.1.post2

### Preparing package/dataset
0. Run: `pip install -r requirements.txt` to install required packages
1. Download chunk CNN/DailyMail data from: https://github.com/JafferWilson/Process-Data-of-CNN-DailyMail
2. Run: `python news_data_reader.py` to create pickle file that will be used in my data-loader

### Running the model
For me, the model was too big for my GPU, so I used smaller parameters as following for debugging purpose.
`CUDA_VISIBLE_DEVICES=3 python main.py --cuda --batch_size=2 --hop 4 --hidden_dim 100`

### Note to reviewer:
* Although I implemented the core-part (2-step summary generation using BERT), I didn't have enough time to implement RL section.
* The 2nd decoder process is very time-consuming (since it needs to create BERT context vector for each timestamp).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/nayeon7lee/bert-summarization

Awesome Lists containing this project

README