Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mattbui/sent-comp

Sentence Compression with deletion, accepted at ICCCI
https://github.com/mattbui/sent-comp

Last synced: about 1 month ago
JSON representation

Sentence Compression with deletion, accepted at ICCCI

Awesome Lists containing this project

README

        

# ICCCI - Sentence Compression with deletion

## Dataset

The dataset is available at: [https://github.com/google-research-datasets/sentence-compression](https://github.com/google-research-datasets/sentence-compression). Download and store the `*.gz` files in `data/` directory.

## Requirements

This project requires python3.6+ and pytorch1.1+. It used the models and embeddings from [FLAIR framework](https://github.com/flairNLP/flairhttps://github.com/flairNLP/flair):

```bash
pip install flair
```

## Preprocess data

In order to train a sequence tagging model, the original data need to be align into sequence tagging format. To align the downloaded data:

```bash
export PRJ_HOME=
bash $PRJ_HOME/runs/preprocess.sh
```

## Training

Different training configs for each settings are available in `runs/`. To start training:

```bash
export PRJ_HOME=
bash $PRJ_HOME/runs/train_.sh
```