# LD-Net

[![Documentation Status](https://readthedocs.org/projects/ld-net/badge/?version=latest)](http://ld-net.readthedocs.io/en/latest/?badge=latest)
[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)

**Check Out Our New NER Toolkit 🚀🚀🚀**
- **Inference**:
  - **[LightNER](https://github.com/LiyuanLucasLiu/LightNER)**: *efficient* inference with models pre-trained or trained with *any* of the following tools.
- **Training**:
  - **[LD-Net](https://github.com/LiyuanLucasLiu/LD-Net)**: train NER models with efficient contextualized representations.
  - **[VanillaNER](https://github.com/LiyuanLucasLiu/Vanilla_NER)**: train vanilla NER models with pre-trained embeddings.
- **Distant Training**:
  - **[AutoNER](https://shangjingbo1226.github.io/AutoNER/)**: train NER models without line-by-line annotations and still get competitive performance.

--------------------------------

LD-Net provides sequence labeling models featuring:
- **Efficiency**: constructs *efficient contextualized representations* without retraining language models.
- **Portability**: *well-organized*, *easy-to-modify*, and *[well-documented](http://lm-lstm-crf.readthedocs.io/en/latest/)* code.

Remarkably, our pre-trained NER model achieves:
- **92.08** test F1 on the CoNLL03 NER task.
- **160K words/sec** decoding speed (a **6X** speedup over the original, unpruned model).

Details about LD-Net can be accessed at: https://arxiv.org/abs/1804.07827.

- [Model Notes](#model-notes)
- [Benchmarks](#benchmarks)
- [Pretrained Models](#pretrained-models)
  - [Language Models](#language-models)
  - [Named Entity Recognition](#named-entity-recognition)
  - [Chunking](#chunking)
- [Training](#training)
  - [Demo Scripts](#demo-scripts)
  - [Dependency](#dependency)
  - [Data](#data)
  - [Model](#model)
- [Inference](#inference)
- [Citation](#citation)

## Model Notes

![LD-Net Framework](docs/model_note.png)

## Benchmarks

| Model for CoNLL03 | #FLOPs | Mean(F1) | Std(F1) |
| ------------- | ------------- | ----- | ----- |
| Vanilla NER w/o LM | 3 M | 90.78 | 0.24 |
| LD-Net (w/o pruning) | 51 M | 91.86 | 0.15 |
| LD-Net (original, picked by dev F1) | 51 M | 91.95 | |
| LD-Net (pruned) | **5 M** | 91.84 | 0.14 |

| Model for CoNLL00 | #FLOPs | Mean(F1) | Std(F1) |
| ------------- | ------------- | ----- | ----- |
| Vanilla NP w/o LM | 3 M | 94.42 | 0.08 |
| LD-Net (w/o pruning) | 51 M | 96.01 | 0.07 |
| LD-Net (original, picked by dev F1) | 51 M | 96.13 | |
| LD-Net (pruned) | **10 M** | 95.66 | 0.04 |

## Pretrained Models

Here we provide both pre-trained language models and pre-trained sequence labeling models.

### Language Models

Our pretrained language model consists of a word embedding layer, a 10-layer densely connected LSTM, and an adaptive softmax; it achieves an average PPL of 50.06 on the One Billion Word benchmark.

| Forward Language Model | Backward Language Model |
| ------------- |------------- |
| [Download Link](http://dmserv4.cs.illinois.edu/ld0.th) | [Download Link](http://dmserv4.cs.illinois.edu/ld_0.th)|
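
For intuition, here is a minimal PyTorch sketch of a densely connected LSTM stack in the spirit described above. This is our own illustration, not the repo's actual `model_word_ada` implementation; the embedding and adaptive softmax are omitted:

```python
import torch
import torch.nn as nn

class DenselyConnectedLSTM(nn.Module):
    """Layer i consumes the input concatenated with all earlier layers' outputs."""
    def __init__(self, input_dim, hidden_dim, num_layers=10):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.LSTM(input_dim + i * hidden_dim, hidden_dim, batch_first=True)
            for i in range(num_layers)
        )

    def forward(self, x):
        outs = [x]
        for layer in self.layers:
            out, _ = layer(torch.cat(outs, dim=-1))
            outs.append(out)
        # the contextualized representation concatenates every layer's output
        return torch.cat(outs[1:], dim=-1)

dense_lm = DenselyConnectedLSTM(input_dim=100, hidden_dim=50)
reps = dense_lm(torch.randn(2, 7, 100))  # shape: (2, 7, 10 * 50)
```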

### Named Entity Recognition

The original pre-trained named entity tagger achieves 91.95 F1; the pruned tagger achieves 92.08 F1.

| Original Tagger | Pruned Tagger |
| ------------- |------------- |
| [Download Link](http://dmserv4.cs.illinois.edu/ner.th) | [Download Link](http://dmserv4.cs.illinois.edu/pner0.th) |
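
The `.th` files are PyTorch checkpoints. A minimal loading sketch, assuming only standard `torch.save` serialization (the stored object's layout is defined by this repo's training code):

```python
import torch

# load on CPU and inspect the object before wiring it into model_seq;
# its exact structure is determined by LD-Net's saving code
tagger = torch.load('pner0.th', map_location='cpu')
print(type(tagger))
```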

### Chunking

The original pre-trained chunking model achieves 96.13 F1; the pruned model achieves 95.79 F1.

| Original Tagger | Pruned Tagger |
| ------------- |------------- |
| [Download Link](http://dmserv4.cs.illinois.edu/np.th) | [Download Link](http://dmserv4.cs.illinois.edu/pnp0.th) |

## Training

### Demo Scripts

To prune the original LD-Net for CoNLL03 NER, run:
```
bash ldnet_ner_prune.sh
```

To prune the original LD-Net for CoNLL00 chunking, run:
```
bash ldnet_np_prune.sh
```
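
As a rough, self-contained illustration of what pruning buys (our own toy sketch, not the repo's actual mechanism): because the layers are densely connected, a dropped layer can be skipped at inference while a zero placeholder keeps downstream input shapes stable, which is where the FLOP savings in the benchmark tables come from.

```python
import torch
import torch.nn as nn

class PrunableDenseLSTM(nn.Module):
    def __init__(self, input_dim, hidden_dim, num_layers=10, keep=(0, 2, 5)):
        super().__init__()
        self.keep = set(keep)  # indices of layers that survive pruning
        self.hidden_dim = hidden_dim
        self.layers = nn.ModuleList(
            nn.LSTM(input_dim + i * hidden_dim, hidden_dim, batch_first=True)
            for i in range(num_layers)
        )

    def forward(self, x):
        outs = [x]
        for i, layer in enumerate(self.layers):
            if i in self.keep:
                out, _ = layer(torch.cat(outs, dim=-1))
            else:
                # pruned layer: skip its LSTM entirely; zeros preserve shapes
                out = x.new_zeros(x.size(0), x.size(1), self.hidden_dim)
            outs.append(out)
        return torch.cat(outs[1:], dim=-1)

pruned = PrunableDenseLSTM(100, 50, keep=[0, 2, 5])
reps = pruned(torch.randn(2, 7, 100))  # same output shape, far fewer FLOPs
```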

### Dependency

Our package is based on Python 3.6 and the following packages:
```
numpy
tqdm
torch-scope
torch==0.4.1
```
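
For example, assuming a pip-based environment (any Python 3.6 package manager works):
```
pip install numpy tqdm torch-scope torch==0.4.1
```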

### Data

Pre-processing scripts are available in `pre_seq` and `pre_word_ada`, and the pre-processed data can be downloaded here:

| NER | Chunking |
| ------------- |------------- |
| [Download Link](http://dmserv4.cs.illinois.edu/ner_dataset.pk) | [Download Link](http://dmserv4.cs.illinois.edu/np_dataset.pk) |
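
A minimal loading sketch, assuming the `.pk` files are standard Python pickles produced by the scripts above (their internal structure is defined by `pre_seq`):

```python
import pickle

# inspect the pre-processed NER dataset; its structure is determined by
# this repo's pre-processing scripts, so check it before relying on keys
with open('ner_dataset.pk', 'rb') as f:
    dataset = pickle.load(f)
print(type(dataset))
```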

### Model

Our implementations are available in `model_seq` and `model_word_ada`, and the documentation is hosted on [ReadTheDocs](http://lm-lstm-crf.readthedocs.io/en/latest/).


## Inference

For model inference, please check out our [LightNER package](https://github.com/LiyuanLucasLiu/LightNER).

## Citation

If you find the implementation useful, please cite the following paper: [Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling](https://arxiv.org/abs/1804.07827)

```
@inproceedings{liu2018efficient,
  title = "{Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling}",
  author = {Liu, Liyuan and Ren, Xiang and Shang, Jingbo and Peng, Jian and Han, Jiawei},
  booktitle = {EMNLP},
  year = 2018,
}
```