https://github.com/iclementine/stacklstm-parser
a simple modification of Chris Dyer's stack LSTM Parser
- Host: GitHub
- URL: https://github.com/iclementine/stacklstm-parser
- Owner: iclementine
- License: mit
- Created: 2018-07-24T21:49:37.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-09-02T19:15:08.000Z (over 7 years ago)
- Last Synced: 2024-12-28T02:36:00.890Z (12 months ago)
- Topics: dependency-parsing, dynet
- Language: Python
- Size: 85.6 MB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# stackLSTM-parser
A simple modification of Chris Dyer's stack LSTM parser.

Requires:
1. DyNet 2.0.3 (built from source from GitHub)
2. torchtext
3. conllu
4. Python 3
## Basic info
It is a simple stack LSTM parser using the arc-standard transition system, implemented with DyNet. The structure is as follows:
1. embeddings for the word form, UPOS, and XPOS. During training a word form is kept with a probability of c/(c + alpha), where c is the frequency of that word, and is otherwise replaced by the unknown token (see the first sketch after this list). The word form, UPOS, and XPOS embeddings are independently dropped out with a probability of p;
2. a stack LSTM representation of the stack;
3. a bi-directional LSTM representation of the buffer;
4. an MLP that decides which transition to take at each step, from the representations of the stack and the buffer;
5. an MLP that decides which deprel to assign when the oracle transition is a kind of reduction. The transition MLP and the deprel MLP are both one-hidden-layer MLPs and share a common hidden layer (see the second sketch after this list);
6. an MLP that composes the two LSTM outputs of the top two elements on the stack; it is also a one-hidden-layer MLP. Using a deprel embedding in the composition is not implemented yet.
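The word-form replacement in item 1 is the usual frequency-based word dropout: a word with training-set frequency c survives with probability c/(c + alpha), i.e. is swapped for the unknown token with probability alpha/(c + alpha), so rare words are replaced more often. A minimal sketch (the alpha value and the `<unk>` spelling are assumptions, not the repo's settings):
```
import random
from collections import Counter

def maybe_unk(word, freq, alpha=0.25, unk="<unk>"):
    # keep with probability c / (c + alpha); the rarer the word,
    # the more often it is replaced by the unknown token
    c = freq[word]
    return word if random.random() < c / (c + alpha) else unk

freq = Counter(["the", "the", "the", "parser"])  # toy frequency table
print(maybe_unk("parser", freq))  # often "<unk>", since c = 1
print(maybe_unk("the", freq))     # usually "the", since c = 3
```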
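Items 4 and 5 share one hidden layer computed from the parser state. A minimal DyNet sketch of that structure (the dimensions and the `score` helper are illustrative assumptions, not the repo's actual code):
```
import dynet as dy

STATE, HID, N_ACTIONS, N_DEPRELS = 400, 100, 3, 40  # assumed sizes

pc = dy.ParameterCollection()
pW_h = pc.add_parameters((HID, STATE))        # shared hidden layer
pb_h = pc.add_parameters(HID)
pW_act = pc.add_parameters((N_ACTIONS, HID))  # transition scores
pb_act = pc.add_parameters(N_ACTIONS)
pW_rel = pc.add_parameters((N_DEPRELS, HID))  # deprel scores
pb_rel = pc.add_parameters(N_DEPRELS)

def score(state):
    # state: concatenation of the stack and buffer representations
    h = dy.rectify(dy.parameter(pW_h) * state + dy.parameter(pb_h))
    act = dy.parameter(pW_act) * h + dy.parameter(pb_act)
    rel = dy.parameter(pW_rel) * h + dy.parameter(pb_rel)
    return act, rel

dy.renew_cg()
act_scores, rel_scores = score(dy.inputVector([0.0] * STATE))
```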
It achieves:
1. UAS 0.8448, LAS 0.8099 on the Universal Dependencies treebank for Chinese;
2. UAS 0.8748, LAS 0.8433 on the Universal Dependencies treebank for English.
## How to test it
1. Preprocess the datasets, save them as pickle binaries, and build vocabularies for word forms, UPOS tags, and XPOS tags from each training corpus. I use torchtext for convenience. A sketch of the CoNLL-U reading step follows the commands.
```
python3 build_corpus.py
python3 build_vocabs.py
```
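A minimal sketch of the reading step, assuming UD-style `.conllu` files; the function name and file handling are illustrative, not the repo's actual code:
```
from conllu import parse_incr

def read_treebank(path):
    sentences = []
    with open(path, encoding="utf-8") as f:
        for tokenlist in parse_incr(f):
            words = [tok["form"] for tok in tokenlist]
            # older conllu releases use "upostag"/"xpostag",
            # newer ones use "upos"/"xpos"
            upos = [tok.get("upos", tok.get("upostag")) for tok in tokenlist]
            xpos = [tok.get("xpos", tok.get("xpostag")) for tok in tokenlist]
            heads = [tok["head"] for tok in tokenlist]
            deprels = [tok["deprel"] for tok in tokenlist]
            sentences.append((words, upos, xpos, heads, deprels))
    return sentences
```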
2. Train the model and test it. The model is saved in the `save` directory; resuming from the best model of the last training run is the default behavior. A sketch of the resume pattern follows the command.
```
python3 test_model.py
```
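The resume behavior boils down to repopulating the parameter collection from the last best checkpoint before training starts. A minimal sketch of that pattern with DyNet's `save`/`populate` (the `save/best.model` file name is an assumption, not necessarily the repo's):
```
import os
import dynet as dy

pc = dy.ParameterCollection()
# ... all parameters and LSTM builders are added to pc here ...

best_path = "save/best.model"
if os.path.exists(best_path):
    pc.populate(best_path)  # resume from the best model of the last run

# ... inside the training loop, after a new best dev score ...
pc.save(best_path)
```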
Hyper-parameters are currently hard-coded in `test_model.py`; I would consider using `configparser` for the next iteration.
Enjoy!