Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hailiang-wang/transitionparser-yoav
Transition-based dependency parsers
https://github.com/hailiang-wang/transitionparser-yoav
dependency-parser natural-language-processing transition-based-parser
Last synced: 11 days ago
JSON representation
Transition-based dependency parsers
- Host: GitHub
- URL: https://github.com/hailiang-wang/transitionparser-yoav
- Owner: hailiang-wang
- Created: 2018-03-19T08:57:14.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-03-19T09:12:38.000Z (almost 7 years ago)
- Last Synced: 2024-11-17T11:50:30.794Z (2 months ago)
- Topics: dependency-parser, natural-language-processing, transition-based-parser
- Language: Python
- Homepage: https://www.cs.bgu.ac.il/~yoavg/software/transitionparser/
- Size: 6.96 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README
Awesome Lists containing this project
README
Transition Based Dependency Parsers
These are implementations of the (unlabeled) arc-eager and arc-standard dependency parsing algorithms.
These parsers are very fast and are reasonably accurate.
In particular, the arc-standard parser with the features described in [1] (the default feature set) can achieve very competitive accuracies.The input file for both training and parsing should be in CoNLL format (see conll.example).
Columns 8,9,10 are always ignored (but must be present).
When parsing new text, you can put whatever you want in column 7, the parser will overwrite it (it uses this column to report accuracy scores)Compiling:
==========
Speed is achieved using a c/cython extension module.
This needs to be compiled using either cython or a c compiler.
See instructions in ml/READMETraining the parsers:
=====================./eager.py -o model_file [options] conll_input_file
or
./standard.py -o model_file [options] conll_input_file
(use -f instead of -o to create feature vector files for training with an external classifier. If you don't know what it means,
just ignore this option. The model file format is the same as Megam's.)Parsing new text with the trained model:
========================================./eager.py -m model_file [options] conll_file_to_parse > output
or
./standard.py -m model_file [options] conll_file_to_parse > output
References:
~~~~~~~~~~~
[1] Liang Huang, Wenbin Jiang and Qun Liu. 2009.
Bilingually-Constrained (Monolingual) Shift-Reduce Parsing.