An open API service indexing awesome lists of open source software.

https://github.com/zhaocq-nlp/mt-data-processing

Scripts for preprocessing datasets for machine translation.
https://github.com/zhaocq-nlp/mt-data-processing

Last synced: 3 months ago
JSON representation

Scripts for preprocessing datasets for machine translation.

Awesome Lists containing this project

README

        

# MT-data-processing
This repository contains scripts for the shared translation task at the Statistical Machine Translation.
Now, it deals with:
- WMT2017 EN<->DE

## References
- [BPE](https://github.com/rsennrich/subword-nmt)
- [mosesdecoder](https://github.com/moses-smt/mosesdecoder)