https://github.com/zhaocq-nlp/mt-data-processing
Scripts for preprocessing datasets for machine translation.
- Host: GitHub
- URL: https://github.com/zhaocq-nlp/mt-data-processing
- Owner: zhaocq-nlp
- License: apache-2.0
- Created: 2018-01-16T09:04:11.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-03-20T13:36:42.000Z (over 7 years ago)
- Last Synced: 2025-04-13T09:11:49.831Z (3 months ago)
- Language: Python
- Size: 69.3 KB
- Stars: 10
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# MT-data-processing
This repository contains data preprocessing scripts for the shared translation task of the Workshop on Statistical Machine Translation (WMT).
Currently, it covers:
- WMT2017 EN<->DE
## References
- [BPE](https://github.com/rsennrich/subword-nmt)
- [mosesdecoder](https://github.com/moses-smt/mosesdecoder)