https://github.com/tomrunia/unsupervisedpbsmt
Unsupervised Phrase-Based Statistical Machine Translation
https://github.com/tomrunia/unsupervisedpbsmt
Last synced: 3 months ago
JSON representation
Unsupervised Phrase-Based Statistical Machine Translation
- Host: GitHub
- URL: https://github.com/tomrunia/unsupervisedpbsmt
- Owner: tomrunia
- Created: 2018-06-14T10:07:37.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-07-16T12:21:01.000Z (almost 7 years ago)
- Last Synced: 2025-01-12T15:10:56.224Z (5 months ago)
- Size: 3.91 KB
- Stars: 5
- Watchers: 2
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
*This is unfinished code and currently on hold.*
# Unsupervised Phrase-Based Statistical Machine Translation
Unsupervised learning for machine translation systems. In particular, these approaches are valuable for low-resource languages that have no parallel sentences available.
## Third-Party Software
- [FastText](https://github.com/facebookresearch/fastText) (learning word embeddings)
- [MUSE](https://github.com/facebookresearch/MUSE) (unsupervised learning of bilingual dictionary)
- [Moses](http://statmt.org/moses/) (phrase-based language model)
- [KenLM](https://github.com/kpu/kenlm) (learning smoothed n-gram models)## Installation
git clone --recursive [email protected]:tomrunia/UnsupervisedPBSMT.git
## References
- Conneau, Alexis, et al. "**Word translation without parallel data.**" ICLR 2018.
- Lample, Guillaume, et al. "**Unsupervised Machine Translation Using Monolingual Corpora Only.**" ICLR 2018.
- Lample, Guillaume, et al. "**Phrase-Based & Neural Unsupervised Machine Translation.**" arXiv 2018.