https://github.com/tomrunia/unsupervisedpbsmt


*This is unfinished code and currently on hold.*

# Unsupervised Phrase-Based Statistical Machine Translation

This repository explores unsupervised learning for phrase-based statistical machine translation. Such approaches are particularly valuable for low-resource language pairs for which no parallel sentences are available.

## Third-Party Software

- [FastText](https://github.com/facebookresearch/fastText) (learning word embeddings)
- [MUSE](https://github.com/facebookresearch/MUSE) (unsupervised learning of bilingual dictionary)
- [Moses](http://statmt.org/moses/) (phrase-based statistical machine translation system)
- [KenLM](https://github.com/kpu/kenlm) (learning smoothed n-gram language models)
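
To make the role of the bilingual dictionary more concrete, here is a minimal sketch of dictionary induction by cosine nearest-neighbour search. It is not code from this repository and not the MUSE API: the function names `cosine` and `induce_dictionary` are hypothetical, the vectors are hand-made toy values, and the two embedding spaces are assumed to be already aligned. MUSE itself learns that alignment and uses a more refined retrieval criterion.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def induce_dictionary(src_emb, tgt_emb):
    """Map every source word to its nearest target word by cosine similarity.

    Assumption: both embedding tables already live in one shared space.
    """
    dictionary = {}
    for src_word, src_vec in src_emb.items():
        best = max(tgt_emb, key=lambda w: cosine(src_vec, tgt_emb[w]))
        dictionary[src_word] = best
    return dictionary


# Toy, hand-made embeddings (illustrative values only, not trained vectors).
src_emb = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.1, 0.9, 0.0],
    "house": [0.0, 0.1, 0.9],
}
tgt_emb = {
    "chat": [0.8, 0.2, 0.1],
    "chien": [0.2, 0.8, 0.1],
    "maison": [0.1, 0.0, 0.9],
}

toy_dictionary = induce_dictionary(src_emb, tgt_emb)
print(toy_dictionary)
```

In a phrase-based pipeline, a dictionary induced this way could be used to seed an initial phrase table, with the n-gram language model scoring candidate translations on the target side.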

## Installation

    git clone --recursive git@github.com:tomrunia/UnsupervisedPBSMT.git

## References

- Conneau, Alexis, et al. "**Word translation without parallel data.**" ICLR 2018.
- Lample, Guillaume, et al. "**Unsupervised Machine Translation Using Monolingual Corpora Only.**" ICLR 2018.
- Lample, Guillaume, et al. "**Phrase-Based & Neural Unsupervised Machine Translation.**" EMNLP 2018.