An open API service indexing awesome lists of open source software.

https://github.com/letruongzzio/vi-en-translation


https://github.com/letruongzzio/vi-en-translation

Last synced: 4 months ago
JSON representation

Awesome Lists containing this project

README

          

# VI-EN Translation

1. The project writes code for training GRU, Transformer, BERT, GPT-2 and mBART-50 models to perform Vietnamese to English translation.

2. The data used in this project includes 2 sets:
- TED2020: [[README](https://github.com/letruongzzio/Vietnamese-Poem-Generation/blob/main/data/README)] [[LICENSE](https://github.com/letruongzzio/Vietnamese-Poem-Generation/blob/main/data/LICENSE)].
- PhoMT: [[PAPER](https://arxiv.org/pdf/2110.12199)]
```bibtex
@inproceedings{PhoMT,
title = {{PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation}},
author = {Long Doan and Linh The Nguyen and Nguyen Luong Tran and Thai Hoang and Dat Quoc Nguyen},
booktitle = {Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing},
year = {2021}
}
```

3. Note that this repository is for reference purposes only as there may be some errors with the {`..._DIR`} path. Also, consult [`requirements.txt`](https://github.com/letruongzzio/Vietnamese-Poem-Generation/blob/main/requirements.txt) to install the correct version of the library used:

```bash
pip install -r requirements.txt
```

4. Contributions are welcome. Please create a pull request to contribute to this project and help improve the Vietnamese-English translation model. If you have any questions, please contact me at [lephutruong.2210@gmail.com](mailto:).

5. The project is licensed under the MIT License. See [LICENSE](https://github.com/letruongzzio/Vietnamese-Poem-Generation/blob/main/LICENSE).