https://github.com/fzp0424/MT-R1-Zero
Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"
Last synced: 12 months ago
- Host: GitHub
- URL: https://github.com/fzp0424/MT-R1-Zero
- Owner: fzp0424
- Created: 2025-04-14T15:59:37.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-04-14T16:42:19.000Z (about 1 year ago)
- Last Synced: 2025-04-14T17:45:54.427Z (about 1 year ago)
- Language: Python
- Size: 9.23 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
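- StarryDivineSky - fzp0424/MT-R1-Zero - MT-R1-Zero is a machine translation project built on large language models (LLMs); its core idea is to use an R1-Zero-like reinforcement learning method to improve translation quality. The project aims to advance the application of LLMs to machine translation and provides the corresponding code implementation. Its main contribution is a new training paradigm that optimizes an LLM's translation ability through reinforcement learning so that it performs better on specific tasks. The project name comes from the R1-Zero-like reinforcement learning method it adopts. For implementation details and experimental results, refer to the accompanying paper. If you are interested in using LLMs for machine translation, and especially in applying reinforcement learning to it, this project is worth following. The code is provided so that researchers can reproduce and extend the work. (A01_Text Generation_Text Dialogue / Large Language Dialogue Models and Data)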