https://github.com/howardchiang2/fine_tune_llama3_with_orpo

Fine-tune Llama 3 with the ORPO method

llama3 llm orpo

# fine_tune_llama3_with_ORPO
Fine-tune Llama 3 with the ORPO method.

Key idea: a new attempt at combining supervised fine-tuning (SFT) with preference alignment learning.
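As a rough illustration of how SFT and alignment are combined, the ORPO paper adds an odds-ratio penalty to the usual SFT loss: `L_ORPO = L_SFT + λ · L_OR`, where `L_OR = -log σ(log(odds(chosen) / odds(rejected)))` and `odds(y) = P(y) / (1 - P(y))`. The sketch below is a minimal pure-Python version of that formula, not the code used in this repository. The function names, the scalar inputs (length-averaged log-probabilities of the chosen and rejected responses), and the default `lam=0.1` are assumptions made for illustration.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) given log p, where p is the
    length-normalized likelihood of a response (requires avg_logp < 0)."""
    # log1p(-exp(log p)) equals log(1 - p)
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(avg_logp_chosen: float, avg_logp_rejected: float, lam: float = 0.1) -> float:
    """Hypothetical helper: SFT loss on the chosen response plus a
    weighted odds-ratio penalty. lam=0.1 is an illustrative value."""
    # SFT term: negative log-likelihood of the chosen response
    l_sft = -avg_logp_chosen
    # Log odds ratio between the chosen and rejected responses
    z = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    # -log(sigmoid(z)) written as a numerically stable softplus(-z)
    l_or = max(-z, 0.0) + math.log1p(math.exp(-abs(z)))
    return l_sft + lam * l_or
```

With this form, the loss is lower when the model assigns a higher likelihood to the chosen response than to the rejected one, and no separate reference model is needed to compute it.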

Environment: a Colab L4 GPU is recommended (requires Colab Pro).

Reference paper: "ORPO: Monolithic Preference Optimization without Reference Model"

Reference blog post: https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html