Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


https://github.com/l294265421/alpaca-rlhf

Fine-tuning LLaMA with RLHF (Reinforcement Learning from Human Feedback) based on DeepSpeed Chat

alpaca chatgpt language-model large-language-models llama llm reinforcement-learning rlhf

Last synced: about 1 month ago

