Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by l294265421
A curated list of projects in awesome lists by l294265421.
https://github.com/l294265421/alpaca-rlhf
Fine-tuning LLaMA with RLHF (Reinforcement Learning from Human Feedback), based on DeepSpeed Chat
alpaca chatgpt language-model large-language-models llama llm reinforcement-learning rlhf
Last synced: 31 Jul 2024