An open API service indexing awesome lists of open source software.
A curated list of projects in awesome lists tagged with llm-rlhf.
Implements reinforcement learning training for LLMs such as GPT-2, LLaMA, and BLOOM.
llm llm-rlhf lora reward rlhf trl trlx
Last synced: 08 Nov 2024