Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ssbuild/llm_rlhf
realize the reinforcement learning training for gpt2 llama bloom and so on llm model
https://github.com/ssbuild/llm_rlhf
llm llm-rlhf lora reward rlhf trl trlx
Last synced: 4 days ago
JSON representation
realize the reinforcement learning training for gpt2 llama bloom and so on llm model
- Host: GitHub
- URL: https://github.com/ssbuild/llm_rlhf
- Owner: ssbuild
- Created: 2023-04-19T10:46:13.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-09-19T09:03:03.000Z (about 1 year ago)
- Last Synced: 2024-04-28T04:59:06.854Z (7 months ago)
- Topics: llm, llm-rlhf, lora, reward, rlhf, trl, trlx
- Language: Python
- Homepage:
- Size: 388 KB
- Stars: 22
- Watchers: 2
- Forks: 1
- Open Issues: 3