Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/voidful/TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
https://github.com/voidful/TextRL
chatgpt controlled-nlg gpt-2 gpt-3 language-model nlg nlp pytorch reinforcement-learning rlhf
Last synced: about 2 months ago
JSON representation
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
- Host: GitHub
- URL: https://github.com/voidful/TextRL
- Owner: voidful
- License: mit
- Created: 2021-03-18T09:11:36.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2024-05-09T17:37:44.000Z (4 months ago)
- Last Synced: 2024-07-28T20:59:35.561Z (about 2 months ago)
- Topics: chatgpt, controlled-nlg, gpt-2, gpt-3, language-model, nlg, nlp, pytorch, reinforcement-learning, rlhf
- Language: Python
- Homepage:
- Size: 400 KB
- Stars: 534
- Watchers: 11
- Forks: 64
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- Awesome-ChatGPT - TextRL - Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) (Developer Libraries, SDKs, and APIs / Python)
- awesome-human-in-the-loop - Github - voidful/TextRL - 176B/bloom/gpt/bart/T5/MetaICL) (Awesome RHLF / Tools and Resources)
- awesome-chatgpt - TextRL - Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) (Developer Libraries, SDKs, and APIs / Python)
- awesome-chatgpt - voidful/TextRL - TextRL is a Python library that utilizes reinforcement learning to improve text generation using huggingface's transformer models. (SDK, Libraries, Frameworks / Python)