https://github.com/ayulockin/T5-RLHF-TF
Implementation of Reinforcement Learning from Human Feedback for Summarization Task in TensorFlow
- Host: GitHub
- URL: https://github.com/ayulockin/T5-RLHF-TF
- Owner: ayulockin
- License: MIT
- Created: 2022-12-15T11:24:03.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-01-15T15:17:25.000Z (over 2 years ago)
- Last Synced: 2025-03-17T17:56:34.528Z (about 1 month ago)
- Language: Jupyter Notebook
- Size: 199 KB
- Stars: 4
- Watchers: 2
- Forks: 0
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-human-in-the-loop - GitHub - ayulockin/T5-RLHF-TF
README
# RLHF-TF
Implementation of Reinforcement Learning from Human Feedback for Summarization Task in TensorFlow