An open API service indexing awesome lists of open source software.

https://github.com/vincentmin/transformer_rlhf_eli5

We train a transformer model using Reinforcement Learning Human Feedback on the Reddit ELI5 dataset
https://github.com/vincentmin/transformer_rlhf_eli5

Last synced: 9 months ago
JSON representation

We train a transformer model using Reinforcement Learning Human Feedback on the Reddit ELI5 dataset

Awesome Lists containing this project

README

          

# transformer_rlhf_eli5
We train a transformer model using Reinforcement Learning Human Feedback on the Reddit ELI5 dataset