Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Awesome-RLHF

Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD
https://github.com/andy-yangz/Awesome-RLHF

Last synced: 2 days ago
JSON representation

Repos
- LM RLHF
  - Transformer Reinforcement Learning X (TRLX) - Learning*** (**ILQL**)
  - RL4LMs (A modular RL library to fine-tune language models to human preferences)
Datasets
- LM RLHF
  - HH-RLHF - rlhf)]：A Dataset created by Anthropic.
📜 Papers & Blog
Videos & Lectures
- LM RLHF
  - Learning Task Specifications for Reinforcement Learning from Human Feedback
  - Reinforcement Learning from Human Feedback: From Zero to chatGPT

Programming Languages

Categories

📜 Papers & Blog 18 Repos 2 Videos & Lectures 2 Datasets 1

Sub Categories

LM RLHF 18 Pre-LM RLHF 4 Survey 1

Keywords

reinforcement-learning 2 text-generation 1 table-to-text 1 summarization 1 nlp 1 natural-language-processing 1 machine-translation 1 language-modeling 1 dialogue-generation 1 pytorch 1 machine-learning 1