Projects in Awesome Lists by RLHFlow

A curated list of projects in awesome lists by RLHFlow .

- Recently synced
- Stars

https://github.com/RLHFlow/RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

llama3 llm reward-models rlhf

Last synced: 07 May 2025

https://github.com/RLHFlow/Online-RLHF

A recipe for online RLHF and online iterative DPO.

llama3 llm rlhf

Last synced: 24 Feb 2025

https://github.com/rlhflow/online-rlhf

A recipe for online RLHF and online iterative DPO.

llama3 llm rlhf

Last synced: 08 Apr 2025

https://github.com/rlhflow/online-dpo-r1

Codebase for Iterative DPO Using Rule-based Rewards

Last synced: 19 Jun 2025

https://github.com/rlhflow/minimal-rl

Last synced: 19 Jun 2025

https://github.com/rlhflow/rlhf-reward-modeling

A recipe to train reward models for RLHF.

llm reward-functions rlhf

Last synced: 21 Aug 2025

https://github.com/rlhflow/directional-preference-alignment

Directional Preference Alignment

ai-alignment large-language-models rlhf

Last synced: 09 Mar 2026

https://github.com/RLHFlow/Directional-Preference-Alignment

Directional Preference Alignment

ai-alignment large-language-models rlhf

Last synced: 24 Feb 2025

https://github.com/rlhflow/gvm

Last synced: 30 Jul 2025

https://github.com/rlhflow/self-rewarding-reasoning-llm

Recipes to train the self-rewarding reasoning LLMs.

Last synced: 04 Oct 2025

https://github.com/rlhflow/reinforce-ada

An adaptive sampling framework for Reinforce-style LLM post training.

Last synced: 11 Oct 2025

https://github.com/rlhflow/rlhflow.github.io

Webpage for RLHFlow

Last synced: 02 Feb 2026

https://github.com/rlhflow/.github

Last synced: 28 Jan 2026

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome