Projects in Awesome Lists tagged with direct-preference-optimization

A curated list of projects in awesome lists tagged with direct-preference-optimization .

- Recently synced
- Stars

https://github.com/codelion/pts

Pivotal Token Search

dataset-generation direct-preference-optimization dpo llm llm-inference llm-steering mech-interp phi-4 phi-4-mini phi4 phi4-mini pivotal-token-search pivotal-tokens reasoning-agent reasoning-language-models reasoning-models sae sparse-autoencoder steering-vector tokens

Last synced: 10 Jun 2025

https://github.com/rasyosef/phi-2-sft-and-dpo

Notebooks to create an instruction following version of Microsoft's Phi 2 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)

direct-preference-optimization huggingface llm pytorch supervised-finetuning transformers trl

Last synced: 14 May 2026

https://github.com/rasyosef/phi-1_5-instruct

Notebooks to create an instruction following version of Microsoft's Phi 1.5 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)

direct-preference-optimization llm pytorch supervised-finetuning transformers trl

Last synced: 05 Oct 2025

https://github.com/cluebbers/adverserial-paraphrasing

Evaluate how LLaMA 3.1 8B handles paraphrased adversarial prompts targeting refusal behavior.

deep-learning direct-preference-optimization redteam reinforcement-learning

Last synced: 04 Sep 2025

https://github.com/akhilpandey95/llmscisci

Experiments, and how-to guide for the lecture "Large language models for Scientometrics"

direct-preference-optimization finetuning-llms in-context-learning llms reproducibility scientometrics

Last synced: 26 Feb 2026

https://github.com/cluebbers/dpo-rlhf-paraphrase-types

Enhancing paraphrase-type generation using Direct Preference Optimization (DPO) and Reinforcement Learning from Human Feedback (RLHF), with large-scale HPC support. This project aligns model outputs to human-ranked data for robust, safety-focused NLP.

alignment deep-learning direct-preference-optimization human-feedback paraphrase-generation paraphrase-type-generation reinforcement-learning transformers

Last synced: 29 Apr 2026

https://github.com/eliashornberg/epfllama

EPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, and quantization.

artificial-intelligence direct-preference-optimization large-language-models lora natural-language-processing pytorch supervised-finetuning

Last synced: 05 May 2026

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome