Projects in Awesome Lists tagged with direct-preference-optimization
A curated list of projects in awesome lists tagged with direct-preference-optimization .
https://github.com/codelion/pts
Pivotal Token Search
dataset-generation direct-preference-optimization dpo llm llm-inference llm-steering mech-interp phi-4 phi-4-mini phi4 phi4-mini pivotal-token-search pivotal-tokens reasoning-agent reasoning-language-models reasoning-models sae sparse-autoencoder steering-vector tokens
Last synced: 10 Jun 2025
https://github.com/rasyosef/phi-2-sft-and-dpo
Notebooks to create an instruction following version of Microsoft's Phi 2 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
direct-preference-optimization huggingface llm pytorch supervised-finetuning transformers trl
Last synced: 05 Oct 2025
https://github.com/rasyosef/phi-1_5-instruct
Notebooks to create an instruction following version of Microsoft's Phi 1.5 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
direct-preference-optimization llm pytorch supervised-finetuning transformers trl
Last synced: 05 Oct 2025
https://github.com/cluebbers/adverserial-paraphrasing
Evaluate how LLaMA 3.1 8B handles paraphrased adversarial prompts targeting refusal behavior.
deep-learning direct-preference-optimization redteam reinforcement-learning
Last synced: 04 Sep 2025
https://github.com/akhilpandey95/llmscisci
Experiments, and how-to guide for the lecture "Large language models for Scientometrics"
direct-preference-optimization finetuning-llms in-context-learning llms reproducibility scientometrics
Last synced: 26 Feb 2026
https://github.com/cluebbers/dpo-rlhf-paraphrase-types
Enhancing paraphrase-type generation using Direct Preference Optimization (DPO) and Reinforcement Learning from Human Feedback (RLHF), with large-scale HPC support. This project aligns model outputs to human-ranked data for robust, safety-focused NLP.
alignment deep-learning direct-preference-optimization human-feedback paraphrase-generation paraphrase-type-generation reinforcement-learning transformers
Last synced: 02 Jul 2025
https://github.com/eliashornberg/epfllama
EPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, and quantization.
artificial-intelligence direct-preference-optimization large-language-models lora natural-language-processing pytorch supervised-finetuning
Last synced: 29 Jan 2026