Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with direct-preference-optimization
A curated list of projects in awesome lists tagged with direct-preference-optimization .
https://github.com/eliashornberg/epfllama
EPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, and quantization.
artificial-intelligence direct-preference-optimization large-language-models lora natural-language-processing pytorch supervised-finetuning
Last synced: 30 Dec 2024