https://github.com/niutrans/vision-llm-alignment

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
https://github.com/niutrans/vision-llm-alignment

alignment dpo llama3-vision llava llm mllm multi-model ppo reward rlhf sft vision

Last synced: over 1 year ago
JSON representation

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Host: GitHub
URL: https://github.com/niutrans/vision-llm-alignment
Owner: NiuTrans
Created: 2024-06-29T14:53:25.000Z (about 2 years ago)
Default Branch: master
Last Pushed: 2024-10-16T10:25:30.000Z (over 1 year ago)
Last Synced: 2025-03-30T06:08:44.099Z (over 1 year ago)
Topics: alignment, dpo, llama3-vision, llava, llm, mllm, multi-model, ppo, reward, rlhf, sft, vision
Language: Python
Homepage:
Size: 153 MB
Stars: 104
Watchers: 3
Forks: 8
Open Issues: 1
Metadata Files:
- Readme: README.md

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/niutrans/vision-llm-alignment

Awesome Lists containing this project