https://github.com/niutrans/vision-llm-alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
https://github.com/niutrans/vision-llm-alignment
alignment dpo llama3-vision llava llm mllm multi-model ppo reward rlhf sft vision
Last synced: 2 months ago
JSON representation
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
- Host: GitHub
- URL: https://github.com/niutrans/vision-llm-alignment
- Owner: NiuTrans
- Created: 2024-06-29T14:53:25.000Z (12 months ago)
- Default Branch: master
- Last Pushed: 2024-10-16T10:25:30.000Z (8 months ago)
- Last Synced: 2025-03-30T06:08:44.099Z (3 months ago)
- Topics: alignment, dpo, llama3-vision, llava, llm, mllm, multi-model, ppo, reward, rlhf, sft, vision
- Language: Python
- Homepage:
- Size: 153 MB
- Stars: 104
- Watchers: 3
- Forks: 8
- Open Issues: 1
-
Metadata Files:
- Readme: README.md