https://github.com/tidedra/vl-rlhf
A RLHF Infrastructure for Vision-Language Models
https://github.com/tidedra/vl-rlhf
dpo llm lmm mllm rlhf vlm
Last synced: 4 months ago
JSON representation
A RLHF Infrastructure for Vision-Language Models
- Host: GitHub
- URL: https://github.com/tidedra/vl-rlhf
- Owner: TideDra
- License: apache-2.0
- Created: 2023-12-27T03:10:07.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-11-15T09:16:05.000Z (11 months ago)
- Last Synced: 2025-06-19T11:12:43.560Z (4 months ago)
- Topics: dpo, llm, lmm, mllm, rlhf, vlm
- Language: Python
- Homepage:
- Size: 3.8 MB
- Stars: 176
- Watchers: 4
- Forks: 7
- Open Issues: 11
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md