Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cosmic-heart/llm-sft-dpo-peft
Mistral 7b - SFT on Alpaca + PEFT + DPO on HH-RLHF.
https://github.com/cosmic-heart/llm-sft-dpo-peft
Last synced: 17 days ago
JSON representation
Mistral 7b - SFT on Alpaca + PEFT + DPO on HH-RLHF.
- Host: GitHub
- URL: https://github.com/cosmic-heart/llm-sft-dpo-peft
- Owner: cosmic-heart
- Created: 2023-10-05T19:08:06.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-10-26T09:42:33.000Z (over 1 year ago)
- Last Synced: 2025-01-19T11:30:03.214Z (22 days ago)
- Language: Python
- Homepage:
- Size: 16.6 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## LLM + PEFT + RLHF
## Build on:
- Pytorch## Dataset:
- SFT:
Anthropic/hh-rlhf- RLHF & DPO:
Stanford/alpaca