https://github.com/sarabesh/finetuning
Repo that serves as a baseline and guide for post-training (SFT/RLHF) modern LLMs and evaluating them on baseline datasets.
evaluation finetune-llms finetuning huggingface rlhf sft
Last synced: 10 months ago
- Host: GitHub
- URL: https://github.com/sarabesh/finetuning
- Owner: sarabesh
- Created: 2025-08-03T07:46:41.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2025-08-03T07:55:56.000Z (10 months ago)
- Last Synced: 2025-08-03T09:24:20.820Z (10 months ago)
- Topics: evaluation, finetune-llms, finetuning, huggingface, rlhf, sft
- Language: Python
- Homepage:
- Size: 1000 Bytes
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md