https://github.com/shakil1819/qwen2.5-3b-grpo-finetuned-lora-rag-pipeline
https://github.com/shakil1819/qwen2.5-3b-grpo-finetuned-lora-rag-pipeline
dataset finetuning-llms gpu llm qwen2-5 rag unsloth vllm
Last synced: 7 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/shakil1819/qwen2.5-3b-grpo-finetuned-lora-rag-pipeline
- Owner: shakil1819
- Created: 2025-05-11T10:14:17.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2025-05-11T20:32:05.000Z (6 months ago)
- Last Synced: 2025-06-08T19:37:16.508Z (5 months ago)
- Topics: dataset, finetuning-llms, gpu, llm, qwen2-5, rag, unsloth, vllm
- Language: Jupyter Notebook
- Homepage:
- Size: 13.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Qwen2.5-3B-GRPO-Finetuned-LoRA-RAG-Pipeline