An open API service indexing awesome lists of open source software.

https://github.com/nakjun/llama-3.2-1b-instruct-korquad-finetune

Llama-3.2-1B-Instruct model and KorQuAD dataset Finetune project
https://github.com/nakjun/llama-3.2-1b-instruct-korquad-finetune

fine-tuning llama sllm

Last synced: 4 months ago
JSON representation

Llama-3.2-1B-Instruct model and KorQuAD dataset Finetune project

Awesome Lists containing this project

README

        

# Llama-3.2-1B-Instruct-korQuAD-v1

๐Ÿค— [**Hugging Face**](https://huggingface.co/NakJun/Llama-3.2-1B-Instruct-korQuAD-v1) - **21.9K Downloads**

**Llama-3.2-1B-Instruct๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ๊ตญ์–ด ์งˆ์˜์‘๋‹ต ํƒœ์Šคํฌ๋ฅผ fine-tuning, inference, evaluation ํ•˜๋Š” ํ”„๋กœ์ ํŠธ**

## basic
- ๊ธฐ๋ณธ ๋ชจ๋ธ: Llama-3.2-1B-Instruct
- ํ•™์Šต ๋ฐ์ดํ„ฐ์…‹: KorQuAD v1.0
- ํ•™์Šต ๋ฐฉ๋ฒ•: LoRA (Low-Rank Adaptation)
- ์ฃผ์š” ํƒœ์Šคํฌ: ํ•œ๊ตญ์–ด ์งˆ์˜์‘๋‹ต

## history
### v1.0.0(2024-10-02)
- ์ดˆ๊ธฐ ๋ฒ„์ „ ์—…๋กœ๋“œ
- KorQuAD v1.0 ๋ฐ์ดํ„ฐ์…‹ ํŒŒ์ธํŠœ๋‹

### v1.1.0(2024-10-30)
- ๋ชจ๋ธ ํ”„๋กฌํ”„ํŠธ ๋ฐ ํ•™์Šต ๋ฐฉ๋ฒ• ๊ฐœ์„ 
- KorQuAD evaluate ์ฝ”๋“œ ์ ์šฉ

## evaluation
| ๋ชจ๋ธ | Exact Match | F1 Score |
|------|-------------|----------|
| Llama-3.2-1B-Instruct-v1 | 18.86 | 37.2 |
| Llama-3.2-1B-Instruct-v2 | 36.07 | 59.03 |
โ€ป https://korquad.github.io/category/1.0_KOR.html์˜ evaluation script ์‚ฌ์šฉ

## code description
```
1. fine_tuning.py
- ๋ชจ๋ธ ํŒŒ์ธํŠœ๋‹ ์ฝ”๋“œ
- training_params ๋ณ€๊ฒฝํ•˜์—ฌ ํ•™์Šต ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ์ˆ˜์ • ๊ฐ€๋Šฅ

2. inference.py
- ๋ชจ๋ธ ์ธํผ๋Ÿฐ์Šค ์ฝ”๋“œ
- ๋ชจ๋ธ ๊ฒฝ๋กœ ๋ณ€๊ฒฝ ํ›„ ์‚ฌ์šฉ ๊ฐ€๋Šฅ

3. evaluation.py
- ๋ชจ๋ธ ์ด๋ฐธ๋ฅ˜์—์ด์…˜ ์ฝ”๋“œ
- KorQuAD evaluate๋ฅผ ์œ„ํ•œ json ์ €์žฅ

4. evaluate-v1.0.py
- KorQuAD evaluate ์ฝ”๋“œ ์ ์šฉ
- ํ‰๊ฐ€ ์ง€ํ‘œ ์ถœ๋ ฅ(Exact Match, F1 Score)
$python evaluate-v1.0.py [dataset_file] [prediction_file]
```

## learning parameters
- step: 2000
- ๋ฐฐ์น˜ ํฌ๊ธฐ: 1
- ํ•™์Šต๋ฅ : 2e-4
- ์˜ตํ‹ฐ๋งˆ์ด์ €: AdamW (32-bit)
- LoRA ์„ค์ •:
- r: 16
- lora_alpha: 16
- ๋Œ€์ƒ ๋ชจ๋“ˆ: ["q_proj", "v_proj", "k_proj", "o_proj", "gate_proj", "down_proj", "up_proj"]
- lora_dropout: 0.01

## Contact
- [email protected]
- https://github.com/nakjun
- https://huggingface.co/NakJun/Llama-3.2-1B-Instruct-korQuAD-v1