https://github.com/gigio1023/alpaca-lora-for-huggingface
Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel
- Host: GitHub
- URL: https://github.com/gigio1023/alpaca-lora-for-huggingface
- Owner: gigio1023
- License: apache-2.0
- Created: 2023-03-15T04:00:16.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-04-03T09:06:05.000Z (about 2 years ago)
- Last Synced: 2025-03-12T17:43:40.426Z (3 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 22.6 MB
- Stars: 24
- Watchers: 2
- Forks: 3
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# 😊 Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel 😊
## Update Logs
!!! There is an error when you update torch to 2.0. I'll fix it as soon as possible !!!
## Features
- Multi-GPU training using DeepSpeed and Fully Sharded Data Parallel (FSDP) with Accelerate
- Training LLaMA using Hugging Face Transformers, LoRA, and PEFT
- Uses the causal language modeling (CLM) training example from the Hugging Face examples:
  - https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling
- You can use `alpaca_data.hf`, which has been converted for use with Hugging Face Datasets
- Splits the data into train and validation sets for CLM training (see the sketch after this list)
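As a rough illustration of the last point, loading the converted dataset from disk and carving out a validation split with Hugging Face Datasets could look like the sketch below. The 10% split ratio and the assumption that `alpaca_data.hf` holds a single split are illustrative, not values taken from this repo:

```python
# Illustrative sketch only: load the converted dataset and split it.
# Assumes alpaca_data.hf was saved with Dataset.save_to_disk as a single split.
from datasets import load_from_disk

dataset = load_from_disk("alpaca_data.hf")

# Carve out a validation set for CLM training (the 10% ratio is an assumption).
split = dataset.train_test_split(test_size=0.1, seed=42)
train_ds, eval_ds = split["train"], split["test"]

print(train_ds)  # training split
print(eval_ds)   # validation split
```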
## Dependency
```sh
pip install -r requirements.txt
```

## Train
```sh
# Use PEFT, LoRA
accelerate launch --config_file peft_config.yaml finetune.py
```

Or you can use the Hugging Face arguments to control all aspects of training. All of the Hugging Face training arguments can be used.
```sh
# You can use train.sh.
# Still updating...
python train.py \
--model_name_or_path decapoda-research/llama-7b-hf \
--dataset_name alpaca_data.hf \
--is_dataset_from_disk True \
--per_device_train_batch_size 8 \
--per_device_eval_batch_size 8 \
--do_train \
--do_eval \
--output_dir test-clm
```
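Under the hood, the fine-tuning entry points presumably wrap the base LLaMA model with a LoRA adapter via PEFT. The sketch below shows what that setup typically looks like; the model name matches the command above, but the LoRA rank, alpha, dropout, and target modules are illustrative assumptions rather than values taken from this repo:

```python
# Minimal LoRA/PEFT setup sketch (hyperparameters are assumptions, not repo values).
from transformers import LlamaForCausalLM, LlamaTokenizer
from peft import LoraConfig, TaskType, get_peft_model

base_model = LlamaForCausalLM.from_pretrained("decapoda-research/llama-7b-hf")
tokenizer = LlamaTokenizer.from_pretrained("decapoda-research/llama-7b-hf")

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # rank of the LoRA update matrices (assumed)
    lora_alpha=16,                        # LoRA scaling factor (assumed)
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt (assumed)
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()        # only the LoRA weights remain trainable
```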
The code is still being updated, so there may be some unexpected errors.
I used the base code from https://github.com/tloen/alpaca-lora. Thanks a lot!!