Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/leehanchung/lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
- Host: GitHub
- URL: https://github.com/leehanchung/lora-instruct
- Owner: leehanchung
- License: apache-2.0
- Created: 2023-05-07T06:28:40.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-07-29T05:46:11.000Z (4 months ago)
- Last Synced: 2024-10-29T07:34:27.812Z (16 days ago)
- Topics: agi, falcon, gpt, llama, llm, lora, mpt, nlp, redpajama
- Language: Python
- Homepage:
- Size: 20.7 MB
- Stars: 101
- Watchers: 3
- Forks: 13
- Open Issues: 12
- Metadata Files:
  - Readme: README.md
  - License: LICENSE
README
# :teacher:🤏 LoRA-Instruct
This repository contains code for fine-tuning permissive open source LLMs using [low-rank adaptation (LoRA)](https://arxiv.org/abs/2106.09685).
The code is tested with the [Stanford Alpaca](https://crfm.stanford.edu/2023/03/13/alpaca.html) dataset.
- Estimated training time for fine-tuning RedPajama-INCITE-Base-7B-v0.1 on Stanford Alpaca with a single RTX 3090 is ~12 hours.
- Estimated training time for fine-tuning RedPajama-INCITE-Base-7B-v0.1 on Stanford Alpaca with an RTX 3090 and an RTX Titan is ~6.5 hours.
- Currently only supports LoRA Instruct fine-tuning of [RedPajama-INCITE-Base-7B-v0.1](https://huggingface.co/togethercomputer/RedPajama-INCITE-Base-7B-v0.1).

Inspired by [Alpaca-LoRA](https://github.com/tloen/alpaca-lora).
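For reference, a minimal sketch of what a PEFT LoRA setup for one of the supported base models can look like. The hyperparameters (`r`, `lora_alpha`, `lora_dropout`, `target_modules`) are illustrative assumptions, not the values used by `finetune.py`:

```python
# Minimal PEFT LoRA sketch (illustrative hyperparameters, not the repo's exact settings).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, TaskType, get_peft_model

base_model = "togethercomputer/RedPajama-INCITE-Base-7B-v0.1"

tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(base_model, torch_dtype=torch.float16)

# Wrap the frozen base model with small trainable low-rank adapter matrices.
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # rank of the adapter update matrices
    lora_alpha=16,                        # scaling factor applied to the adapters
    lora_dropout=0.05,
    target_modules=["query_key_value"],   # attention projection in GPT-NeoX-style models
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()        # only the adapter weights require gradients
```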
## Trained Models
| Model | Runs | Training Time (h:mm:ss) | Link |
|:-------|:----:|:----:|:-----:|
| LLaMA 3B | :white_large_square: | | |
| LLaMA 7B | :white_large_square: | | |
| RedPajama 3B | :white_check_mark: | 1:44:14 | |
| RedPajama 7B | :white_check_mark: | 3:09:58 | |
| MPT 3B | :white_large_square: | | |
| MPT 7B | :white_large_square: | | |
| Falcon 7B | :white_check_mark: | | |

#### Training Hardware Spec
```
Ubuntu 20.04.1 LTS (WSL2)
Driver Version: 531.41
CUDA Version: 12.1
cuDNN version: 8.5.0
```

### Local Setup
Install dependencies
```bash
poetry install
```

To fine-tune using an NVIDIA 2000-series GPU or earlier, comment out this line in `finetune.py`:
```python
model = prepare_model_for_int8_training(model)
```
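For context, a hedged sketch of how that line is typically used: on GPUs with int8 support, the base model is loaded in 8-bit and then prepared for training. This illustrates the usual `transformers`/`peft` pattern and is not a copy of `finetune.py`:

```python
# Illustrative int8 loading pattern (assumed setup, not the repo's exact code).
from transformers import AutoModelForCausalLM
from peft import prepare_model_for_int8_training

model = AutoModelForCausalLM.from_pretrained(
    "togethercomputer/RedPajama-INCITE-Base-7B-v0.1",
    load_in_8bit=True,    # requires bitsandbytes and an int8-capable GPU
    device_map="auto",
)
# Freezes the base weights and casts selected layers to fp32 for stable int8 training;
# this is the call to skip on NVIDIA 2000-series GPUs or earlier.
model = prepare_model_for_int8_training(model)
```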
### Training (`finetune.py`)

This file contains a straightforward application of PEFT / LoRA to a decoder-only model, as well as some code related to prompt construction and tokenization.

Example usage:
```bash
python finetune.py \
--base_model 'togethercomputer/RedPajama-INCITE-Base-7B-v0.1' \
--output_dir './lora-redpajama'
```
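To make the prompt-construction step concrete, here is a hedged sketch of how an Alpaca record is typically rendered into a prompt and tokenized. The template follows the standard Stanford Alpaca convention and `cutoff_len` is an illustrative value; both may differ from what `finetune.py` actually does:

```python
# Illustrative Alpaca-style prompt construction and tokenization (not the repo's exact code).
from transformers import AutoTokenizer

PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input that provides "
    "further context. Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n{output}"
)

tokenizer = AutoTokenizer.from_pretrained("togethercomputer/RedPajama-INCITE-Base-7B-v0.1")

def tokenize_example(example, cutoff_len=256):
    """Render one Alpaca record into a prompt and tokenize it for causal-LM training."""
    prompt = PROMPT_TEMPLATE.format(**example)
    tokens = tokenizer(prompt, truncation=True, max_length=cutoff_len, padding=False)
    # For causal-LM fine-tuning, the labels are simply the input ids.
    tokens["labels"] = tokens["input_ids"].copy()
    return tokens

record = {
    "instruction": "Summarize the text.",
    "input": "LoRA adds low-rank adapter matrices to a frozen model.",
    "output": "LoRA fine-tunes a model by training only small adapter matrices.",
}
batch = tokenize_example(record)
```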
#### Distributed Training with 🤗 Accelerate

We use Hugging Face's `accelerate` library for distributed training. The following is an example of distributed training with two GPUs.
* NOTE: please set the following environment variables
```bash
export WORLD_SIZE=2
export CUDA_VISIBLE_DEVICES=0,1
```

```bash
torchrun \
--nproc_per_node=2 \
--master_port=1234 \
finetune.py \
--base_model 'togethercomputer/RedPajama-INCITE-Base-7B-v0.1' \
--output_dir './lora-redpajama'
```
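The same script can also be driven by `accelerate`'s own launcher instead of `torchrun`. A hedged sketch, assuming a default `accelerate config`; the flags shown are standard `accelerate launch` options, not something specific to this repo:

```bash
accelerate launch \
    --multi_gpu \
    --num_processes=2 \
    finetune.py \
    --base_model 'togethercomputer/RedPajama-INCITE-Base-7B-v0.1' \
    --output_dir './lora-redpajama'
```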
## References

- [LoRA: Low-Rank Adaptation of Large Language Models](https://arxiv.org/abs/2106.09685)
- [State-of-the-art Parameter-Efficient Fine-Tuning (PEFT) methods](https://github.com/huggingface/peft)
- [Stanford Alpaca: An Instruction-following LLaMA Model](https://github.com/tatsu-lab/stanford_alpaca)
- [EMNLP 2022 Tutorial: Modular and Parameter-Efficient Fine-Tuning for NLP Models](https://www.youtube.com/watch?v=KoOlcX3XLd4)