https://github.com/dominodatalab/aihub_llama3_1_sft

AI Hub template to fine tune llama3.1-8b
https://github.com/dominodatalab/aihub_llama3_1_sft

Last synced: 4 months ago
JSON representation

AI Hub template to fine tune llama3.1-8b

Host: GitHub
URL: https://github.com/dominodatalab/aihub_llama3_1_sft
Owner: dominodatalab
License: apache-2.0
Created: 2024-07-30T23:26:29.000Z (12 months ago)
Default Branch: release-1.0.0
Last Pushed: 2024-07-31T16:56:46.000Z (12 months ago)
Last Synced: 2025-01-24T18:27:00.020Z (6 months ago)
Language: Jupyter Notebook
Size: 150 KB
Stars: 1
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md

Awesome Lists containing this project

README

        # Fine tuning Llama3.1-8b using Huggingface SFT

## About this project

This project shows how to fine tune Llama3.1-8b using supervised fine tuning. We use the `SFTTrainer` that's available in the `trl` library from Huggingface. In order to fine tune the model, we will load it in 4bit or 8bit, train a LoRA adapter and merge it back to the base `llama3.1-8b` model.

The assets available in this project are:

*`llama3_sft.ipynb`* - This notebook includes all the code necessary to download the dataset, model files and to run the supervised fine tuning for *llama3.1-8b*.

*`model.py`* - A python file which is used to deploy the fine-tuned model as a Domino model API .

*`app.py`* - A Python file that loads the model, the LoRA adapter and allows for users to interact with the model as a Streamlit app .

*`app.sh`* - This script has launch instructions for the accompanying Streamlit app.

**Note : The first load of the app and model API takes time as it loads the fine tuned model into memory so please set the `Override request timeout` in the model API settings to an appropriate number**

## License

This template is licensed under Apache 2.0 and contains the following components: 

* mlflow [Apache License 2.0](https://github.com/mlflow/mlflow/blob/master/LICENSE.txt)

* accelerate [Apache License 2.0](https://github.com/huggingface/accelerate/blob/main/LICENSE)

* bitsandbytes [MIT License](https://github.com/TimDettmers/bitsandbytes/blob/main/LICENSE)

* datasets [Apache License 2.0](https://github.com/huggingface/datasets/blob/main/LICENSE)

* peft [Apache License 2.0](https://github.com/huggingface/peft/blob/main/LICENSE)

* streamlit [Apache License 2.0](https://github.com/streamlit/streamlit/blob/develop/LICENSE)

* transformers [Apache License 2.0](https://github.com/huggingface/transformers/blob/main/LICENSE)

* trl [Apache License 2.0](https://github.com/huggingface/trl/blob/main/LICENSE)

## Set up instructions

This project requires the following [compute environments](https://docs.dominodatalab.com/en/latest/user_guide/f51038/environments/) to be present. Please ensure the "Automatically make compatible with Domino" checkbox is selected while creating the environment.

### Environment Requirements (not needed if using the Domino AI Hub template)

**Environment Base**

***base image :*** `nvcr.io/nvidia/pytorch:23.10-py3`

***Dockerfile instructions***

```

RUN pip install "transformers>=4.43.2" "peft>=0.7.1,!=0.11.0" "trl>=0.7.9,<0.9.0" \

"bitsandbytes==0.43.2" "accelerate>=0.26.1" "streamlit==1.37.0" \

"mlflow==2.12.0" "uWSGI==2.0.26" \

"Flask==3.0.3" "Flask-Compress==1.15" "Flask-Cors==4.0.1" "jsonify==0.5"

RUN pip uninstall --yes transformer-engine

```

***Pluggable Workspace Tools** 

```

jupyterlab:

  title: "JupyterLab"

  iconUrl: "/assets/images/workspace-logos/jupyterlab.svg"

  start: ["/opt/domino/bin/jupyterlab-start.sh"]

  httpProxy:

    internalPath: "/{{ownerUsername}}/{{projectName}}/{{sessionPathComponent}}/{{runId}}/{{#if pathToOpen}}tree/{{pathToOpen}}{{/if}}"

    port: 8888

    rewrite: false

    requireSubdomain: false

vscode:

 title: "vscode"

 iconUrl: "/assets/images/workspace-logos/vscode.svg"

 start: [ "/opt/domino/bin/vscode-start.sh" ]

 httpProxy:

    port: 8888

    requireSubdomain: false

```

Please change the value in `start` according to your Domino version. This repo has been tested with Domino 5.10.0 and 5.11.0 .

### Hardware Requirements

This project will run on any Nvidia GPU with >=24GB of VRAM. Also ensure that the `Workspace and Jobs Volume Size` setting of the workspace is set to 100GB as the model files and datasets can get large.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dominodatalab/aihub_llama3_1_sft

Awesome Lists containing this project

README