https://github.com/hpprc/prompteol

Simplified re-implementation of PromptEOL
https://github.com/hpprc/prompteol

Last synced: 9 months ago
JSON representation

Simplified re-implementation of PromptEOL

Host: GitHub
URL: https://github.com/hpprc/prompteol
Owner: hppRC
Created: 2023-11-01T02:50:43.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2023-11-02T12:31:03.000Z (about 2 years ago)
Last Synced: 2025-02-10T13:37:31.392Z (10 months ago)
Language: Python
Homepage:
Size: 15.6 KB
Stars: 3
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Unofficial Re-Implementeation of PromptEOL

Please refer [the exciting original paper](https://arxiv.org/abs/2307.16645) and [the original implementation](https://github.com/kongds/scaling_sentemb).

## Instllation

```
rye sync

# load .envrc
direnv allow
```

## Usage

We have three implementations.

- `src/qlora.py`: Fine-tuning with QLoRA (w/ 4 or 8 bit quantization)
- `src/lora.py`: Fine-tuning with LoRA (w/o quantization)
- `src/full.py`: Full-parameter fine-tuning

```
# Use 2 GPUs (cuda:0, cuda:1)
accelerate launch --config_file accelerate.json src/qlora.py

# Use 2 GPUs (cuda:2, cuda:3)
CUDA_VISIBLE_DEVICES=2,3 accelerate launch --config_file accelerate.json src/qlora.py

# Use 4 GPUs
accelerate launch --config_file accelerate.json --num_processes 4 src/qlora.py
```

One of the most significant differences from the original implementation is that we evaluate at regular steps during training, leaving only the best performing checkpoints to reduce storage space and effort.

Also, in our code, we omitted evaluation by SentEval for simplicity.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/hpprc/prompteol

Awesome Lists containing this project

README