An open API service indexing awesome lists of open source software.

https://github.com/p1atdev/vision-ft


https://github.com/p1atdev/vision-ft

Last synced: about 1 year ago
JSON representation

Awesome Lists containing this project

README

          

# Vision model finetuning scripts

WIP

## Features

- NF4 model loading
- QLoRA with bitsandbytes
- [Flash Attention](https://github.com/Dao-AILab/flash-attention) support
- Aspect ratio bucketing

### Models

- [x] SDXL
- [x] AuraFlow
- [ ] Flux & Flex (WIP)
- [ ] CogView4 (WIP)
- [ ] FractalGen (TODO)
- [ ] Wan 2.1 (TODO)

## Setup

```bash
uv sync --extra build
uv sync --all-extras
```

## References

- https://github.com/kohya-ss/sd-scripts
- Heavily inspired by this repository.

- https://github.com/cloneofsimo/minRF
- https://github.com/huggingface/diffusers
- https://github.com/Stability-AI/sd3-ref
- https://github.com/black-forest-labs/flux
- Model implementation

- https://github.com/microsoft/LoRA
- https://github.com/huggingface/peft
- Peft logic and implementation

- https://github.com/huggingface/diffusers
- https://github.com/Lightning-AI/pytorch-lightning
- https://github.com/ostris/ai-toolkit
- Traning cycle and API design

- https://github.com/NovelAI/novelai-aspect-ratio-bucketing
- Aspect ratio bucketing

- https://github.com/bitsandbytes-foundation/bitsandbytes
- Quantization