https://github.com/tyshiwo1/Accelerating-T2I-AR-with-SJD

[ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

Host: GitHub
URL: https://github.com/tyshiwo1/Accelerating-T2I-AR-with-SJD
Owner: tyshiwo1
Created: 2024-10-07T14:48:45.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2025-04-21T04:13:09.000Z (7 months ago)
Last Synced: 2025-04-21T05:27:45.872Z (7 months ago)
Language: Python
Homepage: https://arxiv.org/abs/2410.01699
Size: 1.17 MB
Stars: 35
Watchers: 2
Forks: 3
Open Issues: 1
Metadata Files:
- Readme: README.md


# SJD: Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

[Yao Teng](https://tyshiwo1.github.io/)¹, [Han Shi](https://han-shi.github.io/)², [Xian Liu](https://alvinliu0.github.io/)³, [Xuefei Ning](https://nics-effalg.com/ningxuefei/)⁴, [Guohao Dai](https://dai.sjtu.edu.cn/)⁵,⁶, [Yu Wang](https://scholar.google.com.hk/citations?user=j8JGVvoAAAAJ)⁴, [Zhenguo Li](https://zhenguol.github.io/)², and [Xihui Liu](https://xh-liu.github.io/)¹.

*¹The University of Hong Kong, ²Huawei Noah’s Ark Lab, ³CUHK, ⁴Tsinghua University, ⁵Shanghai Jiao Tong University, ⁶Infinigence AI*

  

## 🚩 New Features/Updates

- ✅ Apr 2025. 💥 **SJD** has been integrated into [Lumina-mGPT 2.0](https://github.com/Alpha-VLLM/Lumina-mGPT-2.0) and [SimpleAR](https://github.com/wdrink/SimpleAR).

- ✅ Jan 2025. 💥 **SJD** has been accepted to ICLR 2025.

- ✅ Oct 2024. Released the **SJD** code.

## 🚩 TODO List

- □ Integrate SJD into the vLLM framework for further acceleration.

## Installing the dependencies

##### Environment: 

- Python 3.10

- CUDA 12.5

- PyTorch 2.5.1+cu124

- Transformers 4.47.1

##### Install from `yaml`:

```bash
conda env create -f environment.yaml
```
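After creating the environment, it can be useful to confirm that the installed packages match the versions listed above. The following is a minimal sketch, not part of the repository: the function names `installed_version` and `check_environment` are hypothetical, and it assumes the pip distribution names `torch` and `transformers`.

```python
import sys
from importlib import metadata

# Versions copied from the Environment list above.
EXPECTED = {"torch": "2.5.1", "transformers": "4.47.1"}
EXPECTED_PYTHON = (3, 10)


def installed_version(package):
    """Return the installed version string of `package`, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def check_environment(expected=EXPECTED, lookup=installed_version):
    """Return a list of human-readable mismatches (empty when everything matches)."""
    problems = []
    for package, wanted in expected.items():
        found = lookup(package)
        if found is None:
            problems.append(f"{package}: not installed (expected {wanted})")
        # Drop local build tags such as "+cu124" before comparing.
        elif found.split("+")[0] != wanted:
            problems.append(f"{package}: found {found} (expected {wanted})")
    return problems


# Report the interpreter version and any package mismatches.
if sys.version_info[:2] != EXPECTED_PYTHON:
    print(f"python: found {sys.version_info[0]}.{sys.version_info[1]} (expected 3.10)")
for problem in check_environment():
    print(problem)
```

The script prints nothing when the interpreter and both packages match the listed versions. It does not check the CUDA toolkit version.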

## Performance

- Results on [Lumina-mGPT](https://github.com/Alpha-VLLM/Lumina-mGPT) 

  

- Results on [Emu3](https://github.com/baaivision/Emu3) 

  

## Text-to-Image with SJD
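For readers new to the term, the plain (greedy) Jacobi iteration that SJD builds on can be illustrated with a toy example. This is a sketch under stated assumptions and not the repository's implementation: `next_token` is a hypothetical deterministic stand-in for a model's greedy next-token choice, and the probabilistic convergence criterion that the paper adds for sampling-based decoding is deliberately left out.

```python
def next_token(prefix):
    # Hypothetical deterministic "model": stands in for a greedy argmax over logits.
    return (31 * sum(prefix) + len(prefix)) % 17


def autoregressive_decode(prompt, n):
    """Generate n tokens one at a time, each conditioned on all previous tokens."""
    seq = list(prompt)
    for _ in range(n):
        seq.append(next_token(seq))
    return seq[len(prompt):]


def jacobi_decode(prompt, n):
    """Refine a whole draft of n tokens per iteration until it stops changing."""
    draft = [0] * n  # arbitrary initial guess
    iterations = 0
    while True:
        iterations += 1
        # A real model would compute all n positions in one parallel forward pass;
        # the list comprehension only simulates that.
        new = [next_token(list(prompt) + draft[:i]) for i in range(n)]
        if new == draft:  # fixed point reached: every token agrees with its prefix
            return draft, iterations
        draft = new


prompt = [3, 1, 4]
reference = autoregressive_decode(prompt, 8)
tokens, iterations = jacobi_decode(prompt, 8)
print(tokens == reference, iterations)
```

In this greedy setting the fixed point is exactly the autoregressive output, and it is reached in at most `n + 1` iterations, because each iteration makes at least one more leading token correct. Any speed-up comes from iterations that fix several tokens at once.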

#### Lumina-mGPT

```bash
CUDA_VISIBLE_DEVICES=0 python test_lumina_mgpt.py
```

#### Emu3

```bash
CUDA_VISIBLE_DEVICES=0 python test_emu3.py
```

#### LlamaGen

```bash
CUDA_VISIBLE_DEVICES=0 python test_llamagen.py
```
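The three commands above differ only in the script name, so they can be wrapped in a small launcher that selects the backend and the GPU. This is a hypothetical helper, not something shipped with the repository: the names `SCRIPTS`, `build_command`, `build_env`, and `run` are illustrative, and it assumes it is called from the repository root, where the test scripts live.

```python
import os
import subprocess
import sys

# Script names are taken from the commands above.
SCRIPTS = {
    "lumina-mgpt": "test_lumina_mgpt.py",
    "emu3": "test_emu3.py",
    "llamagen": "test_llamagen.py",
}


def build_command(model):
    """Return the argv list for one backend; raises KeyError on an unknown name."""
    return [sys.executable, SCRIPTS[model]]


def build_env(gpu_id, base_env=None):
    """Copy the environment and pin the visible GPU, like CUDA_VISIBLE_DEVICES=<id>."""
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return env


def run(model, gpu_id=0):
    """Run one test script in the current directory and return its exit code."""
    return subprocess.run(build_command(model), env=build_env(gpu_id)).returncode


# Example (not executed here): run("emu3", gpu_id=1)
```

Using `sys.executable` keeps the launch inside the active conda environment rather than whichever `python` happens to be first on `PATH`.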

## Acknowledgements

Our code is based on [Lumina-mGPT](https://github.com/Alpha-VLLM/Lumina-mGPT), [Emu3](https://github.com/baaivision/Emu3), [LlamaGen](https://github.com/FoundationVision/LlamaGen), [Anole](https://github.com/GAIR-NLP/anole), and [CLLM](https://github.com/hao-ai-lab/Consistency_LLM). We would like to express our gratitude to [Tianwei Xiong](https://github.com/SilentView) for his assistance.

## Citation

```bibtex
@article{teng2024accelerating,
  title={Accelerating auto-regressive text-to-image generation with training-free speculative jacobi decoding},
  author={Teng, Yao and Shi, Han and Liu, Xian and Ning, Xuefei and Dai, Guohao and Wang, Yu and Li, Zhenguo and Liu, Xihui},
  journal={arXiv preprint arXiv:2410.01699},
  year={2024}
}
```
