An open API service indexing awesome lists of open source software.

https://github.com/bytedance-seed/deepflow

[ICCV 2025] Deeply Supervised Flow-Based Generative Models
https://github.com/bytedance-seed/deepflow

Last synced: 3 months ago
JSON representation

[ICCV 2025] Deeply Supervised Flow-Based Generative Models

Awesome Lists containing this project

README

          


👋 Hi, everyone!


We are ByteDance Seed team.


You can get to know us better through the following channels👇










![seed logo](https://github.com/user-attachments/assets/c42e675e-497c-4508-8bb9-093ad4d1f216)

# Deeply Supervised Flow-Based Generative Models

### [ICCV 2025]








Inkyu Shin·
Chenglin Yang·
Liang-Chieh Chen

DeepFlow is a novel flow-based generation framework that enhances velocity representation through inter layer communication using deep supervision and acceleration mechanism.

DeepFlow converges 8 times faster on ImageNet with equivalent performance and further reduces FID by 2.6 while halving training time compared to previous flow based models without a classifier free guidance.

# News
[2025/06/25]🎉DeepFlow is accepted to ICCV 2025.

[2025/05/07]🔥We release training code, sampling code, and checkpoints of DeepFlow.

# Introduction
DeepFlow incorporates deep supervision by evenly adding velocity prediction within transformer blocks, further enhanced by the
proposed Velocity Alignment block (VeRA).


teaser

# Getting started
## Installation
```shell
pip install -r requirements.txt
```

## Dataset
We conducted our experiments using [ImageNet](https://www.kaggle.com/competitions/imagenet-object-localization-challenge/data).
Kindly refer to preprocessing guide from [REPA](https://github.com/sihyun-yu/REPA/tree/main/preprocessing) to preprocess the dataset.

## Training
For training DeepFlow-XL/2-3T, use below training script.
```shell
bash script/xlarge/train.sh
```
where you can set following options:
- `--tg-upper-bound`: time-gap between adjacent branches.
- `--weighting`: time-step sampling during training.
- `--df-idxs`: key transformer layers where deep supervision is applied. (currently, equally-splitted).
- `--ssl-align`: whether to use SSL align (need to set up `--enc-type` as well).
- `--legacy-scaling`: whether to use soft-cap for scaling factor in "velocity_modulation" (enable this when ssl-align is enabled for reproducibility).

## Generation
For generative samples from DeepFlow-XL/2-3T, use below evaluation script.
```shell
bash script/xlarge/eval.sh
```
where you need to set specific path for checkpoint that will be evaluated.

Please follow [ADM evaluation](https://github.com/openai/guided-diffusion/tree/main/evaluations) for obtaining FID score.

## Model Zoo

We release checkpoint of DeepFlow trained on ImageNet for your reference.

| Dataset | Model | SSL align | Training Epochs | Link | FID (wo/ CFG) | FID (w/ CFG) |
| ------------- | ------------- | ------------- | ------------- | ------------- | ------------- | ------------- |
| ImageNet-256px | DeepFlow-XL/2-3T | X | 400 | [checkpoint](https://drive.google.com/file/d/1QXTDkeVRgfvBvyLkJcu-YJ_2pDxM73Ec/view?usp=sharing) | 7.2 | 1.97 |
| ImageNet-256px | DeepFlow-XL/2-3T | O | 400 | [checkpoint](https://drive.google.com/file/d/1ALMz0Cdm1XSZYWTPSVEVR-sfYXsKk6Zl/view?usp=sharing) | 5.0 | 1.77 |

# License
The code and model weights are licensed under CC-BY-NC. See LICENSE.txt for details.

# Citation
If you find DeepFlow useful for your research and applications, feel free to give us a star ⭐ or cite us using:

```bibtex
@article{shin2025deeply,
title={Deeply Supervised Flow-Based Generative Models},
author={Shin, Inkyu and Yang, Chenglin and Chen, Liang-Chieh},
journal={arXiv preprint arXiv:2503.14494},
year={2025}
}
```

## Acknowledgement

[DiT](https://github.com/facebookresearch/DiT)

[SiT](https://github.com/willisma/SiT)

[REPA](https://github.com/sihyun-yu/REPA)

# About [ByteDance Seed Team](https://team.doubao.com/)

Founded in 2023, ByteDance Seed Team is dedicated to crafting the industry's most advanced AI foundation models. The team aspires to become a world-class research team and make significant contributions to the advancement of science and society.