Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/SobeyMIL/TVG
Code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"
diffusion-models transition-video-generation
- Host: GitHub
- URL: https://github.com/SobeyMIL/TVG
- Owner: SobeyMIL
- License: apache-2.0
- Created: 2024-08-19T04:03:31.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2024-08-19T13:54:34.000Z (5 months ago)
- Last Synced: 2024-08-31T20:03:18.731Z (4 months ago)
- Topics: diffusion-models, transition-video-generation
- Language: Python
- Homepage: https://sobeymil.github.io/tvg.com/
- Size: 46.4 MB
- Stars: 21
- Watchers: 1
- Forks: 4
- Open Issues: 0
- Metadata Files:
  - Readme: README.md
  - License: LICENSE
Awesome Lists containing this project
- awesome-diffusion-categorized
README
## ___***TVG: A Training-free Transition Video Generation Method with Diffusion Models***___
## Introduction
TVG is a training-free transition video generation method. Given the initial and final frames, it improves the transition videos produced by video diffusion models without any additional training.
![Framework](assets/wave.png)
Transition videos play a crucial role in media production, enhancing the flow and coherence of visual narratives. Traditional methods like morphing often lack artistic appeal and require specialized skills, limiting their effectiveness. Recent advances in diffusion-model-based video generation offer new possibilities for creating transitions but face challenges such as poor inter-frame relationship modeling and abrupt content changes. We propose a novel training-free Transition Video Generation (TVG) approach using video-level diffusion models that addresses these limitations without additional training. Our method leverages Gaussian Process Regression (GPR) to model latent representations, ensuring smooth and dynamic transitions between frames. Additionally, we introduce interpolation-based conditional controls and a Frequency-aware Bidirectional Fusion (FBiF) architecture to enhance temporal control and transition reliability. Evaluations on benchmark datasets and custom image pairs demonstrate the effectiveness of our approach in generating high-quality, smooth transition videos.
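For intuition, here is a minimal sketch of the GPR idea described above: fit a Gaussian process over the two endpoint latents and read off a smooth latent trajectory at intermediate timesteps. This is illustrative only, not the repository's implementation; the helper name, latent shapes, kernel choice, and length scale are all assumptions.

```python
# Illustrative sketch (NOT the repo's code): GPR interpolation between two
# endpoint latents, yielding a smooth trajectory of intermediate latents.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

def gpr_interpolate_latents(z0, z1, num_frames=16, length_scale=0.5):
    """Fit a GP on the two endpoint latents (at t=0 and t=1) and predict
    latents at evenly spaced intermediate timesteps."""
    X = np.array([[0.0], [1.0]])             # endpoint timesteps
    Y = np.stack([z0.ravel(), z1.ravel()])   # (2, D) training targets
    gp = GaussianProcessRegressor(kernel=RBF(length_scale=length_scale))
    gp.fit(X, Y)
    t = np.linspace(0.0, 1.0, num_frames)[:, None]
    Z = gp.predict(t)                        # (num_frames, D)
    return Z.reshape(num_frames, *z0.shape)

# Toy usage with random 4x32x32 "latents" (shapes are assumptions).
frames = gpr_interpolate_latents(np.random.randn(4, 32, 32),
                                 np.random.randn(4, 32, 32))
print(frames.shape)  # (16, 4, 32, 32)
```

Unlike plain linear interpolation, the GP's kernel lets the trajectory bend smoothly between the endpoints, which is the property the paper exploits for non-abrupt transitions.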
## ⚙️ Setup
### Install Environment via Anaconda (Recommended)
```bash
conda create -n TVG python=3.8.5
conda activate TVG
pip install -r requirements.txt
```

### Preparation
Please follow the instructions in [DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter) to download the [DynamiCrafter512_interp](https://huggingface.co/Doubiiu/DynamiCrafter_512_Interp/blob/main/model.ckpt) model and place it at `checkpoints/dynamicrafter_512_interp_v1/model.ckpt`.
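If you prefer to script the download, the sketch below (a convenience assumption, not part of the official instructions) fetches the checkpoint with `huggingface_hub`, using the repo id and filename from the link above:

```python
# Hypothetical convenience script; the README only requires that model.ckpt
# end up at checkpoints/dynamicrafter_512_interp_v1/model.ckpt.
from huggingface_hub import hf_hub_download

hf_hub_download(
    repo_id="Doubiiu/DynamiCrafter_512_Interp",
    filename="model.ckpt",
    local_dir="checkpoints/dynamicrafter_512_interp_v1",
)
```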
### Dataset
We used [MorphBench](https://github.com/Kevin-thu/DiffMorpher) and [TC-Bench-I2V](https://github.com/weixi-feng/TC-Bench/). Please download the MorphBench dataset and place it under `EvalData/MorphBench/Animation` and `EvalData/MorphBench/Metamorphosis`. For the TC-Bench-I2V dataset, please follow the official tutorial and save the final frames to the `EvalData/TC-Bench/youtube_videos_frames` directory. Refer to the corresponding files in `Prompts` for the exact paths. The images we collected ourselves are already in the `EvalData` directory. A quick way to verify this layout is sketched below.
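This short sanity check (illustrative, not shipped with the repo) confirms that the directories described above are in place before running inference:

```python
# Illustrative check of the dataset layout described in the Dataset section.
from pathlib import Path

expected_dirs = [
    "EvalData/MorphBench/Animation",
    "EvalData/MorphBench/Metamorphosis",
    "EvalData/TC-Bench/youtube_videos_frames",
]
for d in expected_dirs:
    status = "ok" if Path(d).is_dir() else "MISSING"
    print(f"{status:7s} {d}")
```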
## 💫 Inference
To reproduce the experiments from the paper, run the following from the command line:
```bash
bash paper_exp.sh
```
## Results
| | | |
|----------|----------|----------|
| ![GIF 1](assets/Ours/00011-00000.gif) | ![GIF 2](assets/Ours/00020-00000.gif) | ![GIF 3](assets/Ours/00041-00000.gif) |
| ![GIF 4](assets/Ours/a_02.gif) | ![GIF 5](assets/Ours/a_03.gif) | ![GIF 6](assets/Ours/sun_set_0_sample0_2024-08-12-14-46-23.gif) |
| ![GIF 7](assets/Ours/arya_0_sample0_2024-08-05-11-39-47.gif) | ![GIF 8](assets/Ours/leo_0_sample0_2024-08-02-18-37-54.gif) | ![GIF 9](assets/Ours/leo_cat_0_sample0_2024-08-12-15-30-11.gif) |
| ![GIF 10](assets/Ours/man_spiderman_0_sample0_2024-08-12-15-28-54.gif) | ![GIF 11](assets/Ours/wave_0_sample0_2024-08-02-19-49-05.gif) | ![GIF 12](assets/Ours/whitehouse_church_0_sample0_2024-08-05-11-46-14.gif) |
| ![GIF 13](assets/Ours/wukong_0_sample0_2024-08-12-15-31-28.gif) | ![GIF 14](assets/Ours/clouds_fields_0_sample0_2024-08-12-15-27-37.gif) | ![GIF 15](assets/Ours/anakin_0_sample0_2024-08-05-11-10-40.gif) |

## 🖊️ Citation
Please kindly cite our paper if you use our code, data, models or results:
```bibtex
@inproceedings{zhang2024tvg,
  title     = {TVG: A Training-free Transition Video Generation Method with Diffusion Models},
  author    = {Rui Zhang and Chen Yaosen and Yuegen Liu and Wei Wang and Xuming Wen and Hongxia Wang},
  year      = {2024},
  booktitle = {arXiv}
}
```

## 💞 Acknowledgements
Thanks to the authors of [DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter); our code is based on their implementation.