https://github.com/dongzhuoyao/flowseq
An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"
- Host: GitHub
- URL: https://github.com/dongzhuoyao/flowseq
- Owner: dongzhuoyao
- License: MIT
- Created: 2023-06-25T10:51:23.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-04-05T09:35:06.000Z (almost 2 years ago)
- Last Synced: 2024-04-05T10:35:51.083Z (almost 2 years ago)
- Topics: diffusion-model, flow, flow-matching
- Language: Python
- Homepage: https://taohu.me/project_flowseq
- Size: 1.05 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# Flow Matching for Conditional Text Generation in a Few Sampling Steps (EACL 2024)
This repository contains the official PyTorch implementation of the EACL 2024 paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps".
[Model weights](https://huggingface.co/taohu/flowseq) · [Project page](https://taohu.me/project_flowseq) · [Paper](https://aclanthology.org/2024.eacl-short.33.pdf) · [Code](https://github.com/dongzhuoyao/flowseq) · [License](https://www.apache.org/licenses/LICENSE-2.0)

# Dataset
Download the datasets from the Google Drive folder below:
```
https://drive.google.com/drive/folders/1sU8CcOJE_aaaKLijBNzr-4y1i1YoZet2?usp=drive_link
```
# Training
Launch training with `torchrun`; `CUDA_VISIBLE_DEVICES` selects the GPUs and `--nproc-per-node` must match their count. Datasets are selected via Hydra overrides such as `data=qg`.
```bash
CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nnodes=1 --nproc-per-node=4 flow_train.py           # 4-GPU training, default dataset
CUDA_VISIBLE_DEVICES=0,2 torchrun --nnodes=1 --nproc-per-node=2 flow_train.py data=qg       # 2-GPU training, question generation
CUDA_VISIBLE_DEVICES=6 torchrun --nnodes=1 --nproc-per-node=1 flow_train.py data=qg         # single-GPU run
```
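For orientation, the sketch below shows a standard conditional flow matching training objective with a straight noise-to-data path (target velocity `x1 - x0`). It is illustrative only, not this repo's code; `velocity_model` is a hypothetical stand-in for the trained network.
```python
# Illustrative sketch of a standard flow-matching loss (NOT the repo's code).
# Convention: t=0 is Gaussian noise, t=1 is data; the path is linear.
import torch
import torch.nn.functional as F

def flow_matching_loss(velocity_model, x1):
    """x1: batch of data embeddings; velocity_model(x_t, t) predicts v."""
    x0 = torch.randn_like(x1)                      # noise endpoint
    t = torch.rand(x1.shape[0], device=x1.device)  # one time per sample
    t_ = t.view(-1, *([1] * (x1.dim() - 1)))       # broadcast t over x1's shape
    xt = (1 - t_) * x0 + t_ * x1                   # point on the straight path
    target = x1 - x0                               # constant velocity along the path
    return F.mse_loss(velocity_model(xt, t), target)
```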
# Evaluation
Download a pretrained checkpoint from [https://huggingface.co/taohu/flowseq/tree/main](https://huggingface.co/taohu/flowseq/tree/main); more checkpoints are coming soon.
```bash
python flow_sample_eval_s2s.py data=qqp_acc data.eval.is_debug=0 data.eval.model_path='qqp_ema_0.9999_070000.pt' data.eval.candidate_num=1 data.eval.ode_stepnum=1
```
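Here `data.eval.ode_stepnum` sets the number of sampling steps. Conceptually, few-step sampling integrates the learned velocity field from noise (t=0) to data (t=1) in a handful of fixed steps. A minimal Euler sketch follows; it is illustrative rather than the repo's actual sampler, and `velocity_model` is again a hypothetical stand-in:
```python
# Minimal fixed-step Euler sampler for a flow-matching model (illustrative).
import torch

@torch.no_grad()
def sample_few_steps(velocity_model, x0, num_steps=1):
    """Integrate dx/dt = v(x, t) from t=0 (noise) to t=1 (data)."""
    x, dt = x0, 1.0 / num_steps
    for i in range(num_steps):
        t = torch.full((x.shape[0],), i * dt, device=x.device)
        x = x + dt * velocity_model(x, t)  # one Euler step
    return x

# num_steps=1 corresponds to the single-step setting (ode_stepnum=1) above.
```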
# Environment Preparation
```bash
conda create -n flowseq python=3.10
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia
pip install torchdiffeq matplotlib h5py accelerate loguru blobfile ml_collections
pip install hydra-core wandb einops scikit-learn --upgrade
pip install transformers
pip install nltk bert_score datasets torchmetrics
```
Optional
```bash
pip install diffusers
```
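A quick sanity check that the CUDA build of PyTorch is active after installation (assumes the CUDA 11.8 setup above):
```python
import torch
print(torch.__version__, torch.version.cuda)         # expect an 11.8 build
print("CUDA available:", torch.cuda.is_available())
```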
# Common Issues
- **Inference with more than one step**
Our work focuses on single-step sampling: the anchor loss encourages the model to recover the original data in a single step. Multi-step sampling was implemented in a zigzag manner following Consistency Models, but this codebase does not include that implementation yet.
- **Batch size**
If your GPU memory is limited, try decreasing the batch size to 128~256 and disabling gradient accumulation; in my experience this still reaches fair performance (see the arithmetic sketch after this list).
- **Typical issues**
  - https://github.com/Shark-NLP/DiffuSeq/issues/5
  - https://github.com/Shark-NLP/DiffuSeq/issues/22
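On the batch-size point above: the effective batch size is the product of per-GPU batch, GPU count, and gradient-accumulation steps, so when disabling accumulation, raise the per-GPU batch (memory permitting) to stay in the suggested 128~256 range. The numbers below are illustrative, not the project's defaults:
```python
num_gpus, per_gpu_batch, grad_accum_steps = 4, 64, 1
print(num_gpus * per_gpu_batch * grad_accum_steps)  # effective batch size = 256
```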
# Citation
Please cite our paper if it or this code helps your research.
```
@inproceedings{HuEACL2024,
  title     = {Flow Matching for Conditional Text Generation in a Few Sampling Steps},
  author    = {Vincent Tao Hu and Di Wu and Yuki M Asano and Pascal Mettes and Basura Fernando and Björn Ommer and Cees G M Snoek},
  booktitle = {EACL},
  year      = {2024},
  date      = {2024-03-27},
}
```