An open API service indexing awesome lists of open source software.

https://github.com/opendatalab/skydiffusion

The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
https://github.com/opendatalab/skydiffusion

Last synced: 9 months ago
JSON representation

The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”

Awesome Lists containing this project

README

          



Ground-to-Aerial Image Synthesis with Diffusion Models and BEV Paradigm


arXiv
Project
Dataset
Python
Python



[Junyan Ye](https://yejy53.github.io/), Jun He, [Weijia Li](https://liweijia.github.io/), Zhutao Lv, Yi Lin, Jinhua Yu, Haote Yang, [Conghui He](https://conghui.github.io/)

Sun Yat-Sen University, Shanghai AI Laboratory, Sensetime Research

## 📰 News

* **[2024.11]** ⚡ We released the Dataset G2A-3. Check out the [Dataset](https://arxiv.org/abs/2408.01812).

* **[2024.11]** 😄 We released the training and testing code

* **[2024.08]** 🔥 We have released **Skydifussion: Ground-to-Aerial Image Synthesis with Diffusion Models and BEV Paradigm**. Check out the [paper](https://huggingface.co/datasets/Yejy53/G2A-3/tree/main). The code and dataset are coming soon

## 🏆 Contributions

**Main methods:** We introduce SkyDiffusion, a novel ground-to-aerial synthesis method leveraging diffusion models and BEV paradigm to generate realistic, consistent aerial images.

**Method innovation details:** We design a Curved-BEV method to transform street-view images into satellite views for domain alignment. It also includes "Multi-to-One" mapping strategy to enhance BEV perception range in densely occluded urban areas.

**Dataset Contribution:** We introduce Ground2Aerial-3, a new ground-to-aerial image synthesis dataset, featuring disaster scene aerial image, historical high-resolution satellite image, and low-altitude UAV image

**Experimental results:** We introduce Ground2Aerial-3, a new ground-to-aerial image synthesis dataset, featuring disaster scene aerial image, historical high-resolution satellite image, and low-altitude UAV image

## 🛠️ Requirements and Installation
Clone this repo to a local folder:
```bash
git clone https://github.com/SkyDiffusion/SkyDiffusion-code.git
cd SkyDiffusion-code
```

We provide an available conda environment named skydiffusion. You can configure the necessary Python environment for the experiments by following these steps:
```bash
conda create --name skydiffusion python=3.9
conda activate skydiffusion
conda env update --name skydiffusion --file environment.yaml
```

## 🤗 Data Preparation
The publicly available datasets used in this paper can be obtained from the following sources:

**Preparing G2A-3 Dataset.** The dataset can be downloaded [here](https://huggingface.co/datasets/Yejy53/G2A-3/tree/main).

**Preparing CVUSA Dataset.** The dataset can be downloaded [here](https://mvrl.cse.wustl.edu/datasets/cvusa).

**Preparing CVACT Dataset.** The dataset can be downloaded [here](https://github.com/Liumouliu/OriCNN).

**Preparing VIGOR Dataset.** The dataset can be downloaded [here](https://github.com/Jeff-Zilence/VIGOR/tree/main).

After unzipping the datasets, prepare the training and testing data as discussed in our paper.

## 🚀 Quick Start
Generating Aerial Images Using Our Pre-trained Model. Use the provided pre-trained model to generate aerial images according to the following code:
```bash
python test.py \
--num_gpus=8 \
--config_path=./models/lacldm_v15.yaml \
--image_width=512 --image_height=512 \
--result_dir= [Output folder] \
--model_path=./ckpt/CVACT_SkyDiffusion.ckpt \
--data_file_path=./examples/examples.csv \
--dataset_name=CVACT
```

## BibTeX 🙏

If you have any questions, be free to contact with me!
```
@article{ye2024skydiffusion,
title={SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm},
author={Ye, Junyan and He, Jun and Li, Weijia and Lv, Zhutao and Yu, Jinhua and Yang, Haote and He, Conghui},
journal={arXiv preprint arXiv:2408.01812},
year={2024}
}

## License

This project is licensed under the Apache 2.0 License. See the [LICENSE](LICENSE) file for details.