https://github.com/opendatalab/skydiffusion
The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
https://github.com/opendatalab/skydiffusion
Last synced: 9 months ago
JSON representation
The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
- Host: GitHub
- URL: https://github.com/opendatalab/skydiffusion
- Owner: opendatalab
- License: apache-2.0
- Created: 2024-08-07T10:46:19.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-18T09:32:30.000Z (10 months ago)
- Last Synced: 2025-03-27T12:07:39.014Z (10 months ago)
- Language: Python
- Homepage:
- Size: 256 MB
- Stars: 46
- Watchers: 1
- Forks: 2
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[Junyan Ye](https://yejy53.github.io/), Jun He, [Weijia Li](https://liweijia.github.io/), Zhutao Lv, Yi Lin, Jinhua Yu, Haote Yang, [Conghui He](https://conghui.github.io/)
Sun Yat-Sen University, Shanghai AI Laboratory, Sensetime Research

## 📰 News
* **[2024.11]** ⚡ We released the Dataset G2A-3. Check out the [Dataset](https://arxiv.org/abs/2408.01812).
* **[2024.11]** 😄 We released the training and testing code
* **[2024.08]** 🔥 We have released **Skydifussion: Ground-to-Aerial Image Synthesis with Diffusion Models and BEV Paradigm**. Check out the [paper](https://huggingface.co/datasets/Yejy53/G2A-3/tree/main). The code and dataset are coming soon
## 🏆 Contributions
**Main methods:** We introduce SkyDiffusion, a novel ground-to-aerial synthesis method leveraging diffusion models and BEV paradigm to generate realistic, consistent aerial images.
**Method innovation details:** We design a Curved-BEV method to transform street-view images into satellite views for domain alignment. It also includes "Multi-to-One" mapping strategy to enhance BEV perception range in densely occluded urban areas.
**Dataset Contribution:** We introduce Ground2Aerial-3, a new ground-to-aerial image synthesis dataset, featuring disaster scene aerial image, historical high-resolution satellite image, and low-altitude UAV image
**Experimental results:** We introduce Ground2Aerial-3, a new ground-to-aerial image synthesis dataset, featuring disaster scene aerial image, historical high-resolution satellite image, and low-altitude UAV image
## 🛠️ Requirements and Installation
Clone this repo to a local folder:
```bash
git clone https://github.com/SkyDiffusion/SkyDiffusion-code.git
cd SkyDiffusion-code
```
We provide an available conda environment named skydiffusion. You can configure the necessary Python environment for the experiments by following these steps:
```bash
conda create --name skydiffusion python=3.9
conda activate skydiffusion
conda env update --name skydiffusion --file environment.yaml
```
## 🤗 Data Preparation
The publicly available datasets used in this paper can be obtained from the following sources:
**Preparing G2A-3 Dataset.** The dataset can be downloaded [here](https://huggingface.co/datasets/Yejy53/G2A-3/tree/main).
**Preparing CVUSA Dataset.** The dataset can be downloaded [here](https://mvrl.cse.wustl.edu/datasets/cvusa).
**Preparing CVACT Dataset.** The dataset can be downloaded [here](https://github.com/Liumouliu/OriCNN).
**Preparing VIGOR Dataset.** The dataset can be downloaded [here](https://github.com/Jeff-Zilence/VIGOR/tree/main).
After unzipping the datasets, prepare the training and testing data as discussed in our paper.
## 🚀 Quick Start
Generating Aerial Images Using Our Pre-trained Model. Use the provided pre-trained model to generate aerial images according to the following code:
```bash
python test.py \
--num_gpus=8 \
--config_path=./models/lacldm_v15.yaml \
--image_width=512 --image_height=512 \
--result_dir= [Output folder] \
--model_path=./ckpt/CVACT_SkyDiffusion.ckpt \
--data_file_path=./examples/examples.csv \
--dataset_name=CVACT
```
## BibTeX 🙏
If you have any questions, be free to contact with me!
```
@article{ye2024skydiffusion,
title={SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm},
author={Ye, Junyan and He, Jun and Li, Weijia and Lv, Zhutao and Yu, Jinhua and Yang, Haote and He, Conghui},
journal={arXiv preprint arXiv:2408.01812},
year={2024}
}
## License
This project is licensed under the Apache 2.0 License. See the [LICENSE](LICENSE) file for details.