https://github.com/hustvl/VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
- Host: GitHub
- URL: https://github.com/hustvl/VAD
- Owner: hustvl
- License: apache-2.0
- Created: 2023-02-16T08:41:37.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-03-03T03:06:03.000Z (7 months ago)
- Last Synced: 2025-03-16T22:07:27.309Z (7 months ago)
- Topics: autonomous-driving, end-to-end
- Language: Python
- Homepage: https://arxiv.org/abs/2303.12077
- Size: 4.45 MB
- Stars: 884
- Watchers: 30
- Forks: 99
- Open Issues: 67
- Metadata Files:
- Readme: README.md
- License: LICENSE
README
## VAD v1 & v2
[project page](https://hgao-cv.github.io/VADv2/)
https://user-images.githubusercontent.com/45144254/229673708-648e8da5-4c70-4346-9da2-423447d1ecde.mp4
https://github.com/hustvl/VAD/assets/45144254/153b9bf0-5159-46b5-9fab-573baf5c6159
> [**VAD: Vectorized Scene Representation for Efficient Autonomous Driving**](https://arxiv.org/abs/2303.12077)
>
> [Bo Jiang](https://github.com/rb93dett)<sup>1\*</sup>, [Shaoyu Chen](https://scholar.google.com/citations?user=PIeNN2gAAAAJ&hl=en&oi=sra)<sup>1\*</sup>, Qing Xu<sup>2</sup>, [Bencheng Liao](https://github.com/LegendBC)<sup>1</sup>, Jiajie Chen<sup>2</sup>, [Helong Zhou](https://scholar.google.com/citations?user=wkhOMMwAAAAJ&hl=en&oi=ao)<sup>2</sup>, [Qian Zhang](https://scholar.google.com/citations?user=pCY-bikAAAAJ&hl=zh-CN)<sup>2</sup>, [Wenyu Liu](http://eic.hust.edu.cn/professor/liuwenyu/)<sup>1</sup>, [Chang Huang](https://scholar.google.com/citations?user=IyyEKyIAAAAJ&hl=zh-CN)<sup>2</sup>, [Xinggang Wang](https://xwcv.github.io/)<sup>1,†</sup>
>
> <sup>1</sup> Huazhong University of Science and Technology, <sup>2</sup> Horizon Robotics
>
> \*: equal contribution, †: corresponding author.
>
> [arXiv Paper](https://arxiv.org/abs/2303.12077), ICCV 2023

## News
* **`27 Feb, 2025`:** Check out our latest work, [DiffusionDrive](https://github.com/hustvl/DiffusionDrive), accepted to CVPR 2025! This study explores multi-modal end-to-end driving using diffusion models for real-time and real-world applications.
* **`19 Feb, 2025`:** Check out our new work [RAD](https://hgao-cv.github.io/RAD) 🥰, end-to-end autonomous driving with large-scale 3DGS-based Reinforcement Learning post-training.
* **`30 Oct, 2024`:** Check out our new work [Senna](https://github.com/hustvl/Senna) 🥰, which combines VAD/VADv2 with large vision-language models to achieve more accurate, robust, and generalizable autonomous driving planning.
* **`20 Sep, 2024`:** The core code of VADv2 (config and model) is available in the `VADv2` folder. It is easy to integrate into the VADv1 framework for training and inference.
* **`17 June, 2024`:** CARLA implementation of VADv1 is available on [Bench2Drive](https://github.com/Thinklab-SJTU/Bench2Drive?tab=readme-ov-file).
* **`20 Feb, 2024`:** VADv2 is available on arXiv: [paper](https://arxiv.org/pdf/2402.13243), [project page](https://hgao-cv.github.io/VADv2/).
* **`1 Aug, 2023`:** Code & models are released!
* **`14 July, 2023`:** VAD is accepted to ICCV 2023 🎉! Code and models will be open-sourced soon!
* **`21 Mar, 2023`:** We release the VAD paper on [arXiv](https://arxiv.org/abs/2303.12077). Code/Models are coming soon. Please stay tuned! ☕️

## Introduction
> VAD is a vectorized paradigm for end-to-end autonomous driving.
- We propose VAD, an end-to-end unified vectorized paradigm for autonomous driving. VAD models the driving scene as a fully vectorized representation, getting rid of the computationally intensive dense rasterized representation and hand-designed post-processing steps (see the toy sketch after this list).
- VAD implicitly and explicitly utilizes the vectorized scene information to improve planning safety, via query interaction and vectorized planning constraints.
- VAD achieves SOTA end-to-end planning performance, outperforming previous methods by a large margin. Moreover, thanks to the vectorized scene representation and our concise model design, VAD greatly improves inference speed, which is critical for the real-world deployment of an autonomous driving system.
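To make the contrast concrete, here is a minimal, illustrative sketch of a fully vectorized scene versus a dense rasterized BEV grid, together with a toy vectorized planning check. It is not code from this repository; every name, shape, and threshold is an assumption made for illustration.

```python
# Toy illustration (not this repository's code): a fully vectorized scene versus a
# dense rasterized BEV grid, plus a toy "vectorized planning constraint".
import numpy as np

# Map elements as polylines of 2D points (meters), instead of a rasterized semantic grid.
lane_divider = np.array([[0.0, 1.8], [10.0, 1.8], [20.0, 1.9]])
road_boundary = np.array([[0.0, 4.5], [10.0, 4.6], [20.0, 4.6]])

# An agent as a vector: current position plus a predicted future displacement.
agent_pos = np.array([12.0, 0.5])
agent_motion = np.array([5.0, 0.0])

# A dense rasterized alternative would carry one value per cell per semantic channel.
bev_raster = np.zeros((2, 200, 200), dtype=np.float32)  # 80,000 cells for just 2 channels
vector_floats = lane_divider.size + road_boundary.size + agent_pos.size + agent_motion.size
print("vectorized scene floats:", vector_floats, "vs. raster cells:", bev_raster.size)

# Planned ego trajectory: a short sequence of future waypoints.
ego_plan = np.array([[2.0, 0.0], [4.0, 0.3], [6.0, 0.8], [8.0, 1.4]])

# Toy vectorized planning check: keep every planned waypoint at least a safety margin
# away from the boundary polyline (crude point-to-point distance, for illustration only).
margin = 0.5
dists = np.linalg.norm(ego_plan[:, None, :] - road_boundary[None, :, :], axis=-1).min(axis=1)
print("min distance to boundary per waypoint:", dists)
print("violates safety margin:", bool((dists < margin).any()))
```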
## Models

| Method | Backbone | avg. L2 (m) | avg. Col. (%) | FPS | Config | Download |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| VAD-Tiny | R50 | 0.78 | 0.38 | 16.8 | [config](projects/configs/VAD/VAD_tiny_stage_2.py) | [model](https://drive.google.com/file/d/1KgCC_wFqPH0CQqdr6Pp2smBX5ARPaqne/view?usp=sharing) |
| VAD-Base | R50 | 0.72 | 0.22 | 4.5 | [config](projects/configs/VAD/VAD_base_stage_2.py) | [model](https://drive.google.com/file/d/1FLX-4LVm4z-RskghFbxGuYlcYOQmV5bS/view?usp=sharing) |
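The config paths in the table point into this repository, and the checkpoints are standard PyTorch/mmcv-style files. Below is a minimal inspection sketch, assuming the mmcv 1.x / mmdet3d-style toolchain that VAD builds on (see docs/install.md for the actual environment and docs/train_eval.md for the official commands); the local checkpoint filename is hypothetical.

```python
# Minimal inspection sketch (assumes the mmcv 1.x / mmdet3d-style toolchain VAD builds on).
import torch
from mmcv import Config

# Config path taken from the table above; it can be parsed without building the model.
cfg = Config.fromfile("projects/configs/VAD/VAD_tiny_stage_2.py")
print(cfg.model.get("type"))  # registered model class name defined by the project plugin

# "VAD_tiny.pth" is a hypothetical local filename for the downloaded checkpoint.
ckpt = torch.load("VAD_tiny.pth", map_location="cpu")
print(sorted(ckpt.keys()))  # mmcv-style checkpoints typically contain 'state_dict' and 'meta'
```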
## Results

- Open-loop planning results on [nuScenes](https://github.com/nutonomy/nuscenes-devkit). See the [paper](https://arxiv.org/abs/2303.12077) for more details.

| Method | L2 (m) 1s | L2 (m) 2s | L2 (m) 3s | Col. (%) 1s | Col. (%) 2s | Col. (%) 3s | FPS |
| :---: | :---: | :---: | :---: | :---:| :---: | :---: | :---: |
| ST-P3 | 1.33 | 2.11 | 2.90 | 0.23 | 0.62 | 1.27 | 1.6 |
| UniAD | 0.48 | 0.96 | 1.65 | **0.05** | 0.17 | 0.71 | 1.8 |
| VAD-Tiny | 0.46 | 0.76 | 1.12 | 0.21 | 0.35 | 0.58 | **16.8** |
| VAD-Base | **0.41** | **0.70** | **1.05** | **0.07** | **0.17** | **0.41** | 4.5 |
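For reference, the L2 columns above measure the displacement (in meters) between the planned and ground-truth ego waypoints at each horizon. The following is an illustrative sketch of that computation, not the repository's evaluation code; the collision metric, which additionally checks overlap with other agents' boxes, is omitted.

```python
# Illustrative sketch of the open-loop L2 metric; not the repository's evaluation code.
import numpy as np

def l2_per_step(pred_traj, gt_traj):
    """L2 distance (m) between planned and ground-truth ego waypoints at each step.

    Both arrays have shape (T, 2): T future BEV (x, y) waypoints. In the common
    nuScenes protocol the plan covers 3 s at 2 Hz, so T = 6 and indices 1 / 3 / 5
    correspond to the 1 s / 2 s / 3 s horizons reported above.
    """
    return np.linalg.norm(pred_traj - gt_traj, axis=-1)

pred = np.array([[1.0, 0.0], [2.1, 0.1], [3.0, 0.2], [4.2, 0.2], [5.0, 0.4], [6.3, 0.5]])
gt = np.array([[1.0, 0.0], [2.0, 0.0], [3.0, 0.0], [4.0, 0.0], [5.0, 0.0], [6.0, 0.0]])
err = l2_per_step(pred, gt)
print(err[1], err[3], err[5])  # L2 at 1 s / 2 s / 3 s for this single sample
```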
- Closed-loop simulation results on [CARLA](https://github.com/carla-simulator/carla).

| Method | Town05 Short DS | Town05 Short RC | Town05 Long DS | Town05 Long RC |
| :---: | :---: | :---: | :---: | :---:|
| CILRS | 7.47 | 13.40 | 3.68 | 7.19 |
| LBC | 30.97 | 55.01 | 7.05 | 32.09 |
| Transfuser\* | 54.52 | 78.41 | 33.15 | 56.36 |
| ST-P3 | 55.14 | 86.74 | 11.45 | 83.15 |
| VAD-Base | **64.29** | **87.26** | **30.31** | 75.20 |

> \*: LiDAR-based method. DS: Driving Score, RC: Route Completion.
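As a rough guide to reading the table, the CARLA leaderboard defines the Driving Score as the Route Completion scaled by an infraction penalty (the product of per-infraction multipliers). Below is a toy illustration of that relation; the penalty values are hypothetical.

```python
# Toy illustration of how CARLA's Driving Score (DS) relates to Route Completion (RC):
# per the CARLA leaderboard, DS = RC * infraction penalty, where the penalty is the
# product of per-infraction multipliers. The multiplier values below are hypothetical.
def driving_score(route_completion_pct, infraction_multipliers):
    penalty = 1.0
    for m in infraction_multipliers:
        penalty *= m
    return route_completion_pct * penalty

# e.g. 87.26% route completion with two infractions penalized at 0.7 each (hypothetical)
print(driving_score(87.26, [0.7, 0.7]))  # ~42.8
```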
## Getting Started
- [Installation](docs/install.md)
- [Prepare Dataset](docs/prepare_dataset.md)
- [Train and Eval](docs/train_eval.md)
- [Visualization](docs/visualization.md)

## Catalog
- [x] Code & Checkpoints Release
- [x] Initialization

## Contact
If you have any questions or suggestions about this repo, please feel free to contact us (bjiang@hust.edu.cn, outsidercsy@gmail.com).

## Citation
If you find VAD useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entries.

```BibTeX
@article{jiang2023vad,
title={VAD: Vectorized Scene Representation for Efficient Autonomous Driving},
author={Jiang, Bo and Chen, Shaoyu and Xu, Qing and Liao, Bencheng and Chen, Jiajie and Zhou, Helong and Zhang, Qian and Liu, Wenyu and Huang, Chang and Wang, Xinggang},
journal={ICCV},
year={2023}
}

@article{chen2024vadv2,
title={VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning},
author={Chen, Shaoyu and Jiang, Bo and Gao, Hao and Liao, Bencheng and Xu, Qing and Zhang, Qian and Huang, Chang and Liu, Wenyu and Wang, Xinggang},
journal={arXiv preprint arXiv:2402.13243},
year={2024}
}
```

## License

All code in this repository is under the [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0).

## Acknowledgement
VAD is based on the following projects: [mmdet3d](https://github.com/open-mmlab/mmdetection3d), [detr3d](https://github.com/WangYueFt/detr3d), [BEVFormer](https://github.com/fundamentalvision/BEVFormer) and [MapTR](https://github.com/hustvl/MapTR). Many thanks for their excellent contributions to the community.