https://github.com/wudongming97/Prompt4Driving

Last synced: 3 months ago
JSON representation

Host: GitHub
URL: https://github.com/wudongming97/Prompt4Driving
Owner: wudongming97
License: apache-2.0
Created: 2023-08-09T08:17:54.000Z (almost 2 years ago)
Default Branch: master
Last Pushed: 2024-06-27T05:40:52.000Z (12 months ago)
Last Synced: 2024-08-01T05:14:55.028Z (11 months ago)
Language: Python
Size: 286 KB
Stars: 106
Watchers: 12
Forks: 0
Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

Awesome-LLM4AD - official

README

        






Language Prompt for Autonomous Driving







> **[Language Prompt for Autonomous Driving](https://arxiv.org/abs/2309.04379)**

>

> [Dongming Wu*](https://wudongming97.github.io/), [Wencheng Han*](https://wencheng256.github.io/), [Tiancai Wang](https://scholar.google.com/citations?user=YI0sRroAAAAJ&hl=zh-CN), [Yingfei Liu](https://scholar.google.com/citations?user=pF9KA1sAAAAJ&hl=zh-CN&oi=ao), [Xiangyu Zhang](https://scholar.google.com/citations?user=yuB-cfoAAAAJ&hl=zh-CN), [Jianbing Shen](https://shenjianbing.github.io/)

## :fire: Introduction



This is the official implementation of **Language Prompt for Autonomous Driving**.

* We propose a new large-scale language prompt set for driving scenes, named NuPrompt.  As far as we know, it is the first dataset specializing in multiple 3D objects of interest from video domain. 

* We construct a new prompt-based driving perceiving task, which requires using a language prompt as a semantic cue to predict object trajectories.

* We develop a simple end-to-end baseline model, called PromptTrack, which effectively fuses cross-modal features in a newly built prompt reasoning branch to predict referent objects, showing impressive performance.

## :boom: News

- [2024.12.10] This work is accepted by AAAI 2025. A new version of the paper will be released soon.

- [2024.06.27] Data and code are released. Welcome to try it!

- [2023.09.11] Our paper is released at [arXiv](https://arxiv.org/abs/2309.04379).

## :star: Benchmark

We expand nuScenes dataset with annotating language prompts, named NuPrompt.

It is a large-scale dataset for language prompt in driving scenes, which contains 40,147 language prompts for 3D objects.

Thanks to nuScenes, our descriptions are closed to real-driving nature and complexity, covering a 3D, multi-view, and multi-frame space.

The data can be downloaded from [NuPrompt](https://github.com/wudongming97/Prompt4Driving/releases/download/v1.0/nuprompt_v1.0.zip).

## :hammer: Model

Our model is built upon [PF-Track](https://github.com/TRI-ML/PF-Track). 

Please refer to [data.md](./docs/data.md) for preparing data and pre-trained models.

Please refer to [environment.md](./docs/environment.md) for environment installation.

Please refer to [training_inference.md](./docs/training_inference.md) for training and evaluation.

## :rocket: Results

|   Method    | AMOTA | AMOTP | RECALL |                                              Model                                               |                                                     Config |

|:-----------:|:-----:|:-----:|:------:|:------------------------------------------------------------------------------------------------:|-----------------------------------------------------------:|

| PromptTrack | 0.200 | 1.572 | 32.5%  | [model](https://github.com/wudongming97/Prompt4Driving/releases/download/v1.0/f3_prompttrack_e12.pth) | [config](./projects/configs/prompttrack/f3_prompttrack.py) |

## :point_right: Citation

If you find our work useful in your research, please consider citing them.

```

@article{wu2023language,

  title={Language Prompt for Autonomous Driving},

  author={Wu, Dongming and Han, Wencheng and Wang, Tiancai and Liu, Yingfei and Zhang, Xiangyu and Shen, Jianbing},

  journal={arXiv preprint},

  year={2023}

}

```

```

@inproceedings{wu2023referring,

  title={Referring multi-object tracking},

  author={Wu, Dongming and Han, Wencheng and Wang, Tiancai and Dong, Xingping and Zhang, Xiangyu and Shen, Jianbing},

  booktitle={CVPR},

  year={2023}

}

```

## :heart: Acknowledgements

We thank the authors that open the following projects. 

- [MMDetection3d](https://github.com/open-mmlab/mmdetection3d)

- [nuScenes](https://github.com/nutonomy/nuscenes-devkit)

- [PF-Track](https://github.com/TRI-ML/PF-Track)

- [PETR](https://github.com/megvii-research/PETR)

- [MOTR](https://github.com/megvii-research/MOTR)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/wudongming97/Prompt4Driving

Awesome Lists containing this project

README

Language Prompt for Autonomous Driving