https://github.com/Robertwyq/Drivingdojo
[NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
https://github.com/Robertwyq/Drivingdojo
Last synced: 3 months ago
JSON representation
[NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
- Host: GitHub
- URL: https://github.com/Robertwyq/Drivingdojo
- Owner: Robertwyq
- License: gpl-3.0
- Created: 2024-07-18T05:35:33.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-12-05T10:58:10.000Z (over 1 year ago)
- Last Synced: 2024-12-05T11:39:30.095Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 12.4 MB
- Stars: 33
- Watchers: 2
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-llm4tr - DrivingDojo - agent interplay, and driving knowledge | Training and action instruction following benchmark for driving world models. | (π Summary of Language-Enhanced Datasets)
README
# Drivingdojo
The official implementation of the paper:
**DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model**
> [Yuqi Wang](https://robertwyq.github.io/), Ke Cheng, [Jiawei He](https://jiaweihe.com/), Qitai Wang, Hengchen Dai, Yuntao Chen, Fei Xia, and [Zhaoxiang Zhang](https://zhaoxiangzhang.net/)
>
> π [paper](https://arxiv.org/abs/2410.10738) π¬ [video demos](https://drivingdojo.github.io/) π [dataset](https://huggingface.co/datasets/Yuqi1997/DrivingDojo) π’[zhihu](https://zhuanlan.zhihu.com/p/1551246719)
## π News
- [2024-12] We release the code for computing the AIF metric.
- [2024-11] The code is support finetuned Stable Video Diffusion on multiple driving dataset.
- [2024-10] Our dataset DrivingDojo is released on **Huggingface**.
- [2024-9] Our paper is accepted by **NeurIPS 2024**.
## πΉοΈ Getting Started
Our code is based on the open-source project diffusers. The source code is organized in [diffusers/examples/world_model](diffusers/examples/world_model)
The following table shows the supported finetuned methods and datasets in this repo, and we will update it continuously.
| Method / Dataset | DrivingDojo | nuScenes | Waymo | OpenDV2K |
|-----------------------|-------|------------|------|-------------|
| Stable Video Diffusion | β | β | β | |
| Action-Conditioned Video Generation | β | | | |
### π¦ Installation
- [Installation](docs/INSTALL.md)
- [Dataset](docs/DATASET.md)
- [Training](docs/TRAINING.md)
- [Inference](docs/INFERENCE.md)
- [AIF Metric](docs/AIF.md)
## πCitation
if you find our work useful in your research, please consider citing:
```bibtex
@article{wang2024drivingdojo,
title={DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model},
author={Wang, Yuqi and Cheng, Ke and He, Jiawei and Wang, Qitai and Dai, Hengchen and Chen, Yuntao and Xia, Fei and Zhang, Zhaoxiang},
journal={arXiv preprint arXiv:2410.10738},
year={2024}
}
```
## Acknowledgement
Many thanks to the following open-source projects:
* [diffusers](https://github.com/huggingface/diffusers)