Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/qiantianwen/NuScenes-QA
[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
https://github.com/qiantianwen/NuScenes-QA
autonomous-driving vision-language visual-question-answering
Last synced: about 1 month ago
JSON representation
[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
- Host: GitHub
- URL: https://github.com/qiantianwen/NuScenes-QA
- Owner: qiantianwen
- License: mit
- Created: 2023-05-24T06:28:17.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-10T14:03:35.000Z (7 months ago)
- Last Synced: 2024-02-10T16:32:34.380Z (5 months ago)
- Topics: autonomous-driving, vision-language, visual-question-answering
- Homepage:
- Size: 1.55 MB
- Stars: 101
- Watchers: 14
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Lists
- PJLab-ADG-awesome-knowledge-driven-AD - NuScenes-QA
- awesome-knowledge-driven-AD - NuScenes-QA
- Awesome-LLM4AD - NuScenes-QA
README
# [AAAI 2024] NuScenes-QA
Official repository for the AAAI 2024 paper **[NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario](https://arxiv.org/pdf/2305.14836.pdf)**.
![DataConstruction](docs/data_construction.png)
## :fire: News
- `2023.12.09` Our paper is accepted by AAAI 2024!
- `2023.09.04` Our NuScenes-QA dataset v1.0 released.## :hourglass_flowing_sand: To Do
- [x] Release question & anwswer data
- [ ] Release visual feature
- [ ] Release training and testing code## :running: Getting Started
### Data Preparation
We have released our question-answer annotations, please download it from [HERE](https://drive.google.com/drive/folders/1jIkICT23wZWZYPrWCa0x-ubjpClSzOuU?usp=sharing).
For the visual data, you can download the origin nuScenes dataset from [HERE](https://www.nuscenes.org/download), and prepare the data refer to this [LINK](https://mmdetection3d.readthedocs.io/en/v0.16.0/datasets/nuscenes_det.html). As an alternative, you can also download our provided object-level features extracted using pre-trained detection models from [HERE]() (to be released soon).
### Training & Testing
Todo.## :star: Others
If you have any questions about the dataset and its generation or the object-level feature extraction, feel free to cantact me with `[email protected]`.## :book: Citation
If you find our paper and project useful, please consider citing:
```bibtex
@article{qian2023nuscenes,
title={NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario},
author={Qian, Tianwen and Chen, Jingjing and Zhuo, Linhai and Jiao, Yang and Jiang, Yu-Gang},
journal={arXiv preprint arXiv:2305.14836},
year={2023}
}
```