Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/zhang-tao-whu/dvis
DVIS: Decoupled Video Instance Segmentation Framework
https://github.com/zhang-tao-whu/dvis
offline online ovis segmentation video-instance-segmentation video-panoptic-segmentation
Last synced: about 1 month ago
JSON representation
DVIS: Decoupled Video Instance Segmentation Framework
- Host: GitHub
- URL: https://github.com/zhang-tao-whu/dvis
- Owner: zhang-tao-whu
- License: mit
- Created: 2023-05-05T05:22:50.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-04-02T02:42:02.000Z (9 months ago)
- Last Synced: 2024-04-23T00:22:30.976Z (8 months ago)
- Topics: offline, online, ovis, segmentation, video-instance-segmentation, video-panoptic-segmentation
- Language: Python
- Homepage:
- Size: 188 KB
- Stars: 114
- Watchers: 4
- Forks: 5
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# [DVIS: Decoupled Video Instance Segmentation Framework](https://arxiv.org/abs/2306.03413)
[Tao Zhang](https://scholar.google.com/citations?user=3xu4a5oAAAAJ&hl=zh-CN), XingYe Tian, [Yu Wu](https://scholar.google.com/citations?hl=zh-CN&user=23SZHUwAAAAJ), [ShunPing Ji](https://scholar.google.com/citations?user=FjoRmF4AAAAJ&hl=zh-CN), Xuebo Wang, Yuan Zhang, Pengfei Wan[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-ovis-1)](https://paperswithcode.com/sota/video-instance-segmentation-on-ovis-1?p=dvis-decoupled-video-instance-segmentation)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-panoptic-segmentation-on-vipseg)](https://paperswithcode.com/sota/video-panoptic-segmentation-on-vipseg?p=dvis-decoupled-video-instance-segmentation)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-youtube-vis-3)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-3?p=dvis-decoupled-video-instance-segmentation)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-youtube-vis-1)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-1?p=dvis-decoupled-video-instance-segmentation)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-youtube-vis-2)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-2?p=dvis-decoupled-video-instance-segmentation)
## News
- DVIS-DAQ achieves 57.1 AP on the OVIS dataset and also sets a new SOTA performance on YTVIS19/21 and VIPSeg. The code will be released in [DVIS-DAQ](https://github.com/zhang-tao-whu/DVIS_Plus). The paper is available at [DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries](https://arxiv.org/pdf/2404.00086.pdf) and the project page can be found in [project page](https://zhang-tao-whu.github.io/projects/DVIS_DAQ/).
- The improved version of DVIS, DVIS++, is now available. Please refer to [DVIS++](https://github.com/zhang-tao-whu/DVIS_Plus) for more information. DVIS++ achieves 41.2 AP, 56.7 AP, and 52.0 AP, as well as 48.6 mIOU and 44.2 VPQ in OVIS, YTVIS19, YTVIS21, VSPW, and VIPSeg, respectively. Additionally, OV-DVIS++ supports open-vocabulary universal video segmentation.
- DVIS achieved **1st place** in the VPS Track of the PVUW challenge at CVPR 2023. `2023.5.25`
- DVIS has been accepted by ICCV 2023. `2023.7.15`
- DVIS achieved **1st place** in the VIS Track of the 5th LSVOS challenge at ICCV 2023. `2023.8.15`## Features
- DVIS is a universal video segmentation framework that supports VIS, VPS and VSS.
- DVIS can run in both online and offline modes.
- DVIS achieved SOTA performance on YTVIS, OVIS, VIPSeg and VSPW datasets.
- DVIS can complete training and inference on GPUs with only 11G memory.## Demos
## Installation
See [Installation Instructions](INSTALL.md).
## Getting Started
See [Preparing Datasets for DVIS](datasets/README.md).
See [Getting Started with DVIS](GETTING_STARTED.md).
## Model Zoo
Trained models are available for download in the [DVIS Model Zoo](MODEL_ZOO.md).
```BibTeX
@article{DVIS,
title={DVIS: Decoupled Video Instance Segmentation Framework},
author={Zhang, Tao and Tian, Xingye and Wu, Yu and Ji, Shunping and Wang, Xuebo and Zhang, Yuan and Wan, Pengfei},
journal={arXiv preprint arXiv:2306.03413},
year={2023}
}@article{zhang2023vis1st,
title={1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation},
author={Zhang, Tao and Tian, Xingye and Zhou, Yikang and Wu, Yu and Ji, Shunping and Yan, Cilin and Wang, Xuebo and Tao, Xin and Zhang, Yuan and Wan, Pengfei},
journal={arXiv preprint arXiv:2308.14392},
year={2023}
}@article{zhang2023vps1st,
title={1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation},
author={Zhang, Tao and Tian, Xingye and Wei, Haoran and Wu, Yu and Ji, Shunping and Wang, Xuebo and Zhang, Yuan and Wan, Pengfei},
journal={arXiv preprint arXiv:2306.04091},
year={2023}
}
```## Acknowledgement
This repo is largely based on [Mask2Former](https://github.com/facebookresearch/Mask2Former), [MinVIS](https://github.com/NVlabs/MinVIS) and [VITA](https://github.com/sukjunhwang/VITA).
Thanks for their excellent works.