Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/zhang-tao-whu/dvis

DVIS: Decoupled Video Instance Segmentation Framework
https://github.com/zhang-tao-whu/dvis

offline online ovis segmentation video-instance-segmentation video-panoptic-segmentation

Last synced: about 1 month ago
JSON representation

DVIS: Decoupled Video Instance Segmentation Framework

Host: GitHub
URL: https://github.com/zhang-tao-whu/dvis
Owner: zhang-tao-whu
License: mit
Created: 2023-05-05T05:22:50.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-04-02T02:42:02.000Z (9 months ago)
Last Synced: 2024-04-23T00:22:30.976Z (8 months ago)
Topics: offline, online, ovis, segmentation, video-instance-segmentation, video-panoptic-segmentation
Language: Python
Homepage:
Size: 188 KB
Stars: 114
Watchers: 4
Forks: 5
Open Issues: 7
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        


# [DVIS: Decoupled Video Instance Segmentation Framework](https://arxiv.org/abs/2306.03413)

[Tao Zhang](https://scholar.google.com/citations?user=3xu4a5oAAAAJ&hl=zh-CN), XingYe Tian, [Yu Wu](https://scholar.google.com/citations?hl=zh-CN&user=23SZHUwAAAAJ), [ShunPing Ji](https://scholar.google.com/citations?user=FjoRmF4AAAAJ&hl=zh-CN), Xuebo Wang, Yuan Zhang, Pengfei Wan

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-ovis-1)](https://paperswithcode.com/sota/video-instance-segmentation-on-ovis-1?p=dvis-decoupled-video-instance-segmentation)

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-panoptic-segmentation-on-vipseg)](https://paperswithcode.com/sota/video-panoptic-segmentation-on-vipseg?p=dvis-decoupled-video-instance-segmentation)

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-youtube-vis-3)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-3?p=dvis-decoupled-video-instance-segmentation)

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-youtube-vis-1)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-1?p=dvis-decoupled-video-instance-segmentation)

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dvis-decoupled-video-instance-segmentation/video-instance-segmentation-on-youtube-vis-2)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-2?p=dvis-decoupled-video-instance-segmentation)





## News

- DVIS-DAQ achieves 57.1 AP on the OVIS dataset and also sets a new SOTA performance on YTVIS19/21 and VIPSeg. The code will be released in [DVIS-DAQ](https://github.com/zhang-tao-whu/DVIS_Plus). The paper is available at [DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries](https://arxiv.org/pdf/2404.00086.pdf) and the project page can be found in [project page](https://zhang-tao-whu.github.io/projects/DVIS_DAQ/).

- The improved version of DVIS, DVIS++, is now available. Please refer to [DVIS++](https://github.com/zhang-tao-whu/DVIS_Plus) for more information. DVIS++ achieves 41.2 AP, 56.7 AP, and 52.0 AP, as well as 48.6 mIOU and 44.2 VPQ in OVIS, YTVIS19, YTVIS21, VSPW, and VIPSeg, respectively. Additionally, OV-DVIS++ supports open-vocabulary universal video segmentation.

- DVIS achieved **1st place** in the VPS Track of the PVUW challenge at CVPR 2023. `2023.5.25`

- DVIS has been accepted by ICCV 2023. `2023.7.15`

- DVIS achieved **1st place** in the VIS Track of the 5th LSVOS challenge at ICCV 2023. `2023.8.15`

## Features

- DVIS is a universal video segmentation framework that supports VIS, VPS and VSS.

- DVIS can run in both online and offline modes. 

- DVIS achieved SOTA performance on YTVIS, OVIS, VIPSeg and VSPW datasets.

- DVIS can complete training and inference on GPUs with only 11G memory. 

## Demos

 

  

 

## Installation

See [Installation Instructions](INSTALL.md).

## Getting Started

See [Preparing Datasets for DVIS](datasets/README.md).

See [Getting Started with DVIS](GETTING_STARTED.md).

## Model Zoo

Trained models are available for download in the [DVIS Model Zoo](MODEL_ZOO.md).

## Citing DVIS

```BibTeX

@article{DVIS,

  title={DVIS: Decoupled Video Instance Segmentation Framework},

  author={Zhang, Tao and Tian, Xingye and Wu, Yu and Ji, Shunping and Wang, Xuebo and Zhang, Yuan and Wan, Pengfei},

  journal={arXiv preprint arXiv:2306.03413},

  year={2023}

}

@article{zhang2023vis1st,

  title={1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation},

  author={Zhang, Tao and Tian, Xingye and Zhou, Yikang and Wu, Yu and Ji, Shunping and Yan, Cilin and Wang, Xuebo and Tao, Xin and Zhang, Yuan and Wan, Pengfei},

  journal={arXiv preprint arXiv:2308.14392},

  year={2023}

}

@article{zhang2023vps1st,

  title={1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation},

  author={Zhang, Tao and Tian, Xingye and Wei, Haoran and Wu, Yu and Ji, Shunping and Wang, Xuebo and Zhang, Yuan and Wan, Pengfei},

  journal={arXiv preprint arXiv:2306.04091},

  year={2023}

}

```

## Acknowledgement

This repo is largely based on [Mask2Former](https://github.com/facebookresearch/Mask2Former), [MinVIS](https://github.com/NVlabs/MinVIS) and [VITA](https://github.com/sukjunhwang/VITA).

Thanks for their excellent works.