https://github.com/shi-labs/compfeat-for-video-instance-segmentation

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation (AAAI 2021)
https://github.com/shi-labs/compfeat-for-video-instance-segmentation

Last synced: 8 months ago
JSON representation

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation (AAAI 2021)

Host: GitHub
URL: https://github.com/shi-labs/compfeat-for-video-instance-segmentation
Owner: SHI-Labs
Created: 2020-12-03T19:43:00.000Z (almost 5 years ago)
Default Branch: main
Last Pushed: 2020-12-08T03:40:01.000Z (almost 5 years ago)
Last Synced: 2024-12-30T02:51:42.810Z (9 months ago)
Homepage: https://arxiv.org/abs/2012.03400
Size: 182 KB
Stars: 19
Watchers: 9
Forks: 3
Open Issues: 4
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation

This is the repo to host the code for ComFeat of the following paper:

**CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation**, AAAI 2021 [ArXiv Link](https://arxiv.org/abs/2012.03400)

[Yang Fu](http://ifp-uiuc.github.io/), [Linjie Yang](https://scholar.google.com/citations?user=XptEO8oAAAAJ&hl=en), [Ding Liu](https://scholar.google.com/citations?user=PGtHUI0AAAAJ&hl=en), [Thomas S. Huang](http://ifp-uiuc.github.io/) and [Humphrey Shi](https://www.humphreyshi.com).

**Note:**
Our code will be released here, stay tuned.

## Introduction
Video instance segmentation is a complex task in which we need to detect, segment, and track each object for any given video. Previous approaches only utilize single-frame features for the detection, segmentation, and tracking of objects and they suffer in the video scenario due to several distinct challenges such as motion blur and drastic appearance change.
To eliminate ambiguities introduced by only using single-frame features, we propose a novel comprehensive feature aggregation approach (**ComFeat**) to refine features at both frame-level and object-level with temporal and spatial context information.
The aggregation process is carefully designed with a new attention mechanism which significantly increases the discriminative power of the learned features.
We further improve the tracking capability of our model through a siamese design by incorporating both feature similarities and spatial similarities.
Our proposed CompFeat achieves the state-of-the-art results on the challenging YouTube-VIS dataset.

## Framework
![Illustration of the Self-similarity Grouping.](./figs/framework.png)

## Bibtex
```
@article{fu2021compfeat,
title={CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation},
author={Yang Fu, Linjie Yang, Ding Liu, Thomas S. Huang and Humphrey Shi},
journal={AAAI Conference on Artificial Intelligence},
year={2021}
}
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/shi-labs/compfeat-for-video-instance-segmentation

Awesome Lists containing this project

README