Hybrid Task Cascade for Instance Segmentation

01/22/2019
by   Kai Chen, et al.
6

Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question. A simple combination of Cascade R-CNN and Mask R-CNN only brings limited gain. In exploring a more effective approach, we find that the key to a successful instance segmentation cascade is to fully leverage the reciprocal relationship between detection and segmentation. In this work, we propose a new framework, Hybrid Task Cascade (HTC), which differs in two important aspects: (1) instead of performing cascaded refinement on these two tasks separately, it interweaves them for a joint multi-stage processing; (2) it adopts a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background. Overall, this framework can learn more discriminative features progressively while integrating complementary features together in each stage. Without bells and whistles, a single HTC obtains 38.4 Mask R-CNN baseline on MSCOCO dataset. More importantly, our overall system achieves 48.6 mask AP on the test-challenge dataset and 49.0 mask AP on test-dev, which are the state-of-the-art performance.

READ FULL TEXT
research
03/26/2020

Mask Encoding for Single Shot Instance Segmentation

To date, instance segmentation is dominated by twostage methods, as pion...
research
11/12/2019

Equalization Loss for Large Vocabulary Instance Segmentation

Recent object detection and instance segmentation tasks mainly focus on ...
research
11/15/2019

CenterMask : Real-Time Anchor-Free Instance Segmentation

We propose a simple yet efficient anchor-free instance segmentation, cal...
research
08/01/2017

CNN Cascades for Segmenting Whole Slide Images of the Kidney

Due to the increasing availability of whole slide scanners facilitating ...
research
08/23/2020

Seesaw Loss for Long-Tailed Instance Segmentation

This report presents the approach used in the submission of the LVIS Cha...
research
05/07/2021

A^2-FPN: Attention Aggregation based Feature Pyramid Network for Instance Segmentation

Learning pyramidal feature representations is crucial for recognizing ob...
research
09/03/2020

1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask

This article introduces the solutions of the team lvisTraveler for LVIS ...

Please sign up or login with your details

Forgot password? Click here to reset