MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

12/01/2020
by   Huiyu Wang, et al.
0

We present MaX-DeepLab, the first end-to-end model for panoptic segmentation. Our approach simplifies the current pipeline that depends heavily on surrogate sub-tasks and hand-designed components, such as box detection, non-maximum suppression, thing-stuff merging, etc. Although these sub-tasks are tackled by area experts, they fail to comprehensively solve the target task. By contrast, our MaX-DeepLab directly predicts class-labeled masks with a mask transformer, and is trained with a panoptic quality inspired loss via bipartite matching. Our mask transformer employs a dual-path architecture that introduces a global memory path in addition to a CNN path, allowing direct communication with any CNN layers. As a result, MaX-DeepLab shows a significant 7.1 box-free regime on the challenging COCO dataset, closing the gap between box-based and box-free methods for the first time. A small variant of MaX-DeepLab improves 3.0 Furthermore, MaX-DeepLab, without test time augmentation, achieves new state-of-the-art 51.3

READ FULL TEXT

page 1

page 8

page 10

page 11

page 12

page 13

research
05/03/2021

ISTR: End-to-End Instance Segmentation with Transformers

End-to-end paradigms significantly improve the accuracy of various deep-...
research
01/10/2023

Vision Transformers Are Good Mask Auto-Labelers

We propose Mask Auto-Labeler (MAL), a high-quality Transformer-based mas...
research
04/05/2023

MethaneMapper: Spectral Absorption aware Hyperspectral Transformer for Methane Detection

Methane (CH_4) is the chief contributor to global climate change. Recent...
research
11/25/2021

BoxeR: Box-Attention for 2D and 3D Transformers

In this paper, we propose a simple attention mechanism, we call Box-Atte...
research
06/29/2023

ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation

This paper presents a new mechanism to facilitate the training of mask t...
research
08/08/2021

LeafMask: Towards Greater Accuracy on Leaf Segmentation

Leaf segmentation is the most direct and effective way for high-throughp...
research
03/08/2023

X-Pruner: eXplainable Pruning for Vision Transformers

Recently vision transformer models have become prominent models for a ra...

Please sign up or login with your details

Forgot password? Click here to reset