Masked-attention Mask Transformer for Universal Image Segmentation

12/02/2021
by Bowen Cheng, et al.

Image segmentation is about grouping pixels with different semantics, e.g., category or instance membership, where each choice of semantics defines a task. While only the semantics of each task differ, current research focuses on designing specialized architectures for each task. We present Masked-attention Mask Transformer (Mask2Former), a new architecture capable of addressing any image segmentation task (panoptic, instance or semantic). Its key components include masked attention, which extracts localized features by constraining cross-attention within predicted mask regions. In addition to reducing the research effort by at least three times, it outperforms the best specialized architectures by a significant margin on four popular datasets. Most notably, Mask2Former sets a new state-of-the-art for panoptic segmentation (57.8 PQ on COCO), instance segmentation (50.1 AP on COCO) and semantic segmentation (57.7 mIoU on ADE20K).
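
As a rough illustration of the masked attention idea described above, the following NumPy sketch restricts each query's cross-attention to the pixels inside its predicted mask. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `masked_cross_attention`, the 0.5 threshold, the 1/sqrt(d) scaling, and the fallback to full attention when a predicted mask is empty are all illustrative choices that the abstract does not specify.

```python
import numpy as np


def masked_cross_attention(queries, keys, values, mask_probs, threshold=0.5):
    """Cross-attention restricted to each query's predicted mask region.

    queries:    (N, d) one embedding per object query
    keys:       (P, d) one embedding per pixel / image location
    values:     (P, d) pixel features to aggregate
    mask_probs: (N, P) predicted mask probability of each pixel for each query
    threshold:  assumed binarization threshold (hypothetical default)
    """
    # A pixel is visible to a query only if the predicted mask covers it.
    allowed = mask_probs >= threshold

    # Assumption: a query whose predicted mask is empty attends everywhere,
    # otherwise its softmax would be taken over no pixels at all.
    empty = ~allowed.any(axis=1, keepdims=True)
    allowed = allowed | empty

    # Scaled dot-product scores; disallowed pixels are set to -inf so they
    # receive exactly zero weight after the softmax.
    scores = queries @ keys.T / np.sqrt(queries.shape[-1])
    scores = np.where(allowed, scores, -np.inf)

    # Numerically stable softmax over the pixel axis.
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)

    return weights @ values, weights
```

Each row of the returned weights sums to one and is zero outside the thresholded mask, so the attended feature for a query is an average of pixel features drawn only from its own predicted region.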

research
12/20/2021

Mask2Former for Video Instance Segmentation

We find Mask2Former also achieves state-of-the-art performance on video ...
research
11/26/2021

Mask Transfiner for High-Quality Instance Segmentation

Two-stage and query-based instance segmentation methods have achieved re...
research
05/23/2016

Bridging Category-level and Instance-level Semantic Image Segmentation

We propose an approach to instance-level image segmentation that is buil...
research
06/17/2022

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-base...
research
10/06/2022

Mask3D for 3D Semantic Instance Segmentation

Modern 3D semantic instance segmentation approaches predominantly rely o...
research
06/02/2023

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Observing the close relationship among panoptic, semantic and instance s...
research
06/28/2021

K-Net: Towards Unified Image Segmentation

Semantic, instance, and panoptic segmentations have been addressed using...
