Semantic Segmentation by Early Region Proxy

03/26/2022
by   Yifan Zhang, et al.
2

Typical vision backbones manipulate structured features. As a compromise, semantic segmentation has long been modeled as per-point prediction on dense regular grids. In this work, we present a novel and efficient modeling that starts from interpreting the image as a tessellation of learnable regions, each of which has flexible geometrics and carries homogeneous semantics. To model region-wise context, we exploit Transformer to encode regions in a sequence-to-sequence manner by applying multi-layer self-attention on the region embeddings, which serve as proxies of specific regions. Semantic segmentation is now carried out as per-region prediction on top of the encoded region embeddings using a single linear classifier, where a decoder is no longer needed. The proposed RegProxy model discards the common Cartesian feature layout and operates purely at region level. Hence, it exhibits the most competitive performance-efficiency trade-off compared with the conventional dense prediction methods. For example, on ADE20K, the small-sized RegProxy-S/16 outperforms the best CNN model using 25 the largest RegProxy-L/16 achieves 52.9mIoU which outperforms the state-of-the-art by 2.1 at https://github.com/YiF-Zhang/RegionProxy.

READ FULL TEXT

page 7

page 8

page 14

page 15

page 16

research
05/12/2021

Segmenter: Transformer for Semantic Segmentation

Image segmentation is often ambiguous at the level of individual image p...
research
12/31/2020

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Most recent semantic segmentation methods adopt a fully-convolutional ne...
research
12/06/2022

IncepFormer: Efficient Inception Transformer with Pyramid Pooling for Semantic Segmentation

Semantic segmentation usually benefits from global contexts, fine locali...
research
06/28/2021

Multi-Compound Transformer for Accurate Biomedical Image Segmentation

The recent vision transformer(i.e.for image classification) learns non-l...
research
02/21/2023

Lightweight Real-time Semantic Segmentation Network with Efficient Transformer and CNN

In the past decade, convolutional neural networks (CNNs) have shown prom...
research
04/03/2023

Associating Spatially-Consistent Grouping with Text-supervised Semantic Segmentation

In this work, we investigate performing semantic segmentation solely thr...
research
10/03/2022

Masked Supervised Learning for Semantic Segmentation

Self-attention is of vital importance in semantic segmentation as it ena...

Please sign up or login with your details

Forgot password? Click here to reset