Global Aggregation then Local Distribution in Fully Convolutional Networks

09/16/2019
by   Xiangtai Li, et al.
3

It has been widely proven that modelling long-range dependencies in fully convolutional networks (FCNs) via global aggregation modules is critical for complex scene understanding tasks such as semantic segmentation and object detection. However, global aggregation is often dominated by features of large patterns and tends to oversmooth regions that contain small patterns (e.g., boundaries and small objects). To resolve this problem, we propose to first use Global Aggregation and then Local Distribution, which is called GALD, where long-range dependencies are more confidently used inside large pattern regions and vice versa. The size of each pattern at each position is estimated in the network as a per-channel mask map. GALD is end-to-end trainable and can be easily plugged into existing FCNs with various global aggregation modules for a wide range of vision tasks, and consistently improves the performance of state-of-the-art object detection and instance segmentation approaches. In particular, GALD used in semantic segmentation achieves new state-of-the-art performance on Cityscapes test set with mIoU 83.3%. Code is available at: <https://github.com/lxtGH/GALD-Net>

READ FULL TEXT

page 2

page 8

page 9

page 10

page 12

page 13

research
07/28/2021

Global Aggregation then Local Distribution for Scene Parsing

Modelling long-range contextual relationships is critical for pixel-wise...
research
11/23/2016

Fully Convolutional Instance-aware Semantic Segmentation

We present the first fully convolutional end-to-end solution for instanc...
research
12/21/2022

DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation

Transformer-based models have been widely demonstrated to be successful ...
research
12/07/2020

Rethinking Learnable Tree Filter for Generic Feature Transform

The Learnable Tree Filter presents a remarkable approach to model struct...
research
12/08/2021

Fully Attentional Network for Semantic Segmentation

Recent non-local self-attention methods have proven to be effective in c...
research
03/26/2020

Memory Enhanced Global-Local Aggregation for Video Object Detection

How do humans recognize an object in a piece of video? Due to the deteri...
research
07/20/2018

Competition vs. Concatenation in Skip Connections of Fully Convolutional Networks

Increased information sharing through short and long-range skip connecti...

Please sign up or login with your details

Forgot password? Click here to reset