Graph-Based Global Reasoning Networks

11/30/2018
by   Yunpeng Chen, et al.
0

Globally modeling and reasoning over relations between regions can be beneficial for many computer vision tasks on both images and videos. Convolutional Neural Networks (CNNs) excel at modeling local relations by convolution operations, but they are typically inefficient at capturing global relations between distant regions and require stacking multiple convolution layers. In this work, we propose a new approach for reasoning globally in which a set of features are globally aggregated over the coordinate space and then projected to an interaction space where relational reasoning can be efficiently computed. After reasoning, relation-aware features are distributed back to the original coordinate space for down-stream tasks. We further present a highly efficient instantiation of the proposed approach and introduce the Global Reasoning unit (GloRe unit) that implements the coordinate-interaction space mapping by weighted global pooling and weighted broadcasting, and the relation reasoning via graph convolution on a small graph in interaction space. The proposed GloRe unit is lightweight, end-to-end trainable and can be easily plugged into existing CNNs for a wide range of tasks. Extensive experiments show our GloRe unit can consistently boost the performance of state-of-the-art backbone architectures, including ResNet, ResNeXt, SE-Net and DPN, for both 2D and 3D CNNs, on image classification, semantic segmentation and video action recognition task.

READ FULL TEXT
research
03/07/2021

Graph-based Pyramid Global Context Reasoning with a Saliency-aware Projection for COVID-19 Lung Infections Segmentation

Coronavirus Disease 2019 (COVID-19) has rapidly spread in 2020, emerging...
research
10/12/2019

Context-Gated Convolution

As the basic building block of Convolutional Neural Networks (CNNs), the...
research
04/05/2019

Relation-Aware Global Attention

Attention mechanism aims to increase the representation power by focusin...
research
11/06/2020

Towards Efficient Scene Understanding via Squeeze Reasoning

Graph-based convolutional model such as non-local block has shown to be ...
research
08/26/2020

Visual Concept Reasoning Networks

A split-transform-merge strategy has been broadly used as an architectur...
research
10/27/2018

A^2-Nets: Double Attention Networks

Learning to capture long-range relations is fundamental to image/video r...
research
12/07/2017

Broadcasting Convolutional Network

While convolutional neural networks (CNNs) are widely used for handling ...

Please sign up or login with your details

Forgot password? Click here to reset