Unifying Nonlocal Blocks for Neural Networks

08/05/2021
by   Lei Zhu, et al.
0

The nonlocal-based blocks are designed for capturing long-range spatial-temporal dependencies in computer vision tasks. Although having shown excellent performance, they still lack the mechanism to encode the rich, structured information among elements in an image or video. In this paper, to theoretically analyze the property of these nonlocal-based blocks, we provide a new perspective to interpret them, where we view them as a set of graph filters generated on a fully-connected graph. Specifically, when choosing the Chebyshev graph filter, a unified formulation can be derived for explaining and analyzing the existing nonlocal-based blocks (e.g., nonlocal block, nonlocal stage, double attention block). Furthermore, by concerning the property of spectral, we propose an efficient and robust spectral nonlocal block, which can be more robust and flexible to catch long-range dependencies when inserted into deep neural networks than the existing nonlocal blocks. Experimental results demonstrate the clear-cut improvements and practical applicabilities of our method on image classification, action recognition, semantic segmentation, and person re-identification tasks.

READ FULL TEXT

page 2

page 5

page 7

research
11/04/2019

A Spectral Nonlocal Block for Neural Networks

The nonlocal network is designed for capturing long-range spatial-tempor...
research
09/20/2022

Dynamic Graph Message Passing Networks for Visual Recognition

Modelling long-range dependencies is critical for scene understanding ta...
research
06/02/2018

Nonlocal Neural Networks, Nonlocal Diffusion and Nonlocal Modeling

Nonlocal neural networks have been proposed and shown to be effective in...
research
05/31/2023

CVSNet: A Computer Implementation for Central Visual System of The Brain

In computer vision, different basic blocks are created around different ...
research
10/27/2018

A^2-Nets: Double Attention Networks

Learning to capture long-range relations is fundamental to image/video r...
research
09/27/2019

Learnable Tree Filter for Structure-preserving Feature Transform

Learning discriminative global features plays a vital role in semantic s...
research
11/10/2021

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Models recently used in the literature proving residual networks (ResNet...

Please sign up or login with your details

Forgot password? Click here to reset