Blurring the Line Between Structure and Learning to Optimize and Adapt Receptive Fields

04/25/2019
by Evan Shelhamer, et al.

The visual world is vast and varied, but its variations divide into structured and unstructured factors. We compose free-form filters and structured Gaussian filters, optimized end-to-end, to factorize deep representations and learn both local features and their degree of locality. Our semi-structured composition is strictly more expressive than free-form filtering, and changes in its structured parameters would require changes in free-form architecture. In effect this optimizes over receptive field size and shape, tuning locality to the data and task. Dynamic inference, in which the Gaussian structure varies with the input, adapts receptive field size to compensate for local scale variation. Optimizing receptive field size improves semantic segmentation accuracy on Cityscapes by 1-2 points for strong dilated and skip architectures and by up to 10 points for suboptimal designs. Adapting receptive fields by dynamic Gaussian structure further improves results, equaling the accuracy of free-form deformation while improving efficiency.
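The composition described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes an isotropic Gaussian with a fixed, hand-set `sigma` (the abstract describes structured parameters that are optimized end-to-end), truncates the Gaussian at two standard deviations, and uses hypothetical helper names (`gaussian_kernel`, `conv2d_full`, `compose`). It only shows how convolving a small free-form filter with a Gaussian yields an effective filter whose spatial extent is controlled by `sigma` rather than by the free-form kernel size.

```python
import numpy as np


def gaussian_kernel(sigma, radius=None):
    """Normalized isotropic 2D Gaussian sampled on an integer grid."""
    if radius is None:
        # Assumption for this sketch: truncate at two standard deviations.
        radius = max(1, int(np.ceil(2 * sigma)))
    ax = np.arange(-radius, radius + 1)
    g1 = np.exp(-(ax ** 2) / (2.0 * sigma ** 2))
    g2 = np.outer(g1, g1)
    return g2 / g2.sum()


def conv2d_full(a, b):
    """Full 2D convolution of two small arrays (output grows by len(b) - 1)."""
    ha, wa = a.shape
    hb, wb = b.shape
    out = np.zeros((ha + hb - 1, wa + wb - 1))
    for i in range(hb):
        for j in range(wb):
            # Each tap of b adds a shifted, scaled copy of a.
            out[i:i + ha, j:j + wa] += b[i, j] * a
    return out


def compose(free_form, sigma):
    """Effective filter: free-form weights composed with a Gaussian of scale sigma."""
    return conv2d_full(gaussian_kernel(sigma), free_form)


rng = np.random.default_rng(0)
free_form = rng.standard_normal((3, 3))  # stands in for a learned 3x3 filter

small = compose(free_form, sigma=1.0)
large = compose(free_form, sigma=2.0)
print(small.shape, large.shape)
```

With this truncation rule, the same nine free-form weights give a 7x7 effective filter at `sigma=1.0` and an 11x11 one at `sigma=2.0`, so the receptive field changes through a single structured parameter while the free-form parameter count stays fixed.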

research
01/15/2017

Understanding the Effective Receptive Field in Deep Convolutional Neural Networks

We study characteristics of receptive fields of units in deep convolutio...
research
08/08/2019

Dynamic Scale Inference by Entropy Minimization

Given the variety of the visual world there is not one true scale for re...
research
10/07/2019

Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation

Convolutional networks are not aware of an object's geometric variations...
research
05/30/2022

Pooling Revisited: Your Receptive Field is Suboptimal

The size and shape of the receptive field determine how the network aggr...
research
11/07/2019

Investigations of the Influences of a CNN's Receptive Field on Segmentation of Subnuclei of Bilateral Amygdalae

Segmentation of objects with various sizes is relatively less explored i...
research
06/28/2022

Graph Condensation via Receptive Field Distribution Matching

Graph neural networks (GNNs) enable the analysis of graphs using deep le...
research
11/12/2021

Frequency learning for structured CNN filters with Gaussian fractional derivatives

Frequency information lies at the base of discriminating between texture...