Replicating Kernels with a Short Stride Allows Sparse Reconstructions with Fewer Independent Kernels

06/17/2014
by   Peter F. Schultz, et al.
0

In sparse coding it is common to tile an image into nonoverlapping patches, and then use a dictionary to create a sparse representation of each tile independently. In this situation, the overcompleteness of the dictionary is the number of dictionary elements divided by the patch size. In deconvolutional neural networks (DCNs), dictionaries learned on nonoverlapping tiles are replaced by a family of convolution kernels. Hence adjacent points in the feature maps (V1 layers) have receptive fields in the image that are translations of each other. The translational distance is determined by the dimensions of V1 in comparison to the dimensions of the image space. We refer to this translational distance as the stride. We implement a type of DCN using a modified Locally Competitive Algorithm (LCA) to investigate the relationship between the number of kernels, the stride, the receptive field size, and the quality of reconstruction. We find, for example, that for 16x16-pixel receptive fields, using eight kernels and a stride of 2 leads to sparse reconstructions of comparable quality as using 512 kernels and a stride of 16 (the nonoverlapping case). We also find that for a given stride and number of kernels, the patch size does not significantly affect reconstruction quality. Instead, the learned convolution kernels have a natural support radius independent of the patch size.

READ FULL TEXT

page 5

page 6

page 8

page 9

page 10

page 11

research
07/06/2021

Integrating Circle Kernels into Convolutional Neural Networks

The square kernel is a standard unit for contemporary Convolutional Neur...
research
07/26/2021

Log-Polar Space Convolution for Convolutional Neural Networks

Convolutional neural networks use regular quadrilateral convolution kern...
research
10/15/2018

Sparse-View CT Reconstruction via Convolutional Sparse Coding

Traditional dictionary learning based CT reconstruction methods are patc...
research
10/13/2011

Sparse Image Representation with Epitomes

Sparse coding, which is the decomposition of a vector using only a few b...
research
07/18/2020

Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing

Depth data provide geometric information that can bring progress in RGB-...
research
10/07/2019

Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation

Convolutional networks are not aware of an object's geometric variations...
research
02/22/2023

Sparse, Geometric Autoencoder Models of V1

The classical sparse coding model represents visual stimuli as a linear ...

Please sign up or login with your details

Forgot password? Click here to reset