A Tour of Convolutional Networks Guided by Linear Interpreters

Convolutional networks are large linear systems divided into layers and connected by non-linear units. These units are the "articulations" that allow the network to adapt to the input. To understand how a network manages to solve a problem we must look at the articulated decisions in entirety. If we could capture the actions of non-linear units for a particular input, we would be able to replay the whole system back and forth as if it was always linear. It would also reveal the actions of non-linearities because the resulting linear system, a Linear Interpreter, depends on the input image. We introduce a hooking layer, called a LinearScope, which allows us to run the network and the linear interpreter in parallel. Its implementation is simple, flexible and efficient. From here we can make many curious inquiries: how do these linear systems look like? When the rows and columns of the transformation matrix are images, how do they look like? What type of basis do these linear transformations rely on? The answers depend on the problems presented, through which we take a tour to some popular architectures used for classification, super-resolution (SR) and image-to-image translation (I2I). For classification we observe that popular networks use a pixel-wise vote per class strategy and heavily rely on bias parameters. For SR and I2I we find that CNNs use wavelet-type basis similar to the human visual system. For I2I we reveal copy-move and template-creation strategies to generate outputs.

READ FULL TEXT

page 6

page 7

page 11

page 12

page 13

page 14

page 16

research
11/07/2017

Can Maxout Units Downsize Restoration Networks? - Single Image Super-Resolution Using Lightweight CNN with Maxout Units

Rectified linear units (ReLU) are well-known to be helpful in obtaining ...
research
07/10/2023

DWA: Differential Wavelet Amplifier for Image Super-Resolution

This work introduces Differential Wavelet Amplifier (DWA), a drop-in mod...
research
12/05/2020

Spatially-Adaptive Pixelwise Networks for Fast Image Translation

We introduce a new generator architecture, aimed at fast and efficient h...
research
04/17/2021

VSpSR: Explorable Super-Resolution via Variational Sparse Representation

Super-resolution (SR) is an ill-posed problem, which means that infinite...
research
05/03/2023

Bicubic++: Slim, Slimmer, Slimmest – Designing an Industry-Grade Super-Resolution Network

We propose a real-time and lightweight single-image super-resolution (SR...
research
05/22/2017

Convolutional Networks with MuxOut Layers as Multi-rate Systems for Image Upscaling

We interpret convolutional networks as adaptive filters and combine them...
research
08/23/2021

edge-SR: Super-Resolution For The Masses

Classic image scaling (e.g. bicubic) can be seen as one convolutional la...

Please sign up or login with your details

Forgot password? Click here to reset