U(1) Symmetry-breaking Observed in Generic CNN Bottleneck Layers

06/05/2022
by   Louis-François Bouchard, et al.
0

We report on a significant discovery linking deep convolutional neural networks (CNN) to biological vision and fundamental particle physics. A model of information propagation in a CNN is proposed via an analogy to an optical system, where bosonic particles (i.e. photons) are concentrated as the 2D spatial resolution of the image collapses to a focal point 1× 1=1. A 3D space (x,y,t) is defined by (x,y) coordinates in the image plane and CNN layer t, where a principal ray (0,0,t) runs in the direction of information propagation through both the optical axis and the image center pixel located at (x,y)=(0,0), about which the sharpest possible spatial focus is limited to a circle of confusion in the image plane. Our novel insight is to model the principal optical ray (0,0,t) as geometrically equivalent to the medial vector in the positive orthant I(x,y) ∈ R^N+ of a N-channel activation space, e.g. along the greyscale (or luminance) vector (t,t,t) in RGB colour space. Information is thus concentrated into an energy potential E(x,y,t)=I(x,y,t)^2, which, particularly for bottleneck layers t of generic CNNs, is highly concentrated and symmetric about the spatial origin (0,0,t) and exhibits the well-known "Sombrero" potential of the boson particle. This symmetry is broken in classification, where bottleneck layers of generic pre-trained CNN models exhibit a consistent class-specific bias towards an angle θ∈ U(1) defined simultaneously in the image plane and in activation feature space. Initial observations validate our hypothesis from generic pre-trained CNN activation maps and a bare-bones memory-based classification scheme, with no training or tuning. Training from scratch using a random U(1) class label the leads to improved classification in all cases.

READ FULL TEXT

page 8

page 10

page 11

research
07/08/2015

Feature Representation in Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are powerful models that achieve im...
research
06/10/2021

Transformed CNNs: recasting pre-trained convolutional layers with self-attention

Vision Transformers (ViT) have recently emerged as a powerful alternativ...
research
09/21/2020

Depth-Adapted CNN for RGB-D cameras

Conventional 2D Convolutional Neural Networks (CNN) extract features fro...
research
10/16/2021

An Acceleration Method Based on Deep Learning and Multilinear Feature Space

Computer vision plays a crucial role in Advanced Assistance Systems. Mos...
research
04/02/2017

Understanding Deep Representations through Random Weights

We systematically study the deep representation of random weight CNN (co...
research
06/13/2017

Identifying Spatial Relations in Images using Convolutional Neural Networks

Traditional approaches to building a large scale knowledge graph have us...
research
03/21/2023

Neural networks trained on synthetically generated crystals can extract structural information from ICSD powder X-ray diffractograms

Machine learning techniques have successfully been used to extract struc...

Please sign up or login with your details

Forgot password? Click here to reset