Integrating Flexible Normalization into Mid-Level Representations of Deep Convolutional Neural Networks

Deep convolutional neural networks (CNNs) are becoming increasingly popular models to predict neural responses in visual cortex. However, contextual effects, which are prevalent in neural processing and in perception, are not explicitly handled by current CNNs, including those used for neural prediction. In primary visual cortex, neural responses are modulated by stimuli spatially surrounding the classical receptive field in rich ways. These effects have been modeled with divisive normalization approaches, including flexible models where spatial normalization is recruited only to the degree responses from center and surround locations are deemed statistically dependent. We propose a flexible normalization model applied to mid-level representations of deep CNNs as a tractable way to study contextual normalization mechanisms in mid-level visual areas. This approach captures non-trivial spatial dependencies among mid-level features in CNNs, such as those present in textures and other visual stimuli that arise from tiling high order features, geometrically. We expect that the proposed approach can make predictions about when spatial normalization might be recruited in mid-level cortical areas. We also expect this approach to be useful as part of the CNN toolkit, therefore going beyond more restrictive fixed forms of normalization.

READ FULL TEXT

page 6

page 7

page 8

page 16

page 17

research
06/07/2018

Correspondence of Deep Neural Networks and the Brain for Visual Textures

Deep convolutional neural networks (CNNs) trained on objects and scenes ...
research
11/08/2017

Revealing structure components of the retina by deep learning networks

Deep convolutional neural networks (CNNs) have demonstrated impressive p...
research
10/01/2016

Very Deep Convolutional Neural Networks for Raw Waveforms

Learning acoustic models directly from the raw waveform data with minima...
research
09/27/2018

A rotation-equivariant convolutional neural network model of primary visual cortex

Classical models describe primary visual cortex (V1) as a filter bank of...
research
06/11/2020

A new inference approach for training shallow and deep generalized linear models of noisy interacting neurons

Generalized linear models are one of the most efficient paradigms for pr...
research
05/18/2023

Explaining V1 Properties with a Biologically Constrained Deep Learning Architecture

Convolutional neural networks (CNNs) have recently emerged as promising ...
research
12/13/2015

Cross-dimensional Weighting for Aggregated Deep Convolutional Features

We propose a simple and straightforward way of creating powerful image r...

Please sign up or login with your details

Forgot password? Click here to reset