Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm

07/11/2022
by   Lechao Xiao, et al.
11

Although learning in high dimensions is commonly believed to suffer from the curse of dimensionality, modern machine learning methods often exhibit an astonishing power to tackle a wide range of challenging real-world learning problems without using abundant amounts of data. How exactly these methods break this curse remains a fundamental open question in the theory of deep learning. While previous efforts have investigated this question by studying the data (D), model (M), and inference algorithm (I) as independent modules, in this paper, we analyze the triplet (D, M, I) as an integrated system and identify important synergies that help mitigate the curse of dimensionality. We first study the basic symmetries associated with various learning algorithms (M, I), focusing on four prototypical architectures in deep learning: fully-connected networks (FCN), locally-connected networks (LCN), and convolutional networks with and without pooling (GAP/VEC). We find that learning is most efficient when these symmetries are compatible with those of the data distribution and that performance significantly deteriorates when any member of the (D, M, I) triplet is inconsistent or suboptimal.

READ FULL TEXT

page 4

page 18

page 21

page 22

page 23

research
03/20/2023

What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement

The question of what makes a data distribution suitable for deep learnin...
research
07/19/2021

A quantum algorithm for training wide and deep classical neural networks

Given the success of deep learning in classical machine learning, quantu...
research
02/19/2019

Large-scale mammography CAD with Deformable Conv-Nets

State-of-the-art deep learning methods for image processing are evolving...
research
03/21/2022

Image Classification on Accelerated Neural Networks

For image classification problems, various neural network models are com...
research
07/12/2022

Continual Learning with Deep Learning Methods in an Application-Oriented Context

Abstract knowledge is deeply grounded in many computer-based application...
research
06/16/2015

Deep Convolutional Networks on Graph-Structured Data

Deep Learning's recent successes have mostly relied on Convolutional Net...
research
03/17/2021

Triplet-Watershed for Hyperspectral Image Classification

Hyperspectral images (HSI) consist of rich spatial and spectral informat...

Please sign up or login with your details

Forgot password? Click here to reset