The Intrinsic Dimension of Images and Its Impact on Learning

04/18/2021
by   Phillip Pope, et al.
0

It is widely believed that natural image data exhibits low-dimensional structure despite the high dimensionality of conventional pixel representations. This idea underlies a common intuition for the remarkable success of deep learning in computer vision. In this work, we apply dimension estimation tools to popular datasets and investigate the role of low-dimensional structure in deep learning. We find that common natural image datasets indeed have very low intrinsic dimension relative to the high number of pixels in the images. Additionally, we find that low dimensional datasets are easier for neural networks to learn, and models solving these tasks generalize better from training to test data. Along the way, we develop a technique for validating our dimension estimation tools on synthetic data generated by GANs allowing us to actively manipulate the intrinsic dimension by controlling the image generation process. Code for our experiments may be found here https://github.com/ppope/dimensions.

READ FULL TEXT
research
07/04/2019

Adaptive Approximation and Estimation of Deep Neural Network to Intrinsic Dimensionality

We theoretically prove that the generalization performance of deep neura...
research
07/06/2022

The Intrinsic Manifolds of Radiological Images and their Role in Deep Learning

The manifold hypothesis is a core mechanism behind the success of deep l...
research
06/08/2021

Intrinsic Dimension Estimation

It has long been thought that high-dimensional data encountered in many ...
research
05/30/2023

Bottleneck Structure in Learned Features: Low-Dimension vs Regularity Tradeoff

Previous work has shown that DNNs with large depth L and L_2-regularizat...
research
07/06/2022

The Union of Manifolds Hypothesis and its Implications for Deep Generative Modelling

Deep learning has had tremendous success at learning low-dimensional rep...
research
07/20/2022

Intrinsic dimension estimation for discrete metrics

Real world-datasets characterized by discrete features are ubiquitous: f...
research
08/08/2023

The Five-Dollar Model: Generating Game Maps and Sprites from Sentence Embeddings

The five-dollar model is a lightweight text-to-image generative architec...

Please sign up or login with your details

Forgot password? Click here to reset