Learning multi-scale local conditional probability models of images

03/06/2023
by   Zahra Kadkhodaie, et al.
0

Deep neural networks can learn powerful prior probability models for images, as evidenced by the high-quality generations obtained with recent score-based diffusion methods. But the means by which these networks capture complex global statistical structure, apparently without suffering from the curse of dimensionality, remain a mystery. To study this, we incorporate diffusion methods into a multi-scale decomposition, reducing dimensionality by assuming a stationary local Markov model for wavelet coefficients conditioned on coarser-scale coefficients. We instantiate this model using convolutional neural networks (CNNs) with local receptive fields, which enforce both the stationarity and Markov properties. Global structures are captured using a CNN with receptive fields covering the entire (but small) low-pass image. We test this model on a dataset of face images, which are highly non-stationary and contain large-scale geometric structures. Remarkably, denoising, super-resolution, and image synthesis results all demonstrate that these structures can be captured with significantly smaller conditioning neighborhoods than required by a Markov model implemented in the pixel domain. Our results show that score estimation for large complex images can be reduced to low-dimensional Markov conditional models across scales, alleviating the curse of dimensionality.

READ FULL TEXT

page 3

page 7

page 8

page 9

page 15

page 16

research
07/07/2019

ASCNet: Adaptive-Scale Convolutional Neural Networks for Multi-Scale Feature Learning

Extracting multi-scale information is key to semantic segmentation. Howe...
research
01/03/2018

Joint convolutional neural pyramid for depth map super-resolution

High-resolution depth map can be inferred from a low-resolution one with...
research
04/19/2022

CTCNet: A CNN-Transformer Cooperation Network for Face Image Super-Resolution

Recently, deep convolution neural networks (CNNs) steered face super-res...
research
05/26/2020

Perceptual Extreme Super Resolution Network with Receptive Field Block

Perceptual Extreme Super-Resolution for single image is extremely diffic...
research
09/20/2023

Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models

We investigate the approximation efficiency of score functions by deep n...
research
11/18/2015

Super-Resolution with Deep Convolutional Sufficient Statistics

Inverse problems in image and audio, and super-resolution in particular,...
research
12/14/2020

Learning Hybrid Representations for Automatic 3D Vessel Centerline Extraction

Automatic blood vessel extraction from 3D medical images is crucial for ...

Please sign up or login with your details

Forgot password? Click here to reset