An Interpretable Generative Model for Handwritten Digit Image Synthesis

11/11/2018
by   Yao Zhu, et al.

This work proposes an interpretable generative model for handwritten digit synthesis. Modern image generative models, such as Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs), are trained by backpropagation (BP). Their training process is complex, and the underlying mechanism is difficult to explain. We propose an interpretable multi-stage PCA method as an alternative and use handwritten digit image synthesis as an illustrative example. First, we derive principal-component-analysis-based (PCA-based) transform kernels at each stage from the covariance of that stage's inputs. This yields a sequence of transforms that convert input images with correlated pixels into spectral vectors with uncorrelated components; in other words, it is a whitening process. Then, we synthesize an image from random vectors and the multi-stage transform kernels through a coloring process. The generative model is a feedforward (FF) design, since no BP is used to determine the model parameters. Its design complexity is significantly lower than that of BP-trained models, and the whole design process is explainable. Finally, we design an FF generative model using the MNIST dataset, compare its synthesis results with those obtained by state-of-the-art GAN and VAE methods, and show that the proposed generative model achieves comparable performance.
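The whitening and coloring steps described in the abstract can be illustrated with a minimal single-stage sketch. This is not the authors' multi-stage design: it assumes one PCA stage, Gaussian random vectors, and a small synthetic dataset standing in for MNIST. The function names (`fit_pca_kernels`, `whiten`, `color`) and the `eps` stabilizer are hypothetical and chosen only for illustration.

```python
import numpy as np

def fit_pca_kernels(X):
    """Derive PCA kernels from the covariance of the inputs (no backpropagation)."""
    mean = X.mean(axis=0)
    Xc = X - mean
    cov = Xc.T @ Xc / (X.shape[0] - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)       # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]            # reorder to descending variance
    variances = np.clip(eigvals[order], 0.0, None)
    kernels = eigvecs[:, order]                  # columns are the transform kernels
    return mean, kernels, variances

def whiten(X, mean, kernels, variances, eps=1e-8):
    """Map correlated inputs to uncorrelated, unit-variance spectral vectors."""
    return (X - mean) @ kernels / np.sqrt(variances + eps)

def color(Z, mean, kernels, variances, eps=1e-8):
    """Inverse of whiten: map spectral vectors back to the input space."""
    return (Z * np.sqrt(variances + eps)) @ kernels.T + mean

rng = np.random.default_rng(0)

# Assumption: a tiny correlated 3-D dataset stands in for flattened digit images.
mixing = np.array([[2.0, 0.0, 0.0],
                   [1.0, 1.0, 0.0],
                   [0.5, -0.3, 0.7]])
X = rng.standard_normal((5000, 3)) @ mixing.T + np.array([1.0, -2.0, 0.5])

mean, kernels, variances = fit_pca_kernels(X)
Z = whiten(X, mean, kernels, variances)          # whitening (analysis)

# Synthesis: draw random spectral vectors and color them.
samples = color(rng.standard_normal((5000, 3)), mean, kernels, variances)
```

In this sketch the synthesized `samples` match the first- and second-order statistics of `X` only. The paper's method applies PCA-based transforms over multiple stages, which this single-stage example does not attempt to reproduce.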


research
10/29/2017

A Saak Transform Approach to Efficient, Scalable and Robust Handwritten Digits Recognition

An efficient, scalable and robust approach to the handwritten digits rec...
research
10/05/2018

Interpretable Convolutional Neural Networks via Feedforward Design

The model parameters of convolutional neural networks (CNNs) are determi...
research
11/10/2021

A Multi-attribute Controllable Generative Model for Histopathology Image Synthesis

Generative models have been applied in the medical imaging domain for va...
research
04/06/2020

GANSpace: Discovering Interpretable GAN Controls

This paper describes a simple technique to analyze Generative Adversaria...
research
08/17/2015

A Generative Model for Multi-Dialect Representation

In the era of deep learning several unsupervised models have been develo...
research
10/11/2017

On Data-Driven Saak Transform

Being motivated by the multilayer RECOS (REctified-COrrelations on a Sph...
research
06/11/2018

Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models

This study investigates the use of non-linear unsupervised dimensionalit...
