ALAP-AE: As-Lite-as-Possible Auto-Encoder

03/19/2022
by   Nisarg A. Shah, et al.

We present a novel algorithm that reduces the tensor compute required by a conditional image generation autoencoder, making it as-lite-as-possible without sacrificing the quality of photo-realistic image generation. Our method is device agnostic and can optimize an autoencoder for a given CPU-only or GPU compute device in about the time it normally takes to train an autoencoder on a generic workstation. We achieve this with a novel two-stage strategy. First, we condense the channel weights so that as few channels as possible are used. Then, we prune the nearly zeroed-out weight activations and fine-tune the resulting lite autoencoder. To maintain image quality, fine-tuning is done via student-teacher training, where we reuse the condensed autoencoder as the teacher. We show performance gains on several conditional image generation tasks across multiple compute devices: segmentation mask to face images, face images to cartoonization, and a CycleGAN-based model on the horse-to-zebra dataset. We perform ablation studies to justify our claims and design choices, and we achieve real-time versions of various autoencoders on CPU-only devices while maintaining image quality, thus enabling at-scale deployment of such autoencoders.
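
The two stages described in the abstract can be illustrated with a minimal, hypothetical sketch. The abstract does not give the condensation criterion, the pruning threshold, or the fine-tuning loss, so everything below is an assumption made for illustration: per-channel scale factors stand in for the condensed channel weights, a fixed magnitude threshold decides which channels count as nearly zeroed out, and a mean squared error between teacher and student outputs stands in for the student-teacher objective. The function names are not taken from the paper.

```python
from typing import List, Sequence


def select_channels(scales: Sequence[float], threshold: float = 1e-2) -> List[int]:
    """Return indices of channels whose scale magnitude exceeds the threshold.

    `scales` and `threshold` are illustrative assumptions, not values from the paper.
    """
    return [i for i, s in enumerate(scales) if abs(s) > threshold]


def prune_rows(weight: Sequence[Sequence[float]], keep: Sequence[int]) -> List[List[float]]:
    """Keep only the output-channel rows of `weight` listed in `keep`."""
    return [list(weight[i]) for i in keep]


def distillation_loss(student_out: Sequence[float], teacher_out: Sequence[float]) -> float:
    """Mean squared error between student and teacher outputs (assumed objective)."""
    if len(student_out) != len(teacher_out):
        raise ValueError("student and teacher outputs must have the same length")
    return sum((s - t) ** 2 for s, t in zip(student_out, teacher_out)) / len(student_out)


# Toy layer with four output channels; two scales are nearly zero after condensing.
scales = [0.9, 0.001, 0.4, -0.0005]
weight = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]

keep = select_channels(scales)          # channels that survive pruning
lite_weight = prune_rows(weight, keep)  # smaller weight for the lite (student) layer
```

In this toy setting the condensed layer plays the role of the teacher, and the pruned layer would be fine-tuned by minimizing `distillation_loss` between their outputs on the same inputs.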
