Semantics-Guided Representation Learning with Applications to Visual Synthesis

10/21/2020
by   Jia-Wei Yan, et al.
0

Learning interpretable and interpolatable latent representations has been an emerging research direction, allowing researchers to understand and utilize the derived latent space for further applications such as visual synthesis or recognition. While most existing approaches derive an interpolatable latent space and induces smooth transition in image appearance, it is still not clear how to observe desirable representations which would contain semantic information of interest. In this paper, we aim to learn meaningful representations and simultaneously perform semantic-oriented and visually-smooth interpolation. To this end, we propose an angular triplet-neighbor loss (ATNL) that enables learning a latent representation whose distribution matches the semantic information of interest. With the latent space guided by ATNL, we further utilize spherical semantic interpolation for generating semantic warping of images, allowing synthesis of desirable visual data. Experiments on MNIST and CMU Multi-PIE datasets qualitatively and quantitatively verify the effectiveness of our method.

READ FULL TEXT

page 1

page 4

page 5

page 6

research
05/17/2019

Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces

Variational autoencoders learn unsupervised data representations, but th...
research
04/24/2023

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation

Diffusion models have attained impressive visual quality for image synth...
research
08/05/2022

IDLat: An Importance-Driven Latent Generation Method for Scientific Data

Deep learning based latent representations have been widely used for num...
research
10/31/2017

Semantic Interpolation in Implicit Models

In implicit models, one often interpolates between sampled points in lat...
research
03/28/2020

Inferring Semantic Information with 3D Neural Scene Representations

Biological vision infers multi-modal 3D representations that support rea...
research
09/06/2019

Video Interpolation and Prediction with Unsupervised Landmarks

Prediction and interpolation for long-range video data involves the comp...
research
10/03/2022

Smooth image-to-image translations with latent space interpolations

Multi-domain image-to-image (I2I) translations can transform a source im...

Please sign up or login with your details

Forgot password? Click here to reset