DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation

07/12/2023
by   Yipeng Leng, et al.

Diffusion probabilistic models (DPMs) have shown remarkable results on various image synthesis tasks such as text-to-image generation and image inpainting. However, compared to other generative methods like VAEs and GANs, DPMs lack a low-dimensional, interpretable, and well-decoupled latent code. Recently, diffusion autoencoders (Diff-AE) were proposed to explore the potential of DPMs for representation learning via autoencoding. Diff-AE provides an accessible latent space that exhibits remarkable interpretability, allowing image attributes to be manipulated through latent codes from that space. However, previous works are not generic, as they operate on only a limited set of attributes. To further explore the latent space of Diff-AE and achieve a generic editing pipeline, we propose a module called Group-supervised AutoEncoder (dubbed GAE) for Diff-AE to achieve better disentanglement of the latent code. GAE is trained via an attribute-swap strategy to acquire latent codes for example-based, multi-attribute image manipulation. We empirically demonstrate that our method enables multi-attribute manipulation and achieves convincing sample quality and attribute alignment, while significantly reducing computational requirements compared to pixel-based approaches for representational decoupling. Code will be released soon.
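
The attribute-swap idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes, purely for illustration, that a latent code is a flat vector partitioned into fixed chunks, one per attribute. The function `swap_attribute`, the `attr_slices` layout, the attribute names, and the 512-dimensional code size are all hypothetical.

```python
import numpy as np

def swap_attribute(z_a, z_b, attr_slices, attr):
    """Return copies of z_a and z_b with the chunk assigned to `attr` exchanged.

    Assumption: each attribute occupies a fixed, contiguous slice of the code.
    """
    s = attr_slices[attr]
    z_a_new, z_b_new = z_a.copy(), z_b.copy()
    z_a_new[s] = z_b[s]
    z_b_new[s] = z_a[s]
    return z_a_new, z_b_new

# Hypothetical layout: a 512-d code split into per-attribute chunks.
attr_slices = {
    "hair": slice(0, 128),
    "smile": slice(128, 256),
    "rest": slice(256, 512),
}

rng = np.random.default_rng(0)
z_a = rng.normal(size=512)  # stand-in for the latent code of image A
z_b = rng.normal(size=512)  # stand-in for the latent code of image B

# Exchange only the "smile" chunk; all other chunks stay with their source.
z_a_swapped, z_b_swapped = swap_attribute(z_a, z_b, attr_slices, "smile")
```

In a setup like this, decoding `z_a_swapped` would be expected to yield image A carrying B's value for the swapped attribute, which is the kind of example-based edit the abstract describes. How the paper actually groups, supervises, and decodes the latent code is not specified in this abstract.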


