Unsupervised Discovery of Semantic Latent Directions in Diffusion Models

02/24/2023
by Yong-Hyun Park, et al.

Despite the success of diffusion models (DMs), we still lack a thorough understanding of their latent space. While image editing with GANs builds upon the latent space, DMs rely on editing conditions such as text prompts. We present an unsupervised method to discover interpretable editing directions for the latent variables 𝐱_t ∈𝒳 of DMs. Our method adopts Riemannian geometry between 𝒳 and the intermediate feature maps ℋ of the U-Nets to provide a deep understanding of the geometrical structure of 𝒳. The discovered semantic latent directions mostly yield disentangled attribute changes, and they are globally consistent across different samples. Furthermore, editing in earlier timesteps changes coarse attributes, while editing in later timesteps focuses on high-frequency details. We define the curvedness of a line segment between samples to show that 𝒳 is a curved manifold. Experiments on different baselines and datasets demonstrate the effectiveness of our method even on Stable Diffusion. Our source code will be publicly available for future researchers.
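
The abstract does not spell out how the directions are computed. As a minimal sketch of one plausible reading, assume the editing directions at a point x are the dominant right singular vectors of the Jacobian J of the map from 𝒳 to ℋ, that is, the leading eigenvectors of the pullback metric JᵀJ. Everything below is hypothetical and for illustration only: `toy_feature_map` stands in for the U-Net's x_t → h map, the Jacobian is taken by finite differences, and the function names are not from the authors' code.

```python
import math

def toy_feature_map(x):
    # Hypothetical stand-in for the U-Net map from a latent x_t to features h.
    return [math.tanh(3.0 * x[0] + 0.5 * x[1]),
            math.tanh(0.2 * x[1] - 0.1 * x[2])]

def jacobian(f, x, eps=1e-5):
    """Central finite-difference Jacobian of f at x, as a list of m rows of length n."""
    n = len(x)
    cols = []
    for j in range(n):
        xp, xm = list(x), list(x)
        xp[j] += eps
        xm[j] -= eps
        fp, fm = f(xp), f(xm)
        cols.append([(a - b) / (2 * eps) for a, b in zip(fp, fm)])
    m = len(cols[0])
    return [[cols[j][i] for j in range(n)] for i in range(m)]

def top_direction(J, iters=200):
    """Power iteration on J^T J.

    Returns (v, sigma): a unit vector approximating the dominant right
    singular vector of J, and the corresponding singular value.
    Assumes the uniform start vector is not orthogonal to that direction.
    """
    m, n = len(J), len(J[0])
    v = [1.0 / math.sqrt(n)] * n
    for _ in range(iters):
        Jv = [sum(J[i][j] * v[j] for j in range(n)) for i in range(m)]
        w = [sum(J[i][j] * Jv[i] for i in range(m)) for j in range(n)]
        norm = math.sqrt(sum(c * c for c in w))
        v = [c / norm for c in w]
    Jv = [sum(J[i][j] * v[j] for j in range(n)) for i in range(m)]
    sigma = math.sqrt(sum(c * c for c in Jv))
    return v, sigma

def edit(x, v, alpha):
    """Move the latent x along direction v by step size alpha."""
    return [xi + alpha * vi for xi, vi in zip(x, v)]

x = [0.1, -0.2, 0.3]
v, sigma = top_direction(jacobian(toy_feature_map, x))
x_edited = edit(x, v, alpha=0.5)
```

In this toy setting, `v` is the input direction to which the features are locally most sensitive. In a real DM the Jacobian would be far too large to form explicitly, so one would presumably rely on Jacobian-vector products from an autodiff library instead of finite differences.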

research
07/24/2023

Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry

Despite the success of diffusion models (DMs), we still lack a thorough ...
research
03/20/2023

Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models

Denoising Diffusion Models (DDMs) have emerged as a strong competitor to...
research
10/20/2022

Diffusion Models already have a Semantic Latent Space

Diffusion models achieve outstanding generative performance in various d...
research
11/21/2022

OrthoGAN: Multifaceted Semantics for Disentangled Face Editing

This paper describes a new technique for finding disentangled semantic d...
research
11/05/2020

Disentangling Latent Space for Unsupervised Semantic Face Editing

Editing facial images created by StyleGAN is a popular research topic wi...
research
07/06/2022

Towards Counterfactual Image Manipulation via CLIP

Leveraging StyleGAN's expressivity and its disentangled latent codes, ex...
research
08/11/2023

Head Rotation in Denoising Diffusion Models

Denoising Diffusion Models (DDM) are emerging as the cutting-edge techno...