Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models

03/20/2023
by   René Haas, et al.

Denoising Diffusion Models (DDMs) have emerged as a strong competitor to Generative Adversarial Networks (GANs). However, despite their widespread use in image synthesis and editing applications, their latent space is still not as well understood as that of GANs. Recently, a semantic latent space for DDMs, coined "h-space", was shown to facilitate semantic image editing in a way reminiscent of GANs. The h-space consists of the bottleneck activations in the DDM's denoiser across all timesteps of the diffusion process. In this paper, we explore the properties of h-space and propose several novel methods for finding meaningful semantic directions within it. We start by studying unsupervised methods for revealing interpretable semantic directions in pretrained DDMs. Specifically, we show that global latent directions emerge as the principal components in the latent space. Additionally, we provide a novel method for discovering image-specific semantic directions by spectral analysis of the Jacobian of the denoiser w.r.t. the latent code. Next, we extend the analysis by finding directions in a supervised fashion in unconditional DDMs. We demonstrate how such directions can be found by relying either on a labeled dataset of real images or on generated samples annotated with a domain-specific attribute classifier. We further show how to semantically disentangle the directions found by simple linear projection. Our approaches are applicable without requiring any architectural modifications, text-based guidance, CLIP-based optimization, or model fine-tuning.
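To make the supervised part of the abstract concrete, here is a minimal sketch in plain Python. It rests on assumptions the abstract does not confirm: the attribute direction is taken as the difference between the mean latent codes of two labeled groups, and "simple linear projection" is read as removing the component of one direction that lies along another. The three-dimensional vectors are toy stand-ins for h-space activations, and all function and variable names are hypothetical.

```python
from math import sqrt


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def attribute_direction(positive, negative):
    """Assumed estimator: difference of the two class means."""
    mean_pos, mean_neg = mean(positive), mean(negative)
    return [p - q for p, q in zip(mean_pos, mean_neg)]


def project_out(direction, unwanted):
    """Remove from `direction` its component along `unwanted`."""
    scale = dot(direction, unwanted) / dot(unwanted, unwanted)
    return [d - scale * u for d, u in zip(direction, unwanted)]


def normalize(v):
    """Rescale a non-zero vector to unit length."""
    norm = sqrt(dot(v, v))
    return [x / norm for x in v]


# Toy latent codes standing in for h-space activations (hypothetical data).
smiling = [[2.0, 1.0, 0.0], [4.0, 3.0, 0.0]]
neutral = [[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]]

# Raw direction: here it is entangled with the first coordinate.
smile_dir = attribute_direction(smiling, neutral)

# A second, unwanted direction (e.g. one found for another attribute).
age_dir = [1.0, 0.0, 0.0]

# Disentangled direction: orthogonal to age_dir after projection.
clean_dir = project_out(smile_dir, age_dir)
unit_dir = normalize(clean_dir)
```

In this toy setup the raw direction shares a component with `age_dir`, and the projection leaves a direction orthogonal to it. An edit would then shift a latent code along `unit_dir`. How the shift is applied across timesteps is not specified in the abstract and is left out here.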


