Head Rotation in Denoising Diffusion Models

08/11/2023
by Andrea Asperti, et al.

Denoising Diffusion Models (DDMs) are emerging as the cutting-edge technology in deep generative modeling, challenging the dominance of Generative Adversarial Networks. However, exploring the semantics of the latent space and identifying compelling trajectories for manipulating and editing important attributes of the generated samples remains difficult, primarily because the latent space is high-dimensional. In this study, we concentrate on face rotation, which is known to be one of the most intricate editing operations. By leveraging a recent embedding technique for Denoising Diffusion Implicit Models (DDIM), we achieve, in many cases, noteworthy manipulations spanning a wide rotation angle of ±30°, while preserving the distinct characteristics of the individual. Our methodology uses linear regression to compute trajectories that approximate clouds of latent representations of dataset samples with different yaw rotations. Specific trajectories are obtained by restricting the analysis to subsets of data that share significant attributes with the source image. One of these attributes is the light provenance: a byproduct of our research is a labeling of CelebA that categorizes images into three major groups based on the illumination direction: left, center, and right.
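The regression step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each image already has a flattened latent vector (for example from a DDIM embedding, not shown here) and a yaw label in degrees, and the names `fit_yaw_trajectory`, `rotate_latent`, and the boolean attribute mask are hypothetical. It fits a straight line through the latent cloud with ordinary least squares and moves a source latent along the fitted direction.

```python
import numpy as np

def fit_yaw_trajectory(latents, yaws):
    """Fit z ~ intercept + yaw * direction by least squares (hypothetical helper).

    latents: (n, d) array of flattened latent vectors (assumed precomputed).
    yaws:    (n,) array of yaw angles in degrees.
    """
    latents = np.asarray(latents, dtype=np.float64)
    yaws = np.asarray(yaws, dtype=np.float64)
    # Design matrix with a yaw column and a constant column for the intercept.
    design = np.stack([yaws, np.ones_like(yaws)], axis=1)      # (n, 2)
    coef, *_ = np.linalg.lstsq(design, latents, rcond=None)    # (2, d)
    direction, intercept = coef[0], coef[1]
    return direction, intercept

def rotate_latent(z_source, source_yaw, target_yaw, direction):
    """Shift a source latent along the fitted direction by the yaw difference."""
    return np.asarray(z_source, dtype=np.float64) + (target_yaw - source_yaw) * direction

# Synthetic stand-in data: 200 samples in a 16-dimensional latent space.
rng = np.random.default_rng(0)
yaws = rng.uniform(-30.0, 30.0, size=200)
true_direction = rng.normal(size=16)
latents = yaws[:, None] * true_direction + rng.normal(scale=0.1, size=(200, 16))

# Assumption: a boolean mask selects samples sharing an attribute with the
# source image (e.g. the same illumination group); here it is random.
same_attribute = rng.random(200) < 0.5
direction, intercept = fit_yaw_trajectory(latents[same_attribute], yaws[same_attribute])

# Move the first sample from its own yaw to a target yaw of +30 degrees.
z_rotated = rotate_latent(latents[0], yaws[0], 30.0, direction)
```

In this sketch the attribute mask plays the role of restricting the regression to a subset of the data; decoding `z_rotated` back to an image would require the generative model itself, which is outside the scope of the example.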


