Conditioning Diffusion Models via Attributes and Semantic Masks for Face Generation

06/01/2023
by   Nico Giambi, et al.
0

Deep generative models have shown impressive results in generating realistic images of faces. GANs managed to generate high-quality, high-fidelity images when conditioned on semantic masks, but they still lack the ability to diversify their output. Diffusion models partially solve this problem and are able to generate diverse samples given the same condition. In this paper, we propose a multi-conditioning approach for diffusion models via cross-attention exploiting both attributes and semantic masks to generate high-quality and controllable face images. We also studied the impact of applying perceptual-focused loss weighting into the latent space instead of the pixel space. Our method extends the previous approaches by introducing conditioning on more than one set of features, guaranteeing a more fine-grained control over the generated face images. We evaluate our approach on the CelebA-HQ dataset, and we show that it can generate realistic and diverse samples while allowing for fine-grained control over multiple attributes and semantic regions. Additionally, we perform an ablation study to evaluate the impact of different conditioning strategies on the quality and diversity of the generated images.

READ FULL TEXT

page 3

page 5

page 6

page 7

page 8

page 12

page 13

page 14

research
11/23/2022

CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields

Capitalizing on the recent advances in image generation models, existing...
research
08/22/2023

MatFuse: Controllable Material Generation with Diffusion Models

Creating high quality and realistic materials in computer graphics is a ...
research
06/16/2022

Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields

Capitalizing on the recent advances in image generation models, existing...
research
08/31/2020

GIF: Generative Interpretable Faces

Photo-realistic visualization and animation of expressive human faces ha...
research
07/05/2022

Latents2Segments: Disentangling the Latent Space of Generative Models for Semantic Segmentation of Face Images

With the advent of an increasing number of Augmented and Virtual Reality...
research
03/07/2023

DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer

Generating visual layouts is an essential ingredient of graphic design. ...
research
08/03/2023

On the Biometric Capacity of Generative Face Models

There has been tremendous progress in generating realistic faces with hi...

Please sign up or login with your details

Forgot password? Click here to reset