Unbiased Face Synthesis With Diffusion Models: Are We There Yet?

09/13/2023
by   Harrison Rosenberg, et al.
0

Text-to-image diffusion models have achieved widespread popularity due to their unprecedented image generation capability. In particular, their ability to synthesize and modify human faces has spurred research into using generated face images in both training data augmentation and model performance assessments. In this paper, we study the efficacy and shortcomings of generative models in the context of face generation. Utilizing a combination of qualitative and quantitative measures, including embedding-based metrics and user studies, we present a framework to audit the characteristics of generated faces conditioned on a set of social attributes. We applied our framework on faces generated through state-of-the-art text-to-image diffusion models. We identify several limitations of face image generation that include faithfulness to the text prompt, demographic disparities, and distributional shifts. Furthermore, we present an analytical model that provides insights into how training data selection contributes to the performance of generative models.

READ FULL TEXT

page 2

page 18

page 19

page 21

page 22

page 23

page 24

page 25

research
05/01/2019

Learn to synthesize and synthesize to learn

Attribute guided face image synthesis aims to manipulate attributes on a...
research
06/22/2022

Facke: a Survey on Generative Models for Face Swapping

In this work, we investigate into the performance of mainstream neural g...
research
10/02/2022

Generated Faces in the Wild: Quantitative Comparison of Stable Diffusion, Midjourney and DALL-E 2

The field of image synthesis has made great strides in the last couple o...
research
07/01/2023

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation

While large-scale pre-trained text-to-image models can synthesize divers...
research
06/11/2023

Face0: Instantaneously Conditioning a Text-to-Image Model on a Face

We present Face0, a novel way to instantaneously condition a text-to-ima...
research
08/31/2023

Detecting Out-of-Context Image-Caption Pairs in News: A Counter-Intuitive Method

The growth of misinformation and re-contextualized media in social media...
research
05/22/2023

The CLIP Model is Secretly an Image-to-Prompt Converter

The Stable Diffusion model is a prominent text-to-image generation model...

Please sign up or login with your details

Forgot password? Click here to reset