TIAM – A Metric for Evaluating Alignment in Text-to-Image Generation

07/11/2023
by Paul Grimal, et al.

Progress in synthetic image generation has made it crucial to assess image quality. Several metrics have been proposed to assess how well images are rendered, but Text-to-Image (T2I) models, which generate images from a prompt, call for additional criteria, such as the extent to which the generated image matches the important content of the prompt. Moreover, although generated images usually result from a random starting point, the influence of that starting point is generally not considered. In this article, we propose a new metric based on prompt templates to study the alignment between the content specified in the prompt and the corresponding generated images. It allows us to better characterize alignment in terms of the type of the specified objects, their number, and their color. We conducted a study of several recent T2I models across these aspects. One additional result obtained with our approach is that image quality can vary drastically depending on the latent noise used as a seed for the images. We also quantify the influence of the number of concepts in the prompt, their order, and their (color) attributes. Finally, our method allows us to identify latent seeds that produce better images than others, opening novel research directions on this understudied topic.
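A minimal sketch of how a template-based alignment score of this general kind might be computed is given here. It is not the definition of TIAM, which the abstract does not spell out. The sketch assumes a prompt template with object slots, a fixed set of seeds, and a success criterion under which a (prompt, seed) pair counts as aligned only when every requested object is detected in the image. The names `build_prompts`, `alignment_scores`, `fake_generate`, and `fake_detect`, as well as the template wording, are hypothetical; the last two are toy stand-ins for a real T2I model and a real object detector.

```python
from collections import defaultdict
from itertools import permutations

OBJECTS = ["cat", "dog", "car"]  # illustrative object labels (assumption)


def build_prompts(template, objects, n_slots):
    """Fill the template with every ordered tuple of distinct objects."""
    return [(combo, template.format(*combo))
            for combo in permutations(objects, n_slots)]


def alignment_scores(prompts, seeds, generate, detect):
    """Return (overall score, per-seed scores).

    A (prompt, seed) pair is a success only if every object requested
    in the prompt is found by the detector in the generated image.
    """
    per_seed = defaultdict(list)
    for expected, prompt in prompts:
        for seed in seeds:
            image = generate(prompt, seed)
            found = set(detect(image))
            per_seed[seed].append(set(expected) <= found)
    seed_scores = {s: sum(v) / len(v) for s, v in per_seed.items()}
    n_success = sum(sum(v) for v in per_seed.values())
    n_total = sum(len(v) for v in per_seed.values())
    return n_success / n_total, seed_scores


def fake_generate(prompt, seed):
    """Toy stand-in for a T2I model: the 'image' just records its inputs."""
    return {"prompt": prompt, "seed": seed}


def fake_detect(image):
    """Toy stand-in for a detector: odd seeds 'lose' all but one object."""
    mentioned = [o for o in OBJECTS if o in image["prompt"]]
    return mentioned if image["seed"] % 2 == 0 else mentioned[:1]


if __name__ == "__main__":
    prompts = build_prompts("a photo of a {} and a {}", OBJECTS, 2)
    overall, by_seed = alignment_scores(prompts, [0, 1, 2, 3],
                                        fake_generate, fake_detect)
    print(overall)
    print(by_seed)
```

Keeping the per-seed scores separate from the overall score is what makes it possible to compare seeds against each other, in the spirit of the seed-dependence finding described above. Extending the success criterion to object counts or colors would require a detector that reports those attributes.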


