Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models

11/17/2022
by Ninareh Mehrabi, et al.

Natural language often contains ambiguities that can lead to misinterpretation and miscommunication. While humans can handle ambiguities effectively by asking clarifying questions and/or relying on contextual cues and common-sense knowledge, resolving ambiguities can be notoriously hard for machines. In this work, we study ambiguities that arise in text-to-image generative models. We curate a benchmark dataset covering different types of ambiguities that occur in these systems. We then propose a framework to mitigate ambiguities in the prompts given to the systems by soliciting clarifications from the user. Through automatic and human evaluations, we show the effectiveness of our framework in generating more faithful images aligned with human intention in the presence of ambiguities.


