Composition and Deformance: Measuring Imageability with a Text-to-Image Model

06/05/2023
by Si Wu, et al.

Although psycholinguists and psychologists have long studied the tendency of linguistic strings to evoke mental images in hearers or readers, most computational studies have applied this concept of imageability only to isolated words. Using recent developments in text-to-image generation models, such as DALL·E mini, we propose computational methods that use generated images to measure the imageability of both single English words and connected text. We sample text prompts for image generation from three corpora: human-generated image captions, news article sentences, and poem lines. We subject these prompts to different deformances to examine the model's ability to detect changes in imageability caused by compositional change. We find a high correlation between the proposed computational measures of imageability and human judgments of individual words. We also find that the proposed measures respond more consistently to changes in compositionality than baseline approaches do. We discuss possible effects of model training and implications for the study of compositionality in text-to-image models.
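The abstract does not say how the image-based measures are computed, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes that images generated from a more imageable prompt resemble one another more closely, and it scores a prompt by the mean pairwise cosine similarity of its image embeddings. The names `homogeneity` and `shuffle_deformance` are hypothetical, word shuffling is just one possible deformance, and the embeddings are toy vectors standing in for the output of a real image encoder.

```python
import math
import random
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def homogeneity(embeddings):
    """Mean pairwise cosine similarity across a set of image embeddings.

    Assumption (not from the source): a higher value is read as a proxy
    for higher imageability of the prompt that produced the images.
    """
    pairs = list(combinations(embeddings, 2))
    if not pairs:
        raise ValueError("need at least two embeddings")
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


def shuffle_deformance(prompt, seed=0):
    """Hypothetical deformance: permute the words of a prompt.

    The words are preserved but the composition is disturbed, so the
    scores of the original and deformed prompts can be compared.
    """
    words = prompt.split()
    random.Random(seed).shuffle(words)
    return " ".join(words)


# Toy embeddings standing in for images generated from two prompts.
tight = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.05]]   # images look alike
loose = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]   # images diverge

print(homogeneity(tight) > homogeneity(loose))
print(shuffle_deformance("a red bird sits on a wooden fence", seed=1))
```

In a real pipeline, each prompt and its deformed variant would be sent to the text-to-image model several times, the resulting images embedded, and the two scores compared.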


research
10/05/2018

CanvasGAN: A simple baseline for text to image generation by incrementally patching a canvas

We propose a new recurrent generative model for generating images from t...
research
08/04/2022

Adversarial Attacks on Image Generation With Made-Up Words

Text-guided image generation models can be prompted to generate images u...
research
05/13/2022

The Creativity of Text-to-Image Generation

Text-to-image synthesis has made a giant leap towards becoming a mainstr...
research
07/12/2023

T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Despite the stunning ability to generate high-quality images by recent t...
research
07/29/2022

Testing Relational Understanding in Text-Guided Image Generation

Relations are basic building blocks of human cognition. Classic and rece...
research
09/22/2022

Implementing and Experimenting with Diffusion Models for Text-to-Image Generation

Taking advantage of the many recent advances in deep learning, text-to-i...
research
05/26/2023

Stereotypes and Smut: The (Mis)representation of Non-cisgender Identities by Text-to-Image Models

Cutting-edge image generation has been praised for producing high-qualit...
