Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models

10/19/2022
by   Ricardo Kleinlein, et al.
0

The impressive capacity shown by recent text-to-image diffusion models to generate high-quality pictures from textual input prompts has leveraged the debate about the very definition of art. Nonetheless, these models have been trained using text data collected from content-based labelling protocols that focus on describing the items and actions in an image but neglect any subjective appraisal. Consequently, these automatic systems need rigorous descriptions of the elements and the pictorial style of the image to be generated, otherwise failing to deliver. As potential indicators of the actual artistic capabilities of current generative models, we characterise the sentimentality, objectiveness and degree of abstraction of publicly available text data used to train current text-to-image diffusion models. Considering the sharp difference observed between their language style and that typically employed in artistic contexts, we suggest generative models should incorporate additional sources of subjective information in their training in order to overcome (or at least to alleviate) some of their current limitations, thus effectively unleashing a truly artistic and creative generation.

READ FULL TEXT

page 5

page 6

research
11/13/2020

Diffusion models for Handwriting Generation

In this paper, we propose a diffusion probabilistic model for handwritin...
research
04/26/2023

Training-Free Location-Aware Text-to-Image Synthesis

Current large-scale generative models have impressive efficiency in gene...
research
09/17/2022

Can segmentation models be trained with fully synthetically generated data?

In order to achieve good performance and generalisability, medical image...
research
06/01/2023

Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models

Generative models have recently exhibited exceptional capabilities in va...
research
09/11/2023

PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud

Text-to-image synthesis for the Chinese language poses unique challenges...
research
09/06/2023

My Art My Choice: Adversarial Protection Against Unruly AI

Generative AI is on the rise, enabling everyone to produce realistic con...
research
05/27/2023

The Curse of Recursion: Training on Generated Data Makes Models Forget

Stable Diffusion revolutionised image creation from descriptive text. GP...

Please sign up or login with your details

Forgot password? Click here to reset