Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales

02/17/2023
by   Martin Ruskov, et al.
0

The quality of text-to-image generation is continuously improving, yet the boundaries of its applicability are still unclear. In particular, refinement of the text input with the objective of achieving better results - commonly called prompt engineering - so far seems to have not been geared towards work with pre-existing texts. We investigate whether text-to-image generation and prompt engineering could be used to generate basic illustrations of popular fairytales. Using Midjourney v4, we engage in action research with a dual aim: to attempt to generate 5 believable illustrations for each of 5 popular fairytales, and to define a prompt engineering process that starts from a pre-existing text and arrives at an illustration of it. We arrive at a tentative 4-stage process: i) initial prompt, ii) composition adjustment, iii) style refinement, and iv) variation selection. We also discuss three reasons why the generation model struggles with certain illustrations: difficulties with counts, bias from stereotypical configurations and inability to depict overly fantastic situations. Our findings are not limited to the specific generation model and are intended to be generalisable to future ones.

READ FULL TEXT

page 6

page 7

page 9

page 10

page 11

page 12

research
05/13/2022

The Creativity of Text-to-Image Generation

Text-to-image synthesis has made a giant leap towards becoming a mainstr...
research
03/24/2022

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Recent text-to-image generation methods provide a simple yet exciting co...
research
03/13/2023

Prompting AI Art: An Investigation into the Creative Skill of Prompt Engineering

Humankind is entering a novel era of creativity - an era in which anybod...
research
06/20/2023

The Cultivated Practices of Text-to-Image Generation

Humankind is entering a novel creative era in which anybody can synthesi...
research
11/04/2022

Rickrolling the Artist: Injecting Invisible Backdoors into Text-Guided Image Generation Models

While text-to-image synthesis currently enjoys great popularity among re...
research
11/24/2018

Keep Drawing It: Iterative language-based image generation and editing

Conditional text-to-image generation approaches commonly focus on genera...
research
12/19/2022

Optimizing Prompts for Text-to-Image Generation

Well-designed prompts can guide text-to-image models to generate amazing...

Please sign up or login with your details

Forgot password? Click here to reset