WorldSmith: Iterative and Expressive Prompting for World Building with a Generative AI

08/25/2023
by   Hai Dang, et al.
0

Crafting a rich and unique environment is crucial for fictional world-building, but can be difficult to achieve since illustrating a world from scratch requires time and significant skill. We investigate the use of recent multi-modal image generation systems to enable users iteratively visualize and modify elements of their fictional world using a combination of text input, sketching, and region-based filling. WorldSmith enables novice world builders to quickly visualize a fictional world with layered edits and hierarchical compositions. Through a formative study (4 participants) and first-use study (13 participants) we demonstrate that WorldSmith offers more expressive interactions with prompt-based models. With this work, we explore how creatives can be empowered to leverage prompt-based generative AI as a tool in their creative process, beyond current "click-once" prompting UI paradigms.

READ FULL TEXT

page 1

page 5

page 6

page 7

page 10

page 12

page 17

research
03/10/2023

Text-to-Image Generation: Perceptions and Realities

Generative AI is an emerging technology that will have a profound impact...
research
08/09/2023

PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions

While diffusion-based text-to-image (T2I) models provide a simple and po...
research
03/13/2023

Prompting AI Art: An Investigation into the Creative Skill of Prompt Engineering

Humankind is entering a novel era of creativity - an era in which anybod...
research
10/20/2022

3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows

Text-to-image AI systems are capable of generating novel images for insp...
research
10/19/2021

A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation

A creative image-and-text generative AI system mimics humans' extraordin...
research
12/22/2022

Multi-Lingual DALL-E Storytime

While recent advancements in artificial intelligence (AI) language model...
research
07/29/2022

Testing Relational Understanding in Text-Guided Image Generation

Relations are basic building blocks of human cognition. Classic and rece...

Please sign up or login with your details

Forgot password? Click here to reset