PromptMagician: Interactive Prompt Engineering for Text-to-Image Creation

07/18/2023
by Yingchaojie Feng, et al.

Generative text-to-image models have gained great popularity among the public for their powerful capability to generate high-quality images based on natural language prompts. However, developing effective prompts for desired images can be challenging due to the complexity and ambiguity of natural language. This research proposes PromptMagician, a visual analysis system that helps users explore the image results and refine the input prompts. The backbone of our system is a prompt recommendation model that takes user prompts as input, retrieves similar prompt-image pairs from DiffusionDB, and identifies special (important and relevant) prompt keywords. To facilitate interactive prompt refinement, PromptMagician introduces a multi-level visualization for the cross-modal embedding of the retrieved images and recommended keywords, and supports users in specifying multiple criteria for personalized exploration. Two usage scenarios, a user study, and expert interviews demonstrate the effectiveness and usability of our system, suggesting it facilitates prompt engineering and improves the creativity support of the generative text-to-image model.
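The retrieve-then-recommend idea in the abstract can be illustrated with a small, self-contained sketch. This is not the authors' implementation. It assumes a five-entry in-memory corpus in place of DiffusionDB, bag-of-words cosine similarity in place of a learned cross-modal embedding, and a simple frequency-ratio score for keyword importance. All names (`CORPUS`, `retrieve`, `recommend_keywords`, the image file names) are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus standing in for DiffusionDB prompt-image pairs.
CORPUS = [
    ("a castle on a hill, oil painting, dramatic lighting", "img_001.png"),
    ("a castle in the clouds, digital art, dramatic lighting", "img_002.png"),
    ("portrait of a cat, studio photo", "img_003.png"),
    ("a medieval castle at sunset, oil painting", "img_004.png"),
    ("a bowl of fruit, studio photo", "img_005.png"),
]

# Assumed stopword list; a real system would use a fuller one.
STOPWORDS = {"a", "an", "the", "of", "on", "in", "at"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=3):
    """Return the k (score, prompt, image) triples most similar to the query."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(p))), p, img) for p, img in corpus]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:k]


def recommend_keywords(query, corpus, k=3, top_n=3):
    """Rank words that are frequent in the retrieved prompts (important)
    and rare elsewhere in the corpus (relevant to this query)."""
    hits = retrieve(query, corpus, k)
    query_tokens = set(tokenize(query))

    # Document frequency of each candidate word inside the retrieved prompts.
    in_hits = Counter()
    for _, prompt, _ in hits:
        in_hits.update(set(tokenize(prompt)) - query_tokens - STOPWORDS)

    # Document frequency of every word across the whole corpus.
    in_all = Counter()
    for prompt, _ in corpus:
        in_all.update(set(tokenize(prompt)))

    scores = {
        word: (count / len(hits)) * (count / in_all[word])
        for word, count in in_hits.items()
    }
    # Sort by descending score, breaking ties alphabetically.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_n]


if __name__ == "__main__":
    for score, prompt, image in retrieve("a castle", CORPUS):
        print(f"{score:.3f}  {image}  {prompt}")
    print(recommend_keywords("a castle", CORPUS))
```

In this toy setup the score multiplies two ratios: how often a word appears among the retrieved prompts, and what share of its corpus-wide occurrences fall inside that retrieved set. That is only one plausible reading of "important and relevant"; the abstract does not specify the actual scoring.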

