DialogPaint: A Dialog-based Image Editing Model

03/17/2023
by   Jingxuan Wei, et al.
1

We present DialogPaint, an innovative framework that employs an interactive conversational approach for image editing. The framework comprises a pretrained dialogue model (Blenderbot) and a diffusion model (Stable Diffusion). The dialogue model engages in conversation with users to understand their requirements and generates concise instructions based on the dialogue. Subsequently, the Stable Diffusion model employs these instructions, along with the input image, to produce the desired output. Due to the difficulty of acquiring fine-tuning data for such models, we leverage multiple large-scale models to generate simulated dialogues and corresponding image pairs. After fine-tuning our framework with the synthesized data, we evaluate its performance in real application scenes. The results demonstrate that DialogPaint excels in both objective and subjective evaluation metrics effectively handling ambiguous instructions and performing tasks such as object replacement, style transfer, color modification. Moreover, our framework supports multi-round editing, allowing for the completion of complicated editing tasks.

READ FULL TEXT

page 4

page 7

page 8

research
11/17/2022

InstructPix2Pix: Learning to Follow Image Editing Instructions

We propose a method for editing images from human instructions: given an...
research
05/21/2023

InstructVid2Vid: Controllable Video Editing with Natural Language Instructions

We present an end-to-end diffusion-based method for editing videos with ...
research
06/01/2023

Differential Diffusion: Giving Each Pixel Its Strength

Text-based image editing has advanced significantly in recent years. Wit...
research
07/25/2023

Fashion Matrix: Editing Photos by Just Talking

The utilization of Large Language Models (LLMs) for the construction of ...
research
03/30/2023

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Image editing using diffusion models has witnessed extremely fast-paced ...
research
07/10/2023

FreeDrag: Point Tracking is Not What You Need for Interactive Point-based Image Editing

To serve the intricate and varied demands of image editing, precise and ...
research
07/27/2023

Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields

With the popularity of implicit neural representations, or neural radian...

Please sign up or login with your details

Forgot password? Click here to reset