MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

06/16/2023
by   Kai Zhang, et al.
0

Text-guided image editing is widely needed in daily life, ranging from personal use to professional applications such as Photoshop. However, existing methods are either zero-shot or trained on an automatically synthesized dataset, which contains a high volume of noise. Thus, they still require lots of manual tuning to produce desirable outcomes in practice. To address this issue, we introduce MagicBrush (https://osu-nlp-group.github.io/MagicBrush/), the first large-scale, manually annotated dataset for instruction-guided real image editing that covers diverse scenarios: single-turn, multi-turn, mask-provided, and mask-free editing. MagicBrush comprises over 10K manually annotated triples (source image, instruction, target image), which supports trainining large-scale text-guided image editing models. We fine-tune InstructPix2Pix on MagicBrush and show that the new model can produce much better images according to human evaluation. We further conduct extensive experiments to evaluate current image editing baselines from multiple dimensions including quantitative, qualitative, and human evaluations. The results reveal the challenging nature of our dataset and the gap between current baselines and real-world editing needs.

READ FULL TEXT

page 1

page 3

page 6

page 8

page 19

page 20

page 21

research
05/29/2023

InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions

Recent works have explored text-guided image editing using diffusion mod...
research
04/17/2023

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing

Despite the success in large-scale text-to-image generation and text-con...
research
07/20/2023

OBJECT 3DIT: Language-guided 3D-aware Image Editing

Existing image editing tools, while powerful, typically disregard the un...
research
02/24/2022

CAISE: Conversational Agent for Image Search and Editing

Demand for image editing has been increasing as users' desire for expres...
research
12/15/2022

Text-guided mask-free local image retouching

In the realm of multi-modality, text-guided image retouching techniques ...
research
05/17/2021

SHARE: a System for Hierarchical Assistive Recipe Editing

We introduce SHARE: a System for Hierarchical Assistive Recipe Editing t...
research
05/07/2020

Nakdan: Professional Hebrew Diacritizer

We present a system for automatic diacritization of Hebrew text. The sys...

Please sign up or login with your details

Forgot password? Click here to reset