Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation

04/13/2023
by   Jaemin Cho, et al.
4

Spatial control is a core capability in controllable image generation. Advancements in layout-guided image generation have shown promising results on in-distribution (ID) datasets with similar spatial configurations. However, it is unclear how these models perform when facing out-of-distribution (OOD) samples with arbitrary, unseen layouts. In this paper, we propose LayoutBench, a diagnostic benchmark for layout-guided image generation that examines four categories of spatial control skills: number, position, size, and shape. We benchmark two recent representative layout-guided image generation methods and observe that the good ID layout control may not generalize well to arbitrary layouts in the wild (e.g., objects at the boundary). Next, we propose IterInpaint, a new baseline that generates foreground and background regions in a step-by-step manner via inpainting, demonstrating stronger generalizability than existing models on OOD layouts in LayoutBench. We perform quantitative and qualitative evaluation and fine-grained analysis on the four LayoutBench skills to pinpoint the weaknesses of existing models. Lastly, we show comprehensive ablation studies on IterInpaint, including training task ratio, crop paste vs. repaint, and generation order. Project website: https://layoutbench.github.io

READ FULL TEXT

page 12

page 15

page 17

page 18

page 19

page 20

page 21

page 22

research
05/24/2023

Visual Programming for Text-to-Image Generation and Evaluation

As large language models have demonstrated impressive performance in man...
research
07/06/2021

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis

Despite significant progress on current state-of-the-art image generatio...
research
08/09/2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

In the text-to-image generation field, recent remarkable progress in Sta...
research
03/25/2021

AttrLostGAN: Attribute Controlled Image Synthesis from Reconfigurable Layout and Style

Conditional image synthesis from layout has recently attracted much inte...
research
12/06/2019

cFineGAN: Unsupervised multi-conditional fine-grained image generation

We propose an unsupervised multi-conditional image generation pipeline: ...
research
12/25/2019

Controllable and Progressive Image Extrapolation

Image extrapolation aims at expanding the narrow field of view of a give...
research
04/15/2021

Spectrogram Inpainting for Interactive Generation of Instrument Sounds

Modern approaches to sound synthesis using deep neural networks are hard...

Please sign up or login with your details

Forgot password? Click here to reset