Text2Layer: Layered Image Generation using Latent Diffusion Model

07/19/2023
by   Xinyang Zhang, et al.
0

Layer compositing is one of the most popular image editing workflows among both amateurs and professionals. Motivated by the success of diffusion models, we explore layer compositing from a layered image generation perspective. Instead of generating an image, we propose to generate background, foreground, layer mask, and the composed image simultaneously. To achieve layered image generation, we train an autoencoder that is able to reconstruct layered images and train diffusion models on the latent representation. One benefit of the proposed problem is to enable better compositing workflows in addition to the high-quality image output. Another benefit is producing higher-quality layer masks compared to masks produced by a separate step of image segmentation. Experimental results show that the proposed method is able to generate high-quality layered images and initiates a benchmark for future work.

READ FULL TEXT

page 1

page 2

page 4

page 7

page 11

page 14

page 15

page 16

research
08/06/2021

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models

Denoising diffusion probabilistic models (DDPM) have shown remarkable pe...
research
12/01/2022

3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models

Diffusion models have shown great promise for image generation, beating ...
research
08/18/2023

HumanLiff: Layer-wise 3D Human Generation with Diffusion Model

3D human generation from 2D images has achieved remarkable progress thro...
research
12/02/2021

GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation

Segmenting an image into its parts is a frequent preprocess for high-lev...
research
10/10/2022

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation

Diffusion models (DMs) have recently emerged as SoTA tools for generativ...
research
06/27/2022

Diffusion Deformable Model for 4D Temporal Medical Image Generation

Temporal volume images with 3D+t (4D) information are often used in medi...
research
12/13/2018

Vector Image Generation by Learning Parametric Layer Decomposition

Deep image generation is becoming a tool to enhance artists and designer...

Please sign up or login with your details

Forgot password? Click here to reset