ObjectStitch: Generative Object Compositing

12/02/2022
by   Yizhi Song, et al.
0

Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annotating training data pairs for compositing requires substantial manual effort from professionals, and is hardly scalable. Thus, with the recent advances in generative models, in this work, we propose a self-supervised framework for object compositing by leveraging the power of conditional diffusion models. Our framework can hollistically address the object compositing task in a unified model, transforming the viewpoint, geometry, color and shadow of the generated object while requiring no manual labeling. To preserve the input object's characteristics, we introduce a content adaptor that helps to maintain categorical semantics and object appearance. A data augmentation method is further adopted to improve the fidelity of the generator. Our method outperforms relevant baselines in both realism and faithfulness of the synthesized result images in a user study on various real-world images.

READ FULL TEXT

page 1

page 5

page 7

page 13

page 14

page 15

page 16

page 17

research
08/15/2021

SSH: A Self-Supervised Framework for Image Harmonization

Image harmonization aims to improve the quality of image compositing by ...
research
06/29/2023

Generate Anything Anywhere in Any Scene

Text-to-image diffusion models have attracted considerable interest due ...
research
03/29/2022

Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images

Learning 3D generative models from a dataset of monocular images enables...
research
05/22/2023

Phased data augmentation for training PixelCNNs with VQ-VAE-2 and limited data

With development of deep learning, researchers have developed generative...
research
05/03/2023

AG3D: Learning to Generate 3D Avatars from 2D Image Collections

While progress in 2D generative models of human appearance has been rapi...
research
02/08/2023

Neural Congealing: Aligning Images to a Joint Semantic Atlas

We present Neural Congealing – a zero-shot self-supervised framework for...
research
11/05/2022

Local Manifold Augmentation for Multiview Semantic Consistency

Multiview self-supervised representation learning roots in exploring sem...

Please sign up or login with your details

Forgot password? Click here to reset