Object-Centric Image Generation with Factored Depths, Locations, and Appearances

04/01/2020
by   Titas Anciukevicius, et al.
0

We present a generative model of images that explicitly reasons over the set of objects they show. Our model learns a structured latent representation that separates objects from each other and from the background; unlike prior works, it explicitly represents the 2D position and depth of each object, as well as an embedding of its segmentation mask and appearance. The model can be trained from images alone in a purely unsupervised fashion without the need for object masks or depth information. Moreover, it always generates complete objects, even though a significant fraction of training images contain occlusions. Finally, we show that our model can infer decompositions of novel images into their constituent objects, including accurate prediction of depth ordering and segmentation of occluded parts.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 9

page 10

page 11

page 13

research
03/29/2017

SeGAN: Segmenting and Generating the Invisible

Objects often occlude each other in scenes; Inferring their appearance b...
research
12/02/2021

GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation

Segmenting an image into its parts is a frequent preprocess for high-lev...
research
11/21/2022

Compositional Scene Modeling with Global Object-Centric Representations

The appearance of the same object may vary in different scene images due...
research
12/10/2022

Source-free Depth for Object Pop-out

Depth cues are known to be useful for visual perception. However, direct...
research
03/15/2019

Generate What You Can't See - a View-dependent Image Generation

In order to operate autonomously, a robot should explore the environment...
research
06/05/2018

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects

We present Sequential Attend, Infer, Repeat (SQAIR), an interpretable de...
research
11/19/2018

SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint

We present a novel approach to image manipulation and understanding by s...

Please sign up or login with your details

Forgot password? Click here to reset