BlobGAN: Spatially Disentangled Scene Representations

05/05/2022
by Dave Epstein, et al.

We propose an unsupervised, mid-level representation for a generative model of scenes. The representation is mid-level in that it is neither per-pixel nor per-image; rather, scenes are modeled as a collection of spatial, depth-ordered "blobs" of features. Blobs are differentiably placed onto a feature grid that is decoded into an image by a generative adversarial network. Due to the spatial uniformity of blobs and the locality inherent to convolution, our network learns to associate different blobs with different entities in a scene and to arrange these blobs to capture scene layout. We demonstrate this emergent behavior by showing that, despite training without any supervision, our method enables applications such as easy manipulation of objects within a scene (e.g., moving, removing, and restyling furniture), creation of feasible scenes given constraints (e.g., plausible rooms with drawers at a particular location), and parsing of real-world images into constituent parts. On a challenging multi-category dataset of indoor scenes, BlobGAN outperforms StyleGAN2 in image quality as measured by FID. See our project page for video results and an interactive demo: http://www.dave.ml/blobgan
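To make the idea of placing depth-ordered blobs onto a feature grid concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the blob shape or the compositing rule, so the circular blobs, the sigmoid opacity falloff, the `sharpness` parameter, and the function name `splat_blobs` are all hypothetical choices. The sketch only shows how a smooth opacity lets blob positions influence the grid continuously, and how drawing blobs back to front gives a depth order.

```python
import math


def splat_blobs(blobs, height, width, background, sharpness=0.02):
    """Alpha-composite soft circular blobs onto a feature grid, back to front.

    Assumed (hypothetical) blob format: a dict with 'x' and 'y' (center in
    normalized [0, 1] coordinates), 'size' (radius in the same units), and
    'feature' (list of floats). List order is depth order: later blobs are
    composited over earlier ones.
    """
    dim = len(background)
    # Start from a grid where every cell holds a copy of the background feature.
    grid = [[list(background) for _ in range(width)] for _ in range(height)]
    for blob in blobs:
        for row in range(height):
            for col in range(width):
                # Cell center in normalized coordinates.
                py = (row + 0.5) / height
                px = (col + 0.5) / width
                dist = math.hypot(px - blob["x"], py - blob["y"])
                # Smooth opacity: close to 1 inside the blob, close to 0 outside.
                # The sigmoid is an assumed choice that keeps opacity a smooth
                # function of the blob's position and size.
                alpha = 1.0 / (1.0 + math.exp((dist - blob["size"]) / sharpness))
                cell = grid[row][col]
                for k in range(dim):
                    # Standard "over" compositing of the blob feature.
                    cell[k] = alpha * blob["feature"][k] + (1.0 - alpha) * cell[k]
    return grid


# Usage: two blobs on an 8x8 grid with 2-channel features.
scene = [
    {"x": 0.25, "y": 0.25, "size": 0.2, "feature": [1.0, 0.0]},
    {"x": 0.75, "y": 0.75, "size": 0.2, "feature": [0.0, 1.0]},
]
features = splat_blobs(scene, height=8, width=8, background=[0.0, 0.0])
```

In this sketch, moving an entity amounts to changing a blob's `x` and `y`, and removing it amounts to dropping the blob from the list. In the model described above, a feature grid of this kind is then decoded into an image by the generator, which is not shown here.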


