CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding

03/07/2023
by   Jingyu Liu, et al.
0

Indoor scene synthesis involves automatically picking and placing furniture appropriately on a floor plan, so that the scene looks realistic and is functionally plausible. Such scenes can serve as a home for immersive 3D experiences, or be used to train embodied agents. Existing methods for this task rely on labeled categories of furniture, e.g. bed, chair or table, to generate contextually relevant combinations of furniture. Whether heuristic or learned, these methods ignore instance-level attributes of objects such as color and style, and as a result may produce visually less coherent scenes. In this paper, we introduce an auto-regressive scene model which can output instance-level predictions, making use of general purpose image embedding based on CLIP. This allows us to learn visual correspondences such as matching color and style, and produce more plausible and aesthetically pleasing scenes. Evaluated on the 3D-FRONT dataset, our model achieves SOTA results in scene generation and improves auto-completion metrics by over 50 embedding-based approach enables zero-shot text-guided scene generation and editing, which easily generalizes to furniture not seen at training time.

READ FULL TEXT

page 1

page 5

page 7

page 11

page 12

page 13

research
07/24/2018

GRAINS: Generative Recursive Autoencoders for INdoor Scenes

We present a generative neural network which enables us to generate plau...
research
03/24/2023

DiffuScene: Scene Graph Denoising Diffusion Probabilistic Model for Generative Indoor Scene Synthesis

We present DiffuScene for indoor 3D scene synthesis based on a novel sce...
research
03/09/2020

Style-compatible Object Recommendation for Multi-room Indoor Scene Synthesis

Traditional indoor scene synthesis methods often take a two-step approac...
research
01/23/2015

Automatic Objects Removal for Scene Completion

With the explosive growth of web-based cameras and mobile devices, billi...
research
11/10/2021

LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

Learning-based methods for training embodied agents typically require a ...
research
02/28/2017

A Data-driven Approach for Furniture and Indoor Scene Colorization

We present a data-driven approach that colorizes 3D furniture models and...
research
09/13/2014

Structure Preserving Large Imagery Reconstruction

With the explosive growth of web-based cameras and mobile devices, billi...

Please sign up or login with your details

Forgot password? Click here to reset