Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional Inputs

04/25/2023
by   Mizuki Tabata, et al.
0

Estimating the layout of a room from a single-shot panoramic image is important in virtual/augmented reality and furniture layout simulation. This involves identifying three-dimensional (3D) geometry, such as the location of corners and boundaries, and performing 3D reconstruction. However, occlusion is a common issue that can negatively impact room layout estimation, and this has not been thoroughly studied to date. It is possible to obtain 3D shape information of rooms as drawings of buildings and coordinates of corners from image datasets, thus we propose providing both 2D panoramic and 3D information to a model to effectively deal with occlusion. However, simply feeding 3D information to a model is not sufficient to utilize the shape information for an occluded area. Therefore, we improve the model by introducing 3D Intersection over Union (IoU) loss to effectively use 3D information. In some cases, drawings are not available or the construction deviates from a drawing. Considering such practical cases, we propose a method for distilling knowledge from a model trained with both images and 3D information to a model that takes only images as input. The proposed model, which is called Shape-Net, achieves state-of-the-art (SOTA) performance on benchmark datasets. We also confirmed its effectiveness in dealing with occlusion through significantly improved accuracy on images with occlusion compared with existing models.

READ FULL TEXT

page 4

page 5

page 6

page 7

research
05/29/2019

Flat2Layout: Flat Representation for Estimating Layout of General Room Types

This paper proposes a new approach, Flat2Layout, for estimating general ...
research
04/01/2021

LED2-Net: Monocular 360 Layout Estimation via Differentiable Depth Rendering

Although significant progress has been made in room layout estimation, m...
research
03/03/2022

LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network

3D room layout estimation by a single panorama using deep neural network...
research
03/23/2018

LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image

We propose an algorithm to predict room layout from a single image that ...
research
03/02/2020

Inferring the location of reflecting surfaces exploiting loudspeaker directivity

Accurate sound field reproduction in rooms is often limited by the lack ...
research
12/08/2022

Occlusion-Robust FAU Recognition by Mining Latent Space of Masked Autoencoders

Facial action units (FAUs) are critical for fine-grained facial expressi...
research
03/15/2021

GRIHA: Synthesizing 2-Dimensional Building Layouts from Images Captured using a Smart Phone

Reconstructing an indoor scene and generating a layout/floor plan in 3D ...

Please sign up or login with your details

Forgot password? Click here to reset