360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning

10/24/2022
by Bolivar Solarte, et al.

We present 360-MLC, a self-training method based on multi-view layout consistency for fine-tuning monocular room-layout models using only unlabeled 360° images. This is valuable in practical scenarios where a pre-trained model must be adapted to a new data domain without any ground truth annotations. Our simple yet effective assumption is that multiple layout estimations of the same scene must define a consistent geometry regardless of their camera positions. Based on this idea, we leverage a pre-trained model to project estimated layout boundaries from several camera views into 3D world coordinates. We then re-project them back into spherical coordinates and build a probability distribution, from which we sample pseudo-labels for self-training. To handle low-confidence pseudo-labels, we use the variance of the re-projected boundaries as an uncertainty value that weights each pseudo-label in our loss function during training. In addition, since ground truth annotations are available neither during training nor at test time, we leverage the entropy of multiple layout estimations as a quantitative metric of the scene's geometric consistency, allowing us to evaluate any layout estimator for hyper-parameter tuning, including model selection, without ground truth annotations. Experimental results show that our solution achieves favorable performance against state-of-the-art methods when self-training from three publicly available source datasets to a unique, newly labeled dataset consisting of multiple views of the same scenes.
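The aggregation step can be sketched as follows. The snippet below is a minimal NumPy illustration of the multi-view consistency idea, not the authors' released code: each view's estimated floor boundary is lifted to 3D world coordinates, re-projected into a reference panorama, and the per-column mean and variance of the re-projected boundaries serve as the pseudo-label and its uncertainty weight; the per-column variance also yields an entropy-style consistency score. The function names, the fixed camera height, and the assumption of gravity-aligned, translation-only camera poses are illustrative simplifications.

```python
import numpy as np

W = 1024          # number of panorama columns (azimuth bins)
CAM_H = 1.6       # assumed camera height above the floor (meters)

def boundary_to_world(theta, cam_t):
    """Lift a floor boundary (polar angle below the horizon per column) to 3D floor points.
    Assumes gravity-aligned panoramas, so each pose reduces to a translation cam_t."""
    phi = np.linspace(-np.pi, np.pi, W, endpoint=False)      # azimuth of each column
    r = CAM_H / np.tan(theta)                                # horizontal distance to the wall
    pts = np.stack([r * np.cos(phi), r * np.sin(phi), np.full(W, -CAM_H)], axis=1)
    return pts + cam_t                                       # camera frame -> world frame

def world_to_boundary(pts, cam_t):
    """Re-project world floor points into a reference panorama as (column, polar angle)."""
    p = pts - cam_t
    phi = np.arctan2(p[:, 1], p[:, 0])
    r = np.linalg.norm(p[:, :2], axis=1)
    theta = np.arctan2(-p[:, 2], r)                          # angle below the horizon
    cols = ((phi + np.pi) / (2 * np.pi) * W).astype(int) % W
    return cols, theta

def aggregate_pseudo_label(boundaries, cam_ts, ref_idx=0):
    """Fuse boundary estimates from several views into a pseudo-label and its uncertainty."""
    samples = [[] for _ in range(W)]
    for theta, t in zip(boundaries, cam_ts):
        cols, theta_ref = world_to_boundary(boundary_to_world(theta, t), cam_ts[ref_idx])
        for c, th in zip(cols, theta_ref):
            samples[c].append(th)
    pseudo = np.array([np.mean(s) if s else np.nan for s in samples])  # per-column pseudo-label
    var = np.array([np.var(s) if s else np.inf for s in samples])      # per-column uncertainty
    return pseudo, var

def weighted_l1_loss(pred, pseudo, var, eps=1e-3):
    """Uncertainty-weighted L1 loss: consistent (low-variance) columns dominate training."""
    valid = np.isfinite(pseudo) & np.isfinite(var)
    w = 1.0 / (var[valid] + eps)
    return np.sum(w * np.abs(pred[valid] - pseudo[valid])) / np.sum(w)

def consistency_score(var, eps=1e-6):
    """Entropy-style proxy for geometric consistency: mean differential entropy of per-column
    Gaussians fitted to the re-projected boundaries; lower means a more consistent scene."""
    v = var[np.isfinite(var)]
    return float(np.mean(0.5 * np.log(2.0 * np.pi * np.e * (v + eps))))
```

In this sketch, columns where the views disagree receive high variance and hence low weight, mirroring the uncertainty-weighted loss described in the abstract, while a lower average entropy indicates a more consistent scene geometry and can therefore rank models or hyper-parameters without labels.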


