Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction

06/16/2019
by   Steven Hickson, et al.
4

We propose 4 insights that help to significantly improve the performance of deep learning models that predict surface normals and semantic labels from a single RGB image. These insights are: (1) denoise the "ground truth" surface normals in the training set to ensure consistency with the semantic labels; (2) concurrently train on a mix of real and synthetic data, instead of pretraining on synthetic and finetuning on real; (3) jointly predict normals and semantics using a shared model, but only backpropagate errors on pixels that have valid training labels; (4) slim down the model and use grayscale instead of color inputs. Despite the simplicity of these steps, we demonstrate consistently improved results on several datasets, using a model that runs at 12 fps on a standard mobile phone.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

page 8

research
12/22/2016

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

Indoor scene understanding is central to applications such as robot navi...
research
01/08/2020

iDLG: Improved Deep Leakage from Gradients

It is widely believed that sharing gradients will not leak private train...
research
02/12/2023

SemanticAC: Semantics-Assisted Framework for Audio Classification

In this paper, we propose SemanticAC, a semantics-assisted framework for...
research
09/11/2018

Unbiasing Semantic Segmentation For Robot Perception using Synthetic Data Feature Transfer

Robot perception systems need to perform reliable image segmentation in ...
research
07/06/2020

Are Labels Necessary for Classifier Accuracy Evaluation?

To calculate the model accuracy on a computer vision task, e.g., object ...
research
03/17/2023

Semantic Scene Completion with Cleaner Self

Semantic Scene Completion (SSC) transforms an image of single-view depth...
research
08/07/2017

Training Deep Networks to be Spatially Sensitive

In many computer vision tasks, for example saliency prediction or semant...

Please sign up or login with your details

Forgot password? Click here to reset