Anisotropic Convolutional Networks for 3D Semantic Scene Completion

04/05/2020
by   Jie Li, et al.
2

As a voxel-wise labeling task, semantic scene completion (SSC) tries to simultaneously infer the occupancy and semantic labels for a scene from a single depth and/or RGB image. The key challenge for SSC is how to effectively take advantage of the 3D context to model various objects or stuffs with severe variations in shapes, layouts and visibility. To handle such variations, we propose a novel module called anisotropic convolution, which properties with flexibility and power impossible for the competing methods such as standard 3D convolution and some of its variations. In contrast to the standard 3D convolution that is limited to a fixed 3D receptive field, our module is capable of modeling the dimensional anisotropy voxel-wisely. The basic idea is to enable anisotropic 3D receptive field by decomposing a 3D convolution into three consecutive 1D convolutions, and the kernel size for each such 1D convolution is adaptively determined on the fly. By stacking multiple such anisotropic convolution modules, the voxel-wise modeling capability can be further enhanced while maintaining a controllable amount of model parameters. Extensive experiments on two SSC benchmarks, NYU-Depth-v2 and NYUCAD, show the superior performance of the proposed method. Our code is available at https://waterljwant.github.io/SSC/

READ FULL TEXT

page 3

page 6

page 11

research
11/28/2016

Semantic Scene Completion from a Single Depth Image

This paper focuses on semantic scene completion, a task for producing a ...
research
10/03/2019

3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation

A key challenge for RGB-D segmentation is how to effectively incorporate...
research
12/12/2018

Tree-structured Kronecker Convolutional Networks for Semantic Segmentation

Most existing semantic segmentation methods employ atrous convolution to...
research
12/12/2018

Tree-structured Kronecker Convolutional Network for Semantic Segmentation

Most existing semantic segmentation methods employ atrous convolution to...
research
07/18/2020

Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing

Depth data provide geometric information that can bring progress in RGB-...
research
07/11/2019

Efficient Semantic Scene Completion Network with Spatial Group Convolution

We introduce Spatial Group Convolution (SGC) for accelerating the comput...
research
03/31/2020

3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior

The goal of the Semantic Scene Completion (SSC) task is to simultaneousl...

Please sign up or login with your details

Forgot password? Click here to reset