Depth-aware Glass Surface Detection with Cross-modal Context Mining

06/22/2022
by   Jiaying Lin, et al.
8

Glass surfaces are becoming increasingly ubiquitous as modern buildings tend to use a lot of glass panels. This however poses substantial challenges on the operations of autonomous systems such as robots, self-driving cars and drones, as the glass panels can become transparent obstacles to the navigation.Existing works attempt to exploit various cues, including glass boundary context or reflections, as a prior. However, they are all based on input RGB images.We observe that the transmission of 3D depth sensor light through glass surfaces often produces blank regions in the depth maps, which can offer additional insights to complement the RGB image features for glass surface detection. In this paper, we propose a novel framework for glass surface detection by incorporating RGB-D information, with two novel modules: (1) a cross-modal context mining (CCM) module to adaptively learn individual and mutual context features from RGB and depth information, and (2) a depth-missing aware attention (DAA) module to explicitly exploit spatial locations where missing depths occur to help detect the presence of glass surfaces. In addition, we propose a large-scale RGB-D glass surface detection dataset, called RGB-D GSD, for RGB-D glass surface detection. Our dataset comprises 3,009 real-world RGB-D glass surface images with precise annotations. Extensive experimental results show that our proposed model outperforms state-of-the-art methods.

READ FULL TEXT

page 1

page 4

page 5

page 6

page 10

page 11

page 12

research
07/09/2020

Cross-Modal Weighting Network for RGB-D Salient Object Detection

Depth maps contain geometric clues for assisting Salient Object Detectio...
research
10/30/2018

Cross-Modal Attentional Context Learning for RGB-D Object Detection

Recognizing objects from simultaneously sensed photometric (RGB) and dep...
research
09/08/2021

RGB-D Salient Object Detection with Ubiquitous Target Awareness

Conventional RGB-D salient object detection methods aim to leverage dept...
research
10/12/2020

Learning Selective Mutual Attention and Contrast for RGB-D Saliency Detection

How to effectively fuse cross-modal information is the key problem for R...
research
02/08/2021

Towards Accurate RGB-D Saliency Detection with Complementary Attention and Adaptive Integration

Saliency detection based on the complementary information from RGB image...
research
05/26/2023

GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation

Most existing works solving Room-to-Room VLN problem only utilize RGB im...
research
07/13/2022

Symmetry-Aware Transformer-based Mirror Detection

Mirror detection aims to identify the mirror regions in the given input ...

Please sign up or login with your details

Forgot password? Click here to reset