2D-3D Geometric Fusion Network using Multi-Neighbourhood Graph Convolution for RGB-D Indoor Scene Classification

09/23/2020
by   Albert Mosella-Montoro, et al.

Multi-modal fusion has been shown to improve performance on scene classification tasks. This paper presents a 2D-3D fusion stage that combines 3D geometric features with 2D texture features obtained by 2D Convolutional Neural Networks. To obtain a robust 3D geometric embedding, a network built on two novel layers is proposed. The first layer, Multi-Neighbourhood Graph Convolution, aims to learn a more robust geometric descriptor of the scene by combining two different neighbourhoods: one in Euclidean space and the other in feature space. The second proposed layer, Nearest Voxel Pooling, improves on the performance of the well-known Voxel Pooling. Experimental results on the NYU-Depth-v2 and SUN RGB-D datasets show that the proposed method outperforms the current state of the art in RGB-D indoor scene classification.
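
The two-neighbourhood idea can be illustrated with a minimal NumPy sketch. This is not the paper's layer: the abstract does not specify the convolution weights, the neighbourhood sizes, or how the two neighbourhoods are combined. The function names `knn_indices` and `multi_neighbourhood_aggregate`, the k-nearest-neighbour search, the mean aggregation, and the concatenation are all assumptions chosen for illustration.

```python
import numpy as np


def knn_indices(x, k):
    """Return, for each row of x, the indices of its k nearest other rows."""
    # Pairwise squared distances between all rows, shape (n, n).
    d2 = np.sum((x[:, None, :] - x[None, :, :]) ** 2, axis=-1)
    # A point is not counted as its own neighbour.
    np.fill_diagonal(d2, np.inf)
    return np.argsort(d2, axis=1)[:, :k]


def multi_neighbourhood_aggregate(xyz, feats, k=2):
    """Aggregate features over two neighbourhoods per point (hypothetical sketch).

    xyz:   (n, 3) point coordinates, defining the Euclidean neighbourhood.
    feats: (n, c) per-point features, defining the feature-space neighbourhood.
    Returns an (n, 2c) array: mean neighbour feature from each neighbourhood,
    concatenated. Mean + concatenation is an assumption, not the paper's rule.
    """
    euclid_nbrs = knn_indices(xyz, k)    # neighbours close in 3D space
    feat_nbrs = knn_indices(feats, k)    # neighbours close in feature space
    agg_euclid = feats[euclid_nbrs].mean(axis=1)
    agg_feat = feats[feat_nbrs].mean(axis=1)
    return np.concatenate([agg_euclid, agg_feat], axis=1)


if __name__ == "__main__":
    xyz = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [10.0, 0.0, 0.0]])
    feats = np.array([[0.0], [10.0], [1.0]])
    print(multi_neighbourhood_aggregate(xyz, feats, k=1))
```

In this toy input the first and third points are far apart in space but close in feature value, so the feature-space neighbourhood links them while the Euclidean one does not; that is the kind of complementary information the two neighbourhoods are meant to provide.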


