Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition

04/06/2016
by   Ziyan Wang, et al.
0

In this paper, we propose a new correlated and individual multi-modal deep learning (CIMDL) method for RGB-D object recognition. Unlike most conventional RGB-D object recognition methods which extract features from the RGB and depth channels individually, our CIMDL jointly learns feature representations from raw RGB-D data with a pair of deep neural networks, so that the sharable and modal-specific information can be simultaneously exploited. Specifically, we construct a pair of deep convolutional neural networks (CNNs) for the RGB and depth data, and concatenate them at the top layer of the network with a loss function which learns a new feature space where both correlated part and the individual part of the RGB-D information are well modelled. The parameters of the whole networks are updated by using the back-propagation criterion. Experimental results on two widely used RGB-D object image benchmark datasets clearly show that our method outperforms state-of-the-arts.

READ FULL TEXT

page 7

page 8

research
06/05/2018

Recurrent Convolutional Fusion for RGB-D Object Recognition

Providing machines with the ability to recognize objects like humans has...
research
07/24/2015

Multimodal Deep Learning for Robust RGB-D Object Recognition

Robust object recognition is a crucial ingredient of many, if not all, r...
research
04/26/2020

When CNNs Meet Random RNNs: Towards Multi-Level Analysis for RGB-D Object and Scene Recognition

Recognizing objects and scenes are two challenging but essential tasks i...
research
04/13/2021

SPARK: SPAcecraft Recognition leveraging Knowledge of Space Environment

This paper proposes the SPARK dataset as a new unique space object multi...
research
03/31/2017

(DE)^2 CO: Deep Depth Colorization

Object recognition on depth images using convolutional neural networks r...
research
09/08/2022

RGB-X Classification for Electronics Sorting

Effectively disassembling and recovering materials from waste electrical...
research
01/17/2019

Background subtraction on depth videos with convolutional neural networks

Background subtraction is a significant component of computer vision sys...

Please sign up or login with your details

Forgot password? Click here to reset