Pairwise Decomposition of Image Sequences for Active Multi-View Recognition

05/26/2016
by   Edward Johns, et al.
0

A multi-view image sequence provides a much richer capacity for object recognition than from a single image. However, most existing solutions to multi-view recognition typically adopt hand-crafted, model-based geometric methods, which do not readily embrace recent trends in deep learning. We propose to bring Convolutional Neural Networks to generic multi-view recognition, by decomposing an image sequence into a set of image pairs, classifying each pair independently, and then learning an object classifier by weighting the contribution of each pair. This allows for recognition over arbitrary camera trajectories, without requiring explicit training over the potentially infinite number of camera paths and lengths. Building these pairwise relationships then naturally extends to the next-best-view problem in an active recognition framework. To achieve this, we train a second Convolutional Neural Network to map directly from an observed image to next viewpoint. Finally, we incorporate this into a trajectory optimisation task, whereby the best recognition confidence is sought for a given trajectory length. We present state-of-the-art results in both guided and unguided multi-view recognition on the ModelNet dataset, and show how our method can be used with depth images, greyscale images, or both.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2019

MV-C3D: A Spatial Correlated Multi-View 3D Convolutional Neural Networks

As the development of deep neural networks, 3D object recognition is bec...
research
11/17/2019

Leveraging Multi-view Image Sets for Unsupervised Intrinsic Image Decomposition and Highlight Separation

We present an unsupervised approach for factorizing object appearance in...
research
11/26/2018

IGNOR: Image-guided Neural Object Rendering

We propose a new learning-based novel view synthesis approach for scanne...
research
11/20/2022

R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition

Recently, vision architectures based exclusively on multi-layer perceptr...
research
07/29/2015

Deep Learning for Single-View Instance Recognition

Deep learning methods have typically been trained on large datasets in w...
research
08/11/2016

Multi-View Product Image Search Using Deep ConvNets Representations

Multi-view product image queries can improve retrieval performance over ...
research
06/29/2012

Visual Vocabulary Learning and Its Application to 3D and Mobile Visual Search

In this technical report, we review related works and recent trends in v...

Please sign up or login with your details

Forgot password? Click here to reset