Multi-View Product Image Search Using Deep ConvNets Representations

by Muhammet Bastan, et al.

Multi-view product image queries can significantly improve retrieval performance over single-view queries. In this paper, we investigated the performance of deep convolutional neural networks (ConvNets) on multi-view product image search. First, we trained a VGG-like network to learn deep ConvNet representations of product images. Then, we computed the deep ConvNet representations of the database and query images and performed both single-view queries and multi-view queries, the latter using several early and late fusion approaches. We performed extensive experiments on the publicly available Multi-View Object Image Dataset (MVOD 5K), using both clean-background queries from the Internet and cluttered-background queries taken with a mobile phone. We compared the performance of ConvNets to the classical bag-of-visual-words (BoWs) approach. We concluded that (1) multi-view queries with deep ConvNet representations perform significantly better than single-view queries, (2) ConvNets perform much better than BoWs and have room for further improvement, and (3) pre-training the ConvNets on a different image dataset with background clutter is needed to obtain good performance on cluttered product image queries taken with a mobile phone.
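The abstract does not spell out which early and late fusion variants were used, so the following is only a minimal sketch of the general idea, under stated assumptions: each image is already represented by a fixed-length feature vector (random vectors stand in for ConvNet features here), similarity is cosine similarity, early fusion averages the query-view features before a single search, and late fusion searches with each view separately and then combines the per-view scores. The function names and the `combine` parameter are hypothetical and not taken from the paper.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale vectors along the last axis to unit length."""
    x = np.asarray(x, dtype=np.float64)
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def early_fusion_scores(query_views, db):
    """Early fusion (assumed variant): average the view features, search once."""
    fused = l2_normalize(l2_normalize(query_views).mean(axis=0))
    return l2_normalize(db) @ fused  # one cosine score per database image


def late_fusion_scores(query_views, db, combine="max"):
    """Late fusion (assumed variant): search per view, then combine the scores."""
    sims = l2_normalize(db) @ l2_normalize(query_views).T  # shape (n_db, n_views)
    if combine == "max":
        return sims.max(axis=1)
    if combine == "mean":
        return sims.mean(axis=1)
    raise ValueError(f"unknown combine mode: {combine!r}")


def rank(scores, k=5):
    """Indices of the k highest-scoring database images, best first."""
    return np.argsort(-np.asarray(scores))[:k]


# Toy data: 4 database images with 64-d features (stand-ins for ConvNet features).
rng = np.random.default_rng(0)
db = rng.normal(size=(4, 64))

# Three query views of the product at database index 2, each slightly perturbed.
query_views = db[2] + 0.1 * rng.normal(size=(3, 64))

early_top = rank(early_fusion_scores(query_views, db), k=4)
late_max_top = rank(late_fusion_scores(query_views, db, combine="max"), k=4)
late_mean_top = rank(late_fusion_scores(query_views, db, combine="mean"), k=4)
print(early_top[0], late_max_top[0], late_mean_top[0])
```

With unit-length features and cosine similarity, averaging the per-view scores is the same as scoring against the averaged query vector up to a positive scale factor, so the `mean` late fusion and this early fusion produce the same ranking. That is why the sketch defaults to `max` for late fusion: it is a combination rule that can actually rank differently from the early-fusion variant.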


Mobile Multi-View Object Image Search

High user interaction capability of mobile devices can help improve the ...

3M3D: Multi-view, Multi-path, Multi-representation for 3D Object Detection

3D visual perception tasks based on multi-camera images are essential fo...

MV-C3D: A Spatial Correlated Multi-View 3D Convolutional Neural Networks

As the development of deep neural networks, 3D object recognition is bec...

Embedded Deep Bilinear Interactive Information and Selective Fusion for Multi-view Learning

As a concrete application of multi-view learning, multi-view classificat...

Multi-view information fusion using multi-view variational autoencoders to predict proximal femoral strength

Background and aim: Hip fracture can be devastating. The proximal femora...

TumorNet: Lung Nodule Characterization Using Multi-View Convolutional Neural Network with Gaussian Process

Characterization of lung nodules as benign or malignant is one of the mo...

Multi-View Networks For Multi-Channel Audio Classification

In this paper we introduce the idea of multi-view networks for sound cla...