Multi-View Product Image Search Using Deep ConvNets Representations

08/11/2016
by Muhammet Bastan, et al.

Multi-view product image queries can significantly improve retrieval performance over single-view queries. In this paper, we investigate the performance of deep convolutional neural networks (ConvNets) on multi-view product image search. First, we trained a VGG-like network to learn deep ConvNets representations of product images. Then, we computed the deep ConvNets representations of the database and query images and performed both single-view queries and multi-view queries, the latter using several early and late fusion approaches. We performed extensive experiments on the publicly available Multi-View Object Image Dataset (MVOD 5K), using both clean-background queries collected from the Internet and cluttered-background queries captured with a mobile phone. We compared the performance of ConvNets to that of the classical bag-of-visual-words (BoWs) approach. We conclude that (1) multi-view queries with deep ConvNets representations perform significantly better than single-view queries, (2) ConvNets perform much better than BoWs and have room for further improvement, and (3) pre-training ConvNets on a different image dataset with background clutter is needed to obtain good performance on cluttered product image queries captured with a mobile phone.
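The distinction between early and late fusion of multi-view queries can be illustrated with a minimal sketch. The abstract does not say which fusion variants were evaluated, so everything below is an assumption made for illustration: feature vectors are compared by cosine similarity, early fusion averages the query view descriptors before a single search, and late fusion searches with each view separately and combines the per-view similarity scores by maximum or sum. The function names (`early_fusion_scores`, `late_fusion_scores`, `rank`) are hypothetical and are not taken from the paper.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale vectors along the last axis to unit L2 norm."""
    x = np.asarray(x, dtype=np.float64)
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def early_fusion_scores(query_views, database):
    """Assumed early fusion: average the normalized view descriptors
    into one query vector, then score the database once."""
    fused = l2_normalize(l2_normalize(query_views).mean(axis=0))
    return l2_normalize(database) @ fused


def late_fusion_scores(query_views, database, combine="max"):
    """Assumed late fusion: score the database with each view separately,
    then combine the per-view cosine similarities."""
    # sims has shape (n_database_images, n_query_views)
    sims = l2_normalize(database) @ l2_normalize(query_views).T
    if combine == "max":
        return sims.max(axis=1)
    if combine == "sum":
        return sims.sum(axis=1)
    raise ValueError(f"unknown combine rule: {combine!r}")


def rank(scores, k=5):
    """Return indices of the k highest-scoring database images."""
    return np.argsort(-np.asarray(scores))[:k]
```

In use, `query_views` would hold one ConvNet descriptor per query image of the same product and `database` one descriptor per database image; `rank(late_fusion_scores(query_views, database))` then returns the top matches. A single-view query is the special case in which `query_views` has one row, where both fusion rules give the same ranking.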

