One-Shot Item Search with Multimodal Data

11/27/2018
by   Jonghwa Yim, et al.
0

In the task of near similar image search, features from Deep Neural Network is often used to compare images and measure similarity. In the past, we only focused visual search in image dataset without text data. However, since deep neural network emerged, the performance of visual search becomes high enough to apply it in many industries from 3D data to multimodal data. Compared to the needs of multimodal search, there has not been sufficient researches. In this paper, we present a method of near similar search with image and text multimodal dataset. Earlier time, similar image search, especially when searching shopping items, treated image and text separately to search similar items and reorder the results. This regards two tasks of image search and text matching as two different tasks. Our method, however, explore the vast data to compute k-nearest neighbors using both image and text. In our experiment of similar item search, our system using multimodal data shows better performance than single data while it only increases minute computing time. For the experiment, we collected more than 15 million of accessory and six million of digital product items from online shopping websites, in which the product item comprises item images, titles, categories, and descriptions. Then we compare the performance of multimodal searching to single space searching in these datasets.

READ FULL TEXT
research
06/15/2019

Joint Visual-Textual Embedding for Multimodal Style Search

We introduce a multimodal visual-textual search refinement method for fa...
research
06/28/2022

Item Matching using Text Description and Similarity Search

In this paper, we focus on the problem of item matching using only the d...
research
03/24/2023

Search By Image: Deeply Exploring Beneficial Features for Beauty Product Retrieval

Searching by image is popular yet still challenging due to the extensive...
research
05/13/2019

Learning to Search Efficiently Using Comparisons

We consider the problem of searching in a set of items by using pairwise...
research
11/24/2016

Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

In this paper, we focus on training and evaluating effective word embedd...
research
11/30/2020

A proposal and evaluation of new timbre visualisation methods for audio sample browsers

Searching through vast libraries of sound samples can be a daunting and ...
research
07/04/2019

Searching for Apparel Products from Images in the Wild

In this age of social media, people often look at what others are wearin...

Please sign up or login with your details

Forgot password? Click here to reset