Measuring and Predicting Tag Importance for Image Retrieval

02/28/2016
by Shangwen Li, et al.

In today's Multimodal Image Retrieval (MIR) systems, textual data such as tags and sentence descriptions are combined with visual cues to reduce the semantic gap in image retrieval. However, these systems treat all tags as equally important, which may misalign the visual and textual modalities during MIR training and, in turn, degrade retrieval performance at query time. To address this issue, we investigate the problem of tag importance prediction, where the goal is to automatically predict the importance of each tag and use it in image retrieval. We first propose a method to measure the relative importance of object and scene tags from image sentence descriptions. Using these measurements as the ground truth, we present a tag importance prediction model that jointly exploits visual, semantic, and context cues. The Structural Support Vector Machine (SSVM) formulation is adopted to ensure efficient training of the prediction model. Canonical Correlation Analysis (CCA) is then employed to learn the relation between image visual features and tag importance to obtain robust retrieval performance. Experimental results on three real-world datasets show that the proposed MIR with Tag Importance Prediction (MIR/TIP) system significantly outperforms other MIR systems.
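The abstract does not spell out how tag importance is measured from sentence descriptions, so the following is only a rough illustration of the general idea, not the authors' method. It assumes a simple proxy: a tag is more important the more often human-written descriptions of the image mention it. The function name `tag_importance`, the whitespace tokenization, and the example data are all hypothetical.

```python
from collections import Counter


def tag_importance(tags, descriptions):
    """Score each tag by how many descriptions mention it, normalized to sum to 1.

    Assumption (not from the paper): mention frequency is used as a
    stand-in for importance, and matching is exact on lowercase words.
    """
    if not tags:
        return {}
    counts = Counter()
    for sentence in descriptions:
        words = set(sentence.lower().split())
        for tag in tags:
            if tag.lower() in words:
                counts[tag] += 1
    total = sum(counts.values())
    if total == 0:
        # No tag is mentioned anywhere: fall back to uniform importance.
        return {tag: 1.0 / len(tags) for tag in tags}
    return {tag: counts[tag] / total for tag in tags}


# Hypothetical image with three tags and four sentence descriptions.
tags = ["dog", "frisbee", "tree"]
descriptions = [
    "a dog jumps to catch a frisbee",
    "a brown dog playing in the park",
    "the dog runs on the grass",
    "a dog with a frisbee in its mouth",
]
scores = tag_importance(tags, descriptions)
```

In this toy example "dog" receives the highest score and "tree", which no description mentions, receives zero. A uniform weighting would instead treat all three tags alike, which is the misalignment the abstract describes.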


