Fast Zero-Shot Image Tagging

05/31/2016
by Yang Zhang, et al.

The well-known word analogy experiments show that recent word vectors capture fine-grained linguistic regularities in words by linear vector offsets, but it is unclear how well these simple vector offsets can encode visual regularities over words. We study a particular image-word relevance relation in this paper. Our results show that the word vectors of the relevant tags for a given image rank ahead of those of the irrelevant tags along a principal direction in the word vector space. Inspired by this observation, we propose to solve image tagging by estimating the principal direction for an image. In particular, we exploit linear mappings and nonlinear deep neural networks to approximate the principal direction from an input image. We arrive at a versatile tagging model. It runs fast on a test image, in constant time with respect to the training set size. It not only gives superior performance on the conventional tagging task on the NUS-WIDE dataset, but also outperforms competitive baselines on annotating images with previously unseen tags.
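The ranking idea described in the abstract can be illustrated with a small sketch. This is a toy example under stated assumptions, not the authors' implementation: ridge regression stands in for the linear mapping, the image features and word vectors are random placeholders, and the names `rank_tags` and `fit_linear_map` are hypothetical.

```python
import numpy as np


def rank_tags(direction, tag_vectors):
    """Rank tags by the projection of their word vectors onto a direction.

    Returns the tag indices from most to least relevant, plus the raw scores.
    """
    scores = tag_vectors @ direction
    return np.argsort(-scores), scores


def fit_linear_map(features, directions, reg=1e-3):
    """Fit a linear map from image features to directions (ridge regression).

    Assumption: a closed-form ridge solution is used here only as a stand-in
    for whatever linear mapping the paper actually learns.
    """
    dim = features.shape[1]
    gram = features.T @ features + reg * np.eye(dim)
    return np.linalg.solve(gram, features.T @ directions)


# Toy data (all values are synthetic placeholders).
rng = np.random.default_rng(0)
tag_vectors = rng.normal(size=(6, 4))   # 6 hypothetical tags, 4-dim word vectors
features = rng.normal(size=(50, 8))     # 50 toy images, 8-dim image features
true_map = rng.normal(size=(8, 4))      # synthetic ground-truth mapping
directions = features @ true_map        # synthetic target directions

learned_map = fit_linear_map(features, directions)

# Tagging a new image: one matrix-vector product plus one projection per tag.
# The cost depends on the feature and tag dimensions, not on the number of
# training images.
test_feature = rng.normal(size=8)
order, scores = rank_tags(test_feature @ learned_map, tag_vectors)
print("tags ranked from most to least relevant:", order.tolist())
```

In this sketch, a tag that was never seen during training only needs a row in `tag_vectors` to be ranked, which is the sense in which a direction-based tagger can handle previously unseen tags.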


research
03/16/2018

Deep Multiple Instance Learning for Zero-shot Image Tagging

In-line with the success of deep learning on traditional recognition pro...
research
11/21/2016

Sampled Image Tagging and Retrieval Methods on User Generated Content

Traditional image tagging and retrieval algorithms have limited value as...
research
09/29/2017

Towards Universal Semantic Tagging

The paper proposes the task of universal semantic tagging---tagging word...
research
07/14/2017

DocTag2Vec: An Embedding Based Multi-label Learning Approach for Document Tagging

Tagging news articles or blog posts with relevant tags from a collection...
research
02/15/2022

Unsupervised word-level prosody tagging for controllable speech synthesis

Although word-level prosody modeling in neural text-to-speech (TTS) has ...
research
11/01/2022

TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding Tag/Word Relations and More Fine-Grained Tags

So far, discontinuous named entity recognition (NER) has received increa...
research
12/20/2014

Improving zero-shot learning by mitigating the hubness problem

The zero-shot paradigm exploits vector-based word representations extrac...
