Deep Convolutional Ranking for Multilabel Image Annotation

12/17/2013
by   Yunchao Gong, et al.
0

Multilabel image annotation is one of the most important challenges in computer vision with many real-world applications. While existing work usually use conventional visual features for multilabel annotation, features based on Deep Neural Networks have shown potential to significantly boost performance. In this work, we propose to leverage the advantage of such features and analyze key components that lead to better performances. Specifically, we show that a significant performance gain could be obtained by combining convolutional architectures with approximate top-k ranking objectives, as thye naturally fit the multilabel tagging problem. Our experiments on the NUS-WIDE dataset outperforms the conventional visual features by about 10 reported performance in the literature.

READ FULL TEXT

page 2

page 8

research
06/22/2017

Comparing Neural and Attractiveness-based Visual Features for Artwork Recommendation

Advances in image processing and computer vision in the latest years hav...
research
12/08/2014

Web image annotation by diffusion maps manifold learning algorithm

Automatic image annotation is one of the most challenging problems in ma...
research
04/16/2020

PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks

Computer vision with state-of-the-art deep learning models has achieved ...
research
01/05/2018

VSE-ens: Visual-Semantic Embeddings with Efficient Negative Sampling

Jointing visual-semantic embeddings (VSE) have become a research hotpot ...
research
11/28/2022

Pitfalls of Conditional Batch Normalization for Contextual Multi-Modal Learning

Humans have perfected the art of learning from multiple modalities throu...

Please sign up or login with your details

Forgot password? Click here to reset