Hybrid Generative/Discriminative Learning for Automatic Image Annotation

03/15/2012
by   Shuang Hong Yang, et al.
0

Automatic image annotation (AIA) raises tremendous challenges to machine learning as it requires modeling of data that are both ambiguous in input and output, e.g., images containing multiple objects and labeled with multiple semantic tags. Even more challenging is that the number of candidate tags is usually huge (as large as the vocabulary size) yet each image is only related to a few of them. This paper presents a hybrid generative-discriminative classifier to simultaneously address the extreme data-ambiguity and overfitting-vulnerability issues in tasks such as AIA. Particularly: (1) an Exponential-Multinomial Mixture (EMM) model is established to capture both the input and output ambiguity and in the meanwhile to encourage prediction sparsity; and (2) the prediction ability of the EMM model is explicitly maximized through discriminative learning that integrates variational inference of graphical models and the pairwise formulation of ordinal regression. Experiments show that our approach achieves both superior annotation performance and better tag scalability.

READ FULL TEXT
research
11/19/2017

Kill Two Birds with One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement

The number of social images has exploded by the wide adoption of social ...
research
09/17/2014

Adaptive Tag Selection for Image Annotation

Not all tags are relevant to an image, and the number of relevant tags i...
research
03/31/2018

Tagging like Humans: Diverse and Distinct Image Annotation

In this work we propose a new automatic image annotation model, dubbed ...
research
04/21/2020

Automatic Tag Recommendation for Painting Artworks Using Diachronic Descriptions

In this paper, we deal with the problem of automatic tag recommendation ...
research
11/16/2022

GAMMT: Generative Ambiguity Modeling Using Multiple Transformers

We introduce a new model based on sets of probabilities for sequential d...
research
12/04/2021

In Search of Ambiguity: A Three-Stage Workflow Design to Clarify Annotation Guidelines for Crowd Workers

We propose a novel three-stage FIND-RESOLVE-LABEL workflow for crowdsour...

Please sign up or login with your details

Forgot password? Click here to reset