Stochastic Learning of Multi-Instance Dictionary for Earth Mover's Distance based Histogram Comparison

09/03/2016
by   Jihong Fan, et al.
0

Dictionary plays an important role in multi-instance data representation. It maps bags of instances to histograms. Earth mover's distance (EMD) is the most effective histogram distance metric for the application of multi-instance retrieval. However, up to now, there is no existing multi-instance dictionary learning methods designed for EMD based histogram comparison. To fill this gap, we develop the first EMD-optimal dictionary learning method using stochastic optimization method. In the stochastic learning framework, we have one triplet of bags, including one basic bag, one positive bag, and one negative bag. These bags are mapped to histograms using a multi-instance dictionary. We argue that the EMD between the basic histogram and the positive histogram should be smaller than that between the basic histogram and the negative histogram. Base on this condition, we design a hinge loss. By minimizing this hinge loss and some regularization terms of the dictionary, we update the dictionary instances. The experiments over multi-instance retrieval applications shows its effectiveness when compared to other dictionary learning methods over the problems of medical image retrieval and natural language relation classification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/09/2015

Sparse Coding with Earth Mover's Distance for Multi-Instance Histogram Representation

Sparse coding (Sc) has been studied very well as a powerful data represe...
research
11/09/2015

Multiple Instance Dictionary Learning using Functions of Multiple Instances

A multiple instance dictionary learning method using functions of multip...
research
07/06/2019

Bag-of-Audio-Words based on Autoencoder Codebook for Continuous Emotion Prediction

In this paper we present a novel approach for extracting a Bag-of-Words ...
research
05/26/2016

Domain Transfer Multi-Instance Dictionary Learning

In this paper, we invest the domain transfer learning problem with multi...
research
09/22/2022

Efficient CNN with uncorrelated Bag of Features pooling

Despite the superior performance of CNN, deploying them on low computati...
research
01/20/2020

Analysis of the quotation corpus of the Russian Wiktionary

The quantitative evaluation of quotations in the Russian Wiktionary was ...
research
01/06/2015

A Study on Clustering for Clustering Based Image De-Noising

In this paper, the problem of de-noising of an image contaminated with A...

Please sign up or login with your details

Forgot password? Click here to reset