Modeling Latent Variable Uncertainty for Loss-based Learning

06/18/2012
by   M. Pawan Kumar, et al.
0

We consider the problem of parameter estimation using weakly supervised datasets, where a training sample consists of the input and a partially specified annotation, which we refer to as the output. The missing information in the annotation is modeled using latent variables. Previous methods overburden a single distribution with two separate tasks: (i) modeling the uncertainty in the latent variables during training; and (ii) making accurate predictions for the output and the latent variables during testing. We propose a novel framework that separates the demands of the two tasks using two distributions: (i) a conditional distribution to model the uncertainty of the latent variables for a given input-output pair; and (ii) a delta distribution to predict the output and the latent variables for a given input. During learning, we encourage agreement between the two distributions by minimizing a loss-based dissimilarity coefficient. Our approach generalizes latent SVM in two important ways: (i) it models the uncertainty over latent variables instead of relying on a pointwise estimate; and (ii) it allows the use of loss functions that depend on latent variables, which greatly increases its applicability. We demonstrate the efficacy of our approach on two challenging problems---object detection and action detection---using publicly available datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2022

High-dimensional factor copula models with estimation of latent variables

Factor models are a parsimonious way to explain the dependence of variab...
research
11/25/2018

Dissimilarity Coefficient based Weakly Supervised Object Detection

We consider the problem of weakly supervised object detection, where the...
research
05/24/2022

RevUp: Revise and Update Information Bottleneck for Event Representation

In machine learning, latent variables play a key role to capture the und...
research
09/22/2022

Learning Interpretable Latent Dialogue Actions With Less Supervision

We present a novel architecture for explainable modeling of task-oriente...
research
10/16/2012

Latent Composite Likelihood Learning for the Structured Canonical Correlation Model

Latent variable models are used to estimate variables of interest quanti...
research
02/03/2015

Deep Joint Task Learning for Generic Object Extraction

This paper investigates how to extract objects-of-interest without relyi...
research
07/25/2020

Modal Uncertainty Estimation via Discrete Latent Representation

Many important problems in the real world don't have unique solutions. I...

Please sign up or login with your details

Forgot password? Click here to reset