Gibbs Max-margin Topic Models with Data Augmentation

10/10/2013
by   Jun Zhu, et al.
0

Max-margin learning is a powerful approach to building classifiers and structured output predictors. Recent work on max-margin supervised topic models has successfully integrated it with Bayesian topic models to discover discriminative latent semantic structures and make accurate predictions for unseen testing data. However, the resulting learning problems are usually hard to solve because of the non-smoothness of the margin loss. Existing approaches to building max-margin supervised topic models rely on an iterative procedure to solve multiple latent SVM subproblems with additional mean-field assumptions on the desired posterior distributions. This paper presents an alternative approach by defining a new max-margin loss. Namely, we present Gibbs max-margin supervised topic models, a latent variable Gibbs classifier to discover hidden topic representations for various tasks, including classification, regression and multi-task learning. Gibbs max-margin supervised topic models minimize an expected margin loss, which is an upper bound of the existing margin loss derived from an expected prediction rule. By introducing augmented variables and integrating out the Dirichlet variables analytically by conjugacy, we develop simple Gibbs sampling algorithms with no restricting assumptions and no need to solve SVM subproblems. Furthermore, each step of the "augment-and-collapse" Gibbs sampling algorithms has an analytical conditional distribution, from which samples can be easily drawn. Experimental results demonstrate significant improvements on time efficiency. The classification performance is also significantly improved over competitors on binary, multi-class and multi-label classification tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2013

Improved Bayesian Logistic Supervised Topic Models with Data Augmentation

Supervised topic models with a logistic likelihood have two issues that ...
research
12/30/2009

MedLDA: A General Framework of Maximum Margin Supervised Topic Models

Supervised topic models utilize document's side information for discover...
research
10/09/2013

Discriminative Relational Topic Models

Many scientific and engineering fields involve analyzing network data. F...
research
05/31/2021

Max-Margin is Dead, Long Live Max-Margin!

The foundational concept of Max-Margin in machine learning is ill-posed ...
research
05/23/2018

Learning latent variable structured prediction models with Gaussian perturbations

The standard margin-based structured prediction commonly uses a maximum ...
research
03/14/2022

Soft-margin classification of object manifolds

A neural population responding to multiple appearances of a single objec...
research
02/14/2012

Sparse Topical Coding

We present sparse topical coding (STC), a non-probabilistic formulation ...

Please sign up or login with your details

Forgot password? Click here to reset