Bethe Projections for Non-Local Inference

03/04/2015
by   Luke Vilnis, et al.
0

Many inference problems in structured prediction are naturally solved by augmenting a tractable dependency structure with complex, non-local auxiliary objectives. This includes the mean field family of variational inference algorithms, soft- or hard-constrained inference using Lagrangian relaxation or linear programming, collective graphical models, and forms of semi-supervised learning such as posterior regularization. We present a method to discriminatively learn broad families of inference objectives, capturing powerful non-local statistics of the latent variables, while maintaining tractable and provably fast inference using non-Euclidean projected gradient descent with a distance-generating function given by the Bethe entropy. We demonstrate the performance and flexibility of our method by (1) extracting structured citations from research papers by learning soft global constraints, (2) achieving state-of-the-art results on a widely-used handwriting recognition task using a novel learned non-convex inference procedure, and (3) providing a fast and highly scalable algorithm for the challenging problem of inference in a collective graphical model applied to bird migration.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2015

Copula variational inference

We develop a general variational inference method that preserves depende...
research
05/09/2012

Optimization of Structured Mean Field Objectives

In intractable, undirected graphical models, an intuitive way of creatin...
research
05/20/2019

MaxEntropy Pursuit Variational Inference

One of the core problems in variational inference is a choice of approxi...
research
02/15/2021

Tractable structured natural gradient descent using local parameterizations

Natural-gradient descent on structured parameter spaces (e.g., low-rank ...
research
09/26/2013

Automorphism Groups of Graphical Models and Lifted Variational Inference

Using the theory of group action, we first introduce the concept of the ...
research
09/28/2018

Embedded-State Latent Conditional Random Fields for Sequence Labeling

Complex textual information extraction tasks are often posed as sequence...
research
02/18/2021

Non-approximate Inference for Collective Graphical Models on Path Graphs via Discrete Difference of Convex Algorithm

The importance of aggregated count data, which is calculated from the da...

Please sign up or login with your details

Forgot password? Click here to reset