Structured Prediction Energy Networks

11/19/2015
by David Belanger, et al.

We introduce structured prediction energy networks (SPENs), a flexible framework for structured prediction. A deep architecture is used to define an energy function of candidate labels, and then predictions are produced by using back-propagation to iteratively optimize the energy with respect to the labels. This deep architecture captures dependencies between labels that would lead to intractable graphical models, and performs structure learning by automatically learning discriminative features of the structured output. One natural application of our technique is multi-label classification, which traditionally has required strict prior assumptions about the interactions between labels to ensure tractable learning and prediction. We are able to apply SPENs to multi-label problems with substantially larger label sets than previous applications of structured prediction, while modeling high-order interactions using minimal structural assumptions. Overall, deep learning provides remarkable tools for learning features of the inputs to a prediction problem, and this work extends these techniques to learning features of structured outputs. Our experiments provide impressive performance on a variety of benchmark multi-label classification tasks, demonstrate that our technique can be used to provide interpretable structure learning, and illuminate fundamental trade-offs between feed-forward and iterative structured prediction.
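The prediction procedure described in the abstract, iteratively optimizing an energy with respect to the labels, can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the particular energy (a linear local term plus a one-hidden-layer global term over the labels), the relaxation of binary labels to the interval [0, 1], the use of plain projected gradient descent, and all function and parameter names. The gradient is written out by hand with NumPy here, where a real implementation would obtain it by back-propagation through a deeper network.

```python
import numpy as np

def softplus(z):
    # Numerically stable log(1 + exp(z)).
    return np.logaddexp(0.0, z)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def energy(y, x, W, C1, c2):
    # Hypothetical energy: a local term scoring each label against the
    # input, plus a global term that depends on the labels jointly.
    local = -y @ (W @ x)
    global_term = c2 @ softplus(C1 @ y)
    return local + global_term

def energy_grad_y(y, x, W, C1, c2):
    # Gradient of the energy with respect to the (relaxed) labels y.
    return -(W @ x) + C1.T @ (c2 * sigmoid(C1 @ y))

def predict(x, W, C1, c2, steps=200, lr=0.05):
    # Start from an uninformative labeling and descend the energy,
    # clipping back into [0, 1] after every step (projected gradient).
    y = np.full(W.shape[0], 0.5)
    for _ in range(steps):
        y = np.clip(y - lr * energy_grad_y(y, x, W, C1, c2), 0.0, 1.0)
    return y

# Toy problem with random (untrained) parameters.
rng = np.random.default_rng(0)
num_features, num_labels, num_hidden = 5, 4, 3
x = rng.normal(size=num_features)
W = rng.normal(size=(num_labels, num_features))
C1 = rng.normal(size=(num_hidden, num_labels))
c2 = rng.normal(size=num_hidden)

y0 = np.full(num_labels, 0.5)
y_hat = predict(x, W, C1, c2)
print("relaxed prediction:", y_hat)
print("energy before/after:", energy(y0, x, W, C1, c2), energy(y_hat, x, W, C1, c2))
```

Rounding `y_hat` at 0.5 would give a discrete multi-label prediction. The global term is what lets the energy depend on combinations of labels without a hand-specified interaction structure. With only the local term, each label would be optimized independently.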


research
02/13/2018

Predict and Constrain: Modeling Cardinality in Deep Structured Prediction

Many machine learning problems require the prediction of multi-dimension...
research
12/22/2009

Learning to Predict Combinatorial Structures

The major challenge in designing a discriminative learning algorithm for...
research
08/25/2020

Towards Structured Prediction in Bioinformatics with Deep Learning

Using machine learning, especially deep learning, to facilitate biologic...
research
06/27/2012

Maximum Margin Output Coding

In this paper we study output coding for multi-label prediction. For a m...
research
10/30/2017

Prototype Matching Networks for Large-Scale Multi-label Genomic Sequence Classification

One of the fundamental tasks in understanding genomics is the problem of...
research
04/08/2019

Component-Wise Boosting of Targets for Multi-Output Prediction

Multi-output prediction deals with the prediction of several targets of ...
research
03/09/2018

Learning Approximate Inference Networks for Structured Prediction

Structured prediction energy networks (SPENs; Belanger & McCallum 2016) ...
