End-to-End Learning for Structured Prediction Energy Networks

03/16/2017
by   David Belanger, et al.
0

Structured Prediction Energy Networks (SPENs) are a simple, yet expressive family of structured prediction models (Belanger and McCallum, 2016). An energy function over candidate structured outputs is given by a deep network, and predictions are formed by gradient-based optimization. This paper presents end-to-end learning for SPENs, where the energy function is discriminatively trained by back-propagating through gradient-based prediction. In our experience, the approach is substantially more accurate than the structured SVM method of Belanger and McCallum (2016), as it allows us to use more sophisticated non-convex energies. We provide a collection of techniques for improving the speed, accuracy, and memory requirements of end-to-end SPENs, and demonstrate the power of our method on 7-Scenes image denoising and CoNLL-2005 semantic role labeling tasks. In both, inexact minimization of non-convex SPEN energies is superior to baseline methods that use simplistic energy functions that can be minimized exactly.

READ FULL TEXT
research
06/22/2020

Non-convex Optimization via Adaptive Stochastic Search for End-to-End Learning and Control

In this work we propose the use of adaptive stochastic search as a build...
research
10/02/2018

Learning Discriminators as Energy Networks in Adversarial Learning

We propose a novel framework for structured prediction via adversarial l...
research
02/28/2019

Scaling Matters in Deep Structured-Prediction Models

Deep structured-prediction energy-based models combine the expressive po...
research
12/29/2019

Improving Deep Neuroevolution via Deep Innovation Protection

Evolutionary-based optimization approaches have recently shown promising...
research
09/27/2019

The Differentiable Cross-Entropy Method

We study the Cross-Entropy Method (CEM) for the non-convex optimization ...
research
12/22/2018

Search-Guided, Lightly-supervised Training of Structured Prediction Energy Networks

In structured output prediction tasks, labeling ground-truth training ou...
research
07/23/2015

Deep Fishing: Gradient Features from Deep Nets

Convolutional Networks (ConvNets) have recently improved image recogniti...

Please sign up or login with your details

Forgot password? Click here to reset