Backpropagating through Structured Argmax using a SPIGOT

05/12/2018
by Hao Peng, et al.

We introduce the structured projection of intermediate gradients optimization technique (SPIGOT), a new method for backpropagating through neural networks that include hard-decision structured predictions (e.g., parsing) in intermediate layers. SPIGOT requires no marginal inference, unlike structured attention networks (Kim et al., 2017) and some reinforcement learning-inspired solutions (Yogatama et al., 2017). Like so-called straight-through estimators (Hinton, 2012), SPIGOT defines gradient-like quantities associated with intermediate nondifferentiable operations, allowing backpropagation before and after them; SPIGOT's proxy aims to ensure that, after a parameter update, the intermediate structure will remain well-formed. We experiment on two structured NLP pipelines: syntactic-then-semantic dependency parsing, and semantic parsing followed by sentiment classification. We show that training with SPIGOT leads to a larger improvement on the downstream task than a modularly-trained pipeline, the straight-through estimator, and structured attention, reaching a new state of the art on semantic dependency parsing.
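The contrast between the straight-through estimator and SPIGOT's projected proxy can be illustrated with a minimal sketch. It assumes the simplest possible case: an unstructured argmax over a handful of scores, so the set of well-formed outputs relaxes to the probability simplex. The abstract does not spell out the update rule; the form used here (take a gradient step from the hard decision, project back onto the feasible set, and use the difference as the gradient-like quantity) is our reading of the full paper, and the function names and the step size `eta` are hypothetical. Real parsing pipelines would need a projection onto a relaxation of the parse polytope instead.

```python
def argmax_one_hot(scores):
    """Forward pass: hard decision, returned as a one-hot vector."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    return [1.0 if i == best else 0.0 for i in range(len(scores))]


def project_onto_simplex(v):
    """Euclidean projection of v onto the probability simplex."""
    u = sorted(v, reverse=True)
    cumulative = 0.0
    tau = 0.0
    for k, u_k in enumerate(u, start=1):
        cumulative += u_k
        t = (cumulative - 1.0) / k
        if u_k - t > 0:
            tau = t  # the last k that passes the test gives the threshold
    return [max(x - tau, 0.0) for x in v]


def straight_through_grad(grad_z):
    """Straight-through: pass the downstream gradient to the scores unchanged."""
    return list(grad_z)


def spigot_grad(z_hat, grad_z, eta=1.0):
    """SPIGOT-style proxy (assumed form): step from z_hat against the
    downstream gradient, project back onto the feasible set, and return
    the difference between the hard decision and the projected point."""
    stepped = [z - eta * g for z, g in zip(z_hat, grad_z)]
    z_tilde = project_onto_simplex(stepped)
    return [z - zt for z, zt in zip(z_hat, z_tilde)]


z_hat = argmax_one_hot([2.0, 1.0, 0.5])   # hard decision picks index 0
grad_z = [1.0, -2.0, 0.5]                 # hypothetical downstream gradient
st = straight_through_grad(grad_z)
sp = spigot_grad(z_hat, grad_z)
```

In this toy case the straight-through quantity simply copies the downstream gradient, while the projected proxy points from the current one-hot decision toward another point that still lies on the simplex, which is the sense in which the intermediate output stays well-formed after an update.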

research
09/14/2021

Sparse Fuzzy Attention for Structured Sentiment Analysis

Attention scorers have achieved success in parsing tasks like semantic a...
research
10/05/2020

A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese

Semantic parsing is an important NLP task. However, Vietnamese is a low-...
research
02/12/2018

SparseMAP: Differentiable Sparse Structured Inference

Structured prediction requires searching over a combinatorial number of ...
research
10/13/2021

Compositional Generalization in Dependency Parsing

Compositionality, or the ability to combine familiar units like words in...
research
07/24/2019

Notes on Latent Structure Models and SPIGOT

These notes aim to shed light on the recently proposed structured projec...
research
04/17/2018

Learning Joint Semantic Parsers from Disjoint Data

We present a new approach to learning semantic parsers from multiple dat...
research
02/07/2022

Measuring and Reducing Model Update Regression in Structured Prediction for NLP

Recent advance in deep learning has led to rapid adoption of machine lea...
