Policy Gradient as a Proxy for Dynamic Oracles in Constituency Parsing

06/08/2018
by Daniel Fried, et al.

Dynamic oracles provide strong supervision for training constituency parsers with exploration, but must be custom defined for a given parser's transition system. We explore using a policy gradient method as a parser-agnostic alternative. In addition to directly optimizing for a tree-level metric such as F1, policy gradient has the potential to reduce exposure bias by allowing exploration during training; moreover, it does not require a dynamic oracle for supervision. On four constituency parsers in three languages, the method substantially outperforms static oracle likelihood training in almost all settings. For parsers where a dynamic oracle is available (including a novel oracle which we define for the transition system of Dyer et al. 2016), policy gradient typically recaptures a substantial fraction of the performance gain afforded by the dynamic oracle.
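To make the training signal described above concrete, the following is a minimal sketch of a REINFORCE-style policy gradient estimate that uses a tree-level F1 score as the reward. It is not the authors' implementation. Everything in it is an assumption made for illustration: the toy "policy" is a table of per-step logits that ignores parser state, each action simply picks a label for a fixed span, the baseline is the mean reward over the samples, and all names (`bracket_f1`, `policy_gradient`, `train`, `LABELS`, `SPANS`, `GOLD`) are hypothetical. A real parser would condition on its stack and buffer and would use its own transition system.

```python
import math
import random


def bracket_f1(predicted, gold):
    """Labeled-bracket F1 between two sets of (label, start, end) spans."""
    predicted, gold = set(predicted), set(gold)
    matched = len(predicted & gold)
    if matched == 0:
        return 0.0
    precision = matched / len(predicted)
    recall = matched / len(gold)
    return 2 * precision * recall / (precision + recall)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_actions(logits_per_step, rng):
    """Sample one action index per step from the current (toy) policy."""
    actions = []
    for logits in logits_per_step:
        probs = softmax(logits)
        actions.append(rng.choices(range(len(probs)), weights=probs)[0])
    return actions


def policy_gradient(logits_per_step, reward_fn, num_samples, rng):
    """Sampled estimate of d E[reward] / d logits with a mean-reward baseline."""
    samples = [sample_actions(logits_per_step, rng) for _ in range(num_samples)]
    rewards = [reward_fn(s) for s in samples]
    baseline = sum(rewards) / len(rewards)
    grad = [[0.0] * len(logits) for logits in logits_per_step]
    for actions, reward in zip(samples, rewards):
        advantage = reward - baseline
        for t, (logits, a) in enumerate(zip(logits_per_step, actions)):
            probs = softmax(logits)
            for k in range(len(logits)):
                # d log softmax(a) / d logit_k = 1[k == a] - p_k
                indicator = 1.0 if k == a else 0.0
                grad[t][k] += advantage * (indicator - probs[k]) / num_samples
    return grad


# Hypothetical toy task: choose a label for each of three fixed spans.
LABELS = ["NP", "VP", "S"]
SPANS = [(0, 2), (2, 5), (0, 5)]
GOLD = {("NP", 0, 2), ("VP", 2, 5), ("S", 0, 5)}


def reward(actions):
    """Tree-level reward: F1 of the sampled labeled spans against GOLD."""
    predicted = {(LABELS[a], s, e) for a, (s, e) in zip(actions, SPANS)}
    return bracket_f1(predicted, GOLD)


def train(steps=200, num_samples=10, lr=1.0, seed=0):
    """Gradient ascent on expected F1, starting from a uniform policy."""
    rng = random.Random(seed)
    logits = [[0.0] * len(LABELS) for _ in SPANS]
    for _ in range(steps):
        grad = policy_gradient(logits, reward, num_samples, rng)
        for t in range(len(logits)):
            for k in range(len(LABELS)):
                logits[t][k] += lr * grad[t][k]
    return logits
```

Because the actions are sampled from the model's own distribution rather than read off a gold derivation, the model is trained on states it reaches itself, and the reward needs only a tree-level score rather than a per-state oracle. Calling `train()` returns the updated logits; the number of steps, samples, and the learning rate here are arbitrary choices for the toy problem.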


