Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)

07/22/2022
by   Huang Bojun, et al.
0

This paper discusses a new approach to the fundamental problem of learning optimal Q-functions. In this approach, optimal Q-functions are formulated as saddle points of a nonlinear Lagrangian function derived from the classic Bellman optimality equation. The paper shows that the Lagrangian enjoys strong duality, in spite of its nonlinearity, which paves the way to a general Lagrangian method to Q-function learning. As a demonstration, the paper develops an imitation learning algorithm based on the duality theory, and applies the algorithm to a state-of-the-art machine translation benchmark. The paper then turns to demonstrate a symmetry breaking phenomenon regarding the optimality of the Lagrangian saddle points, which justifies a largely overlooked direction in developing the Lagrangian method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/15/2019

Augmented Lagrangian Method for Thin Plates with Signorini Boundaries

We consider C^1-continuous approximations of the Kirchhoff plate problem...
research
09/14/2023

Symplectic and Lagrangian Polar Duality; Applications to Quantum Information Geometry

Polar duality is a well-known concept from convex geometry and analysis....
research
11/18/2019

A new interface capturing method for Allen-Cahn type equations based on a flow dynamic approach in Lagrangian coordinates, I. One-dimensional case

We develop a new Lagrangian approach — flow dynamic approach to effectiv...
research
12/21/2022

Analysis of an Explicit, High-Order Semi-Lagrangian Nodal Method

A discrete analysis of the phase and dissipation errors of an explicit, ...
research
03/30/2020

Certifiable Relative Pose Estimation

In this paper we present the first fast optimality certifier for the non...
research
03/22/2022

A Hybrid Lagrangian-Eulerian Model for the Structural Analysis of Multifield Datasets

Multifields datasets are common in a large number of research and engine...
research
09/23/2019

Persuasion and Incentives Through the Lens of Duality

Lagrangian duality underlies both classical and modern mechanism design....

Please sign up or login with your details

Forgot password? Click here to reset