Dual Learning: Theoretical Study and an Algorithmic Extension

05/17/2020
by   Zhibing Zhao, et al.
0

Dual learning has been successfully applied in many machine learning applications including machine translation, image-to-image transformation, etc. The high-level idea of dual learning is very intuitive: if we map an x from one domain to another and then map it back, we should recover the original x. Although its effectiveness has been empirically verified, theoretical understanding of dual learning is still very limited. In this paper, we aim at understanding why and when dual learning works. Based on our theoretical analysis, we further extend dual learning by introducing more related mappings and propose multi-step dual learning, in which we leverage feedback signals from additional domains to improve the qualities of the mappings. We prove that multi-step dual learn-ing can boost the performance of standard dual learning under mild conditions. Experiments on WMT 14 EnglishGerman and MultiUNEnglishFrench translations verify our theoretical findings on dual learning, and the results on the translations among English, French, and Spanish of MultiUN demonstrate the effectiveness of multi-step dual learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2016

Dual Learning for Machine Translation

While neural machine translation (NMT) is making good progress in the pa...
research
10/07/2020

Dual Reconstruction: a Unifying Objective for Semi-Supervised Neural Machine Translation

While Iterative Back-Translation and Dual Learning effectively incorpora...
research
03/23/2022

An introduction to using dual quaternions to study kinematics

We advocate for the use of dual quaternions to represent poses, twists, ...
research
07/03/2017

Dual Supervised Learning

Many supervised learning tasks are emerged in dual forms, e.g., English-...
research
09/21/2021

One Source, Two Targets: Challenges and Rewards of Dual Decoding

Machine translation is generally understood as generating one target tex...
research
11/01/2018

Unsupervised Dual-Cascade Learning with Pseudo-Feedback Distillation for Query-based Extractive Summarization

We propose Dual-CES -- a novel unsupervised, query-focused, multi-docume...
research
06/10/2021

One Sense Per Translation

The idea of using lexical translations to define sense inventories has a...

Please sign up or login with your details

Forgot password? Click here to reset