Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels

10/09/2020
by   Harris Chan, et al.
6

A channel corresponds to a viewpoint or transformation of an underlying meaning. A pair of parallel sentences in English and French express the same underlying meaning, but through two separate channels corresponding to their languages. In this work, we present the Multichannel Generative Language Model (MGLM). MGLM is a generative joint distribution model over channels. MGLM marginalizes over all possible factorizations within and across all channels. MGLM endows flexible inference, including unconditional generation, conditional generation (where 1 channel is observed and other channels are generated), and partially observed generation (where incomplete observations are spread across all the channels). We experiment with the Multi30K dataset containing English, French, Czech, and German. We demonstrate experiments with unconditional, conditional, and partially conditional generation. We provide qualitative samples sampled unconditionally from the generative joint distribution. We also quantitatively analyze the quality-diversity trade-offs and find MGLM outperforms traditional bilingual discriminative models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2022

Adapting BigScience Multilingual Model to Unseen Languages

We benchmark different strategies of adding new languages (German and Ko...
research
02/21/2018

A Generative Deep Recurrent Model for Exchangeable Data

We present a novel model architecture which leverages deep learning tool...
research
10/16/2020

Generating Diverse Translation from Model Distribution with Dropout

Despite the improvement of translation quality, neural machine translati...
research
09/25/2017

Generative learning for deep networks

Learning, taking into account full distribution of the data, referred to...
research
07/21/2021

cGANs with Auxiliary Discriminative Classifier

Conditional generative models aim to learn the underlying joint distribu...
research
01/13/2018

Asymptotic Distribution of Multilevel Channel Polarization for a Certain Class of Erasure Channels

This study examines multilevel channel polarization for a certain class ...
research
10/05/2020

Acrostic Poem Generation

We propose a new task in the area of computational creativity: acrostic ...

Please sign up or login with your details

Forgot password? Click here to reset