On the Strengths of Cross-Attention in Pretrained Transformers for Machine Translation

04/18/2021
by   Mozhdeh Gheini, et al.
10

We study the power of cross-attention in the Transformer architecture within the context of machine translation. In transfer learning experiments, where we fine-tune a translation model on a dataset with one new language, we find that, apart from the new language's embeddings, only the cross-attention parameters need to be fine-tuned to obtain competitive BLEU performance. We provide insights into why this is the case and further find that limiting fine-tuning in this manner yields cross-lingually aligned type embeddings. The implications of this finding include a mitigation of catastrophic forgetting in the network and the potential for zero-shot translation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2022

Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation

In this paper, we explore the challenging problem of performing a genera...
research
04/30/2020

On the Evaluation of Contextual Embeddings for Zero-Shot Cross-Lingual Transfer Learning

Pre-trained multilingual contextual embeddings have demonstrated state-o...
research
03/31/2021

Zero-Shot Language Transfer vs Iterative Back Translation for Unsupervised Machine Translation

This work focuses on comparing different solutions for machine translati...
research
12/11/2020

Orthogonal Language and Task Adapters in Zero-Shot Cross-Lingual Transfer

Adapter modules, additional trainable parameters that enable efficient f...
research
02/01/2023

An Evaluation of Persian-English Machine Translation Datasets with Transformers

Nowadays, many researchers are focusing their attention on the subject o...
research
10/21/2021

StyleAlign: Analysis and Applications of Aligned StyleGAN Models

In this paper, we perform an in-depth study of the properties and applic...
research
10/22/2020

Not all parameters are born equal: Attention is mostly what you need

Transformers are widely used in state-of-the-art machine translation, bu...

Please sign up or login with your details

Forgot password? Click here to reset