Exploring Representational Disparities Between Multilingual and Bilingual Translation Models

05/23/2023
by   Neha Verma, et al.
0

Multilingual machine translation has proven immensely useful for low-resource and zero-shot language pairs. However, language pairs in multilingual models sometimes see worse performance than in bilingual models, especially when translating in a one-to-many setting. To understand why, we examine the geometric differences in the representations from bilingual models versus those from one-to-many multilingual models. Specifically, we evaluate the isotropy of the representations, to measure how well they utilize the dimensions in their underlying vector space. Using the same evaluation data in both models, we find that multilingual model decoder representations tend to be less isotropic than bilingual model decoder representations. Additionally, we show that much of the anisotropy in multilingual decoder representations can be attributed to modeling language-specific information, therefore limiting remaining representational capacity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2022

AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model

In this work, we demonstrate that multilingual large-scale sequence-to-s...
research
12/25/2019

A Study of Multilingual Neural Machine Translation

Multilingual neural machine translation (NMT) has recently been investig...
research
12/19/2021

LUC at ComMA-2021 Shared Task: Multilingual Gender Biased and Communal Language Identification without using linguistic features

This work aims to evaluate the ability that both probabilistic and state...
research
04/30/2020

Bridging linguistic typology and multilingual machine translation with multi-view language representations

Sparse language vectors from linguistic typology databases and learned e...
research
06/24/2019

Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models

We study several methods for full or partial sharing of the decoder para...
research
06/07/2016

Multilingual Visual Sentiment Concept Matching

The impact of culture in visual emotion perception has recently captured...
research
08/01/2015

Separated by an Un-common Language: Towards Judgment Language Informed Vector Space Modeling

A common evaluation practice in the vector space models (VSMs) literatur...

Please sign up or login with your details

Forgot password? Click here to reset