Is Multilingual BERT Fluent in Language Generation?

10/09/2019
by   Samuel Rönnqvist, et al.
0

The multilingual BERT model is trained on 104 languages and meant to serve as a universal language model and tool for encoding sentences. We explore how well the model performs on several languages across several tasks: a diagnostic classification probing the embeddings for a particular syntactic property, a cloze task testing the language modelling ability to fill in gaps in a sentence, and a natural language generation task testing for the ability to produce coherent text fitting a given context. We find that the currently available multilingual BERT model is clearly inferior to the monolingual counterparts, and cannot in many cases serve as a substitute for a well-trained monolingual model. We find that the English and German models perform well at generation, whereas the multilingual model is lacking, in particular, for Nordic languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/27/2021

gaBERT – an Irish Language Model

The BERT family of neural language models have become highly popular due...
research
07/22/2021

Evaluation of contextual embeddings on less-resourced languages

The current dominance of deep neural networks in natural language proces...
research
05/07/2021

Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation

Concept-to-text Natural Language Generation is the task of expressing an...
research
04/11/2022

Adapting BigScience Multilingual Model to Unseen Languages

We benchmark different strategies of adding new languages (German and Ko...
research
08/21/2018

Translational Grounding: Using Paraphrase Recognition and Generation to Demonstrate Semantic Abstraction Abilities of MultiLingual NMT

In this paper, we investigate whether multilingual neural translation mo...
research
07/14/2022

Language Modelling with Pixels

Language models are defined over a finite set of inputs, which creates a...
research
08/03/2018

Lightweight Multilingual Software Analysis

Developer preferences, language capabilities and the persistence of olde...

Please sign up or login with your details

Forgot password? Click here to reset