Natural language generation (NLG) is the task of automatically generating meaningful texts, typically from a non-linguistic or textual representation of information Covington (2001). These texts generally aim to realize an underlying communicative goal while remaining coherent with the input information and grammatically correct. Multilingual text generation extends this task to producing texts in different languages, which is important for overcoming language barriers and enabling universal information access for the world’s citizens Artetxe et al. (2020); Arivazhagan et al. (2019).
However, most of the released text generation datasets are limited to English Rajpurkar et al. (2018); Ritter et al. (2011); Akoury et al. (2020). This limits the ability of researchers to understand many aspects of multilingual text generation related to open vocabularies and complex morphology Kageura (2012). To this end, some works have proposed text generation datasets in non-English languages, such as a multilingual question answering dataset Longpre et al. (2020), a cross-lingual summarization dataset Ladhak et al. (2020), and monolingual text generation datasets covering many languages Liang et al. (2020); Gehrmann et al. (2021). However, they either apply to a single task Longpre et al. (2020); Ladhak et al. (2020) or are limited to non-parallel data across different languages Liang et al. (2020); Gehrmann et al. (2021), which constrains the possible evaluation scenarios. A benchmark that enables the comprehensive evaluation of both parallel and non-parallel multilingual testing scenarios on a diverse range of languages and generation tasks is still missing.
In this paper, we propose MTG, a new benchmark suite for multilingual text generation and evaluation, to address the above problems. MTG is a human-annotated multi-way parallel dataset with three text generation tasks (story generation, question generation, and title generation) across four languages (English, German, French, and Spanish). The multi-way parallel feature means the data is fully parallel across all four languages, providing 12 cross-lingual pairs and 4 monolingual pairs for each task. With the dataset in hand, we evaluate several strong, representative pre-trained models, including multilingual BERT (M-BERT) Devlin et al. (2019), XLM Lample and Conneau (2019), mBART Liu et al. (2020), and mT5 Xue et al. (2020), under six evaluation scenarios: monolingual training, multilingual training, monolingual-multitask training, multilingual-multitask training, cross-lingual generation, and zero-shot transfer.
In summary, the contributions of this paper are as follows: (i) We release a new text generation benchmark suite MTG covering three tasks across four languages, with human-annotated multi-way parallel data for each task. (ii) Based on the multi-way parallel characteristic, we provide six different test scenarios: monolingual, multilingual, monolingual-multitask, multilingual-multitask, cross-lingual generation, and zero-shot transfer. We evaluate several representative pre-trained models on different tasks and scenarios and give an extensive analysis of the experimental results from different aspects. We further conduct qualitative experiments to verify the advantages of human-annotated multi-way parallel data. (iii) We propose a new evaluation metric and show that it correlates better with human scores than other automatic metrics.
2 Related Work
2.1 Multilingual Dataset
BENG Moussallem et al. (2020) is a benchmarking platform for natural language generation and knowledge extraction systems, which is limited to English data. XTREME Hu et al. (2020) is a multilingual understanding benchmark across languages and tasks, but it does not cover any generation task. Jiang et al. (2020) propose X-FACTR, a cross-lingual factual retrieval benchmark. Longpre et al. (2020) propose MKQA, an open-domain question answering evaluation dataset covering diverse languages. Ladhak et al. (2020) present WikiLingua, a large-scale multilingual dataset for cross-lingual abstractive summarization systems. Wiki-40B Guo et al. (2020) is a multilingual language model dataset across many languages. Although these datasets cover multiple languages, each belongs to a single, specific generation task, which prevents researchers from obtaining general findings across a set of tasks. XGLUE Liang et al. (2020) is a cross-lingual benchmark dataset with nine understanding tasks and two generation tasks. GEM Gehrmann et al. (2021) is a newly presented natural language generation benchmark covering a range of tasks. A marked difference between our MTG and these benchmarks is that MTG is parallel across all languages, which enables more testing scenarios.
2.2 Multilingual Modeling
Multilingual pre-trained models can bring better performance on downstream tasks. Multilingual BERT (M-BERT) Devlin et al. (2019) is a single language model pre-trained on monolingual corpora in 104 languages with the Masked Language Modeling task. XLM Lample and Conneau (2019) is pre-trained simultaneously with the Masked Language Modeling task and the Translation Language Modeling task, and was later extended to a RoBERTa version called XLM-R Conneau et al. (2019). Unicoder Huang et al. (2019) further leverages more cross-lingual pre-training tasks and achieves better results on XNLI than XLM. Multilingual BART (mBART) Liu et al. (2020) is a pre-trained encoder-decoder model trained with a denoising auto-encoding objective on monolingual data over 25 languages. Multilingual T5 (mT5) Xue et al. (2020) is a multilingual variant of T5 leveraging a unified text-to-text format; it is pre-trained with a span-corruption version of the Masked Language Modeling objective over 101 languages.
3 Dataset Collection and Methodology
In this section, we introduce how the benchmark suite for multilingual text generation (MTG) is created. First, several criteria for selecting the tasks and datasets are described. Then one language is chosen as the starting language, and the multi-way dataset is constructed by translating from the starting language to the other languages, which are selected according to a few principles. Finally, the data annotation process is described in detail to ensure the quality of the dataset.
|Task|Dataset|Domain|Format|Goal|
|---|---|---|---|---|
|Story Generation|ROCStories|Daily life|<story>|Generate the end of the story|
|Question Generation|SQuAD 1.0|Wikipedia|<passage, answer, question>|Generate the question of the answer|
|Title Generation|ByteCup|News|<article, title>|Generate the title of the document|
3.1 Task and Dataset Selection
There are plenty of generation tasks in monolingual text generation. It is important to select suitable tasks for our MTG benchmark to make it diverse and challenging. Thus, we define several criteria during the task selection:
Task Definition Tasks should be well-defined, which means that humans can easily determine whether the generated results meet the task requirements. Besides, these tasks should have been well-studied in one language and rarely been studied in multilingual scenarios.
Task Difficulty Tasks should be solvable by most college-educated speakers. Beyond that, they should be challenging to current models, the performance of which in various test scenarios falls short of human performance.
Task Diversity Tasks should cover a wide range of relevant generation challenges that allow for findings to be as general as possible.
Input Format The input format of the tasks needs to be as simple as possible in order to reduce the difficulty of data processing. Besides, the input should contain only text, not other modalities such as images or videos.
In order to meet the above criteria, 8 domain experts are asked to vote among 10 typical generation tasks (story generation, commonsense generation, style transfer, question generation, question answering, dialogue generation, title generation, text summarization, image captioning, and data-to-text). Finally, three generation tasks are selected for MTG: story generation, question generation, and title generation. Story generation (SG) aims to generate the end of a given story context, which requires the model to understand the story context and generate a reasonable and fluent ending Guan et al. (2019). Question generation (QG) aims to generate a correct question for a given passage and its answer Duan et al. (2017). For the same passage with different answers, the system should be able to generate different questions. Title generation (TG) converts a given article into a condensed sentence while preserving its main idea Jin and Hauptmann (2002). The title should be faithful to the original document while encouraging users to read the news. These three tasks are very different from each other and focus on different generative abilities.
After determining the tasks, the next step is to choose the dataset for each task. The two selection principles are as follows: (1) License: Task data must be available under licenses that allow use and redistribution for research purposes. The dataset should be freely available for download. (2) Quality: The dataset size should be as large as possible and the quality should be checked.
Based on these accessibility and quality concerns, English is selected as the starting language and English datasets for the three tasks are gathered. We choose ROCStories Mostafazadeh et al. (2016) for story generation, SQuAD Rajpurkar et al. (2016) for question generation, and ByteCup (https://www.biendata.xyz/competition/bytecup2018/) for title generation. These datasets are popular in the corresponding fields and have been verified to be high-quality by many works. Moreover, they are all under permissive licenses. A specific example of question generation is shown in Figure 1. An overview of all task datasets is shown in Table 1.
3.2 Language Selection
The original datasets are only in English (en), and we want to extend them to multi-way parallel scenarios. This means that all English text should be translated into the other languages, which would incur an expensive annotation cost. Thus, a state-of-the-art translator is used to do the translation, and annotators are then asked to correct the translated text. Accordingly, MTG should contain languages that (1) have good English-to-X translators and (2) are diverse in language family. Finally, German (de), French (fr), and Spanish (es) are chosen. German is from the same language branch as English, while French and Spanish are from a different one. In the future, more distant languages, such as Chinese, will be added to MTG to discover more interesting results across various languages.
3.3 Data Collection
After determining the tasks and languages, we introduce the data annotation process used to build MTG. First, the Google Translator (https://translate.google.com/) is used to translate the English datasets into the selected languages. Then, the same translator is used to translate the results back to English. If the n-gram overlap ratio between the original English text and the back-translated one is less than a set threshold, the example is removed. Several threshold values are tried; a higher threshold causes the QG training data size to drop sharply, so we choose a threshold that improves the quality of the filtered data while still retaining most of the original training data. (The detailed sizes of the filtered datasets for different thresholds are included in the appendix.) All four languages are aligned to ensure the dataset is multi-way parallel. Afterward, 87k, 72k, and 280k roughly parallel training examples for SG, QG, and TG are constructed.
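The round-trip filtering step above can be sketched as follows. This is a minimal illustration of an n-gram overlap ratio; the whitespace tokenization, the choice of n, and the threshold value are assumptions, not the paper's exact settings:

```python
from collections import Counter

def ngram_overlap(reference: str, candidate: str, n: int = 2) -> float:
    """Fraction of the reference's n-grams that also appear in the candidate."""
    def ngrams(text):
        tokens = text.lower().split()
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    ref, cand = ngrams(reference), ngrams(candidate)
    if not ref:
        return 0.0
    matched = sum(min(count, cand[gram]) for gram, count in ref.items())
    return matched / sum(ref.values())

def keep_example(original_en: str, back_translated_en: str,
                 threshold: float = 0.5) -> bool:
    # Drop the example when the round-trip (en -> X -> en) translation
    # diverges too much from the original English text.
    # The threshold value here is illustrative only.
    return ngram_overlap(original_en, back_translated_en) >= threshold
```

In this sketch, an example survives only if most of its original n-grams are preserved after translating to the target language and back, which filters out sentences the machine translator handles poorly.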
10k samples of each task are randomly chosen from the merged development and test sets as data for annotation. The annotators are required to further check the translated results based on the following rules: (1) Relevance: whether the target text is meaningful and relevant to the source text. (2) Fluency: whether the translated text is grammatically correct. (3) Style: whether the translation follows local culture, language conventions, and gender-related words.
If the translated text violates any of the above rules, annotators correct it accordingly. The annotated data is then split into 5k/2k/3k training/development/test subsets.
Then the multi-way parallel generation benchmark MTG is finally completed. It contains three different generation tasks (SG, QG, TG) in four languages (en, de, fr, es). Each example is fully parallel across the four languages, which means we can take the input in one language and use the output in another language. The statistics of MTG are shown in Table 2.
|Task|SG/QG/TG|
|---|---|
|*For each language*| |
|Rough training size|87k/72k/280k|
|Annotated training size|5k/5k/5k|
|Annotated development size|2k/2k/2k|
|Annotated test size|3k/3k/3k|
|*For four languages (en, de, fr, es)*| |
|Total annotated size|120k|
|Total dataset size|1.87m|
3.4 Annotation Process
A team of 10 full-time experts (3 language experts for German, 3 for French, and 4 for Spanish) is hired to do the annotation; they are paid daily. Some part-time workers (16 participated in the German annotation, 39 in French, and 4 in Spanish) are also employed to increase the annotation throughput; they are paid by the number of annotations. Each annotator is an expert in at least two languages (English and another target language). They are first trained on how to correct translation errors according to the above rules and then annotate a small number of examples as a test. We recheck these examples and give feedback to help them understand the tasks. After this annotation training process, the annotators start annotating the dataset. For quality control, we sample from the produced annotations and arrange 9 experts to recheck them. Each example is assigned to two other experts, and the data is qualified only if both of them agree that the annotation is correct (the grammar, expressions, and punctuation of the annotated text are completely correct, and the expressions accord with the conventions of the target language). If too large a fraction of an annotator's annotations fail, all of that annotator's data for that day is re-checked.
The annotation process takes several days to finish. Full-time experts are paid a fixed daily salary and part-time annotators are paid per example; both rates are above the local minimum hourly wage.
4 Experiments
In this section, we conduct extensive experiments to benchmark the difficulty of our proposed MTG via several state-of-the-art multilingual models under different scenarios.
4.1 Evaluation Models
The performance of the following four most popular multilingual pre-trained models is explored:
M-BERT Multilingual BERT (M-BERT) Devlin et al. (2019) is a single language model pre-trained on monolingual corpora in 104 languages with the Masked Language Modeling (MLM) task. M-BERT leverages a shared WordPiece vocabulary and a 12-layer Transformer encoder.
XLM The Cross-Lingual Language Model (XLM) Lample and Conneau (2019) is pre-trained simultaneously with the Masked Language Modeling (MLM) task on monolingual data and the Translation Language Modeling (TLM) task on parallel data. XLM uses a shared vocabulary of byte-pair encoded (BPE) subwords Sennrich et al. (2016).
mBART Multilingual BART (mBART) Liu et al. (2020) is a pre-trained encoder-decoder model trained with a denoising auto-encoding objective on monolingual data over 25 languages. mBART uses a shared SentencePiece vocabulary and consists of a 12-layer encoder and a 12-layer decoder.
mT5 Multilingual T5 (mT5) Xue et al. (2020) is a multilingual variant of T5 leveraging a unified text-to-text format. It is pre-trained with a span-corruption version of the Masked Language Modeling objective over 101 languages.
4.2 Evaluation Scenarios
A salient feature of MTG is that it is multi-way parallel across all languages. Thus, we conduct experiments in various scenarios to demonstrate the wide usability of MTG.
Monolingual fine-tuning The pre-trained model is trained for a downstream task on the training data of a specific language and evaluated on the test set of the same language. The input and output of the downstream task are in the same language.
Monolingual Multitask fine-tuning A universal model for all tasks is trained, but it is still language-specific as in monolingual fine-tuning.
Multilingual fine-tuning The pre-trained model is jointly finetuned with data in all languages for a specific task. Different from the monolingual fine-tuning setting, there is only one model for each downstream task, which can serve all languages.
Multilingual Multitask fine-tuning Rather than only combining training data of all languages as in multilingual fine-tuning, data from all languages and all tasks are gathered to train a single model for all languages and tasks.
Cross-lingual generation Since MTG is multi-way parallel, it can be reorganized to create input-output pairs in different languages. For example, in title generation, a sample can consist of a source document in English and a target title in German. For a multilingual dataset with n languages, n × (n − 1) directed input-output pairs can be constructed, since direction matters. The cross-lingual generation performance in all directions is evaluated.
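Building the directed cross-lingual pairs from one multi-way parallel sample can be sketched as follows; the field names and texts are hypothetical placeholders, not actual MTG data:

```python
from itertools import permutations

# Multi-way parallel example: the same title-generation instance in all
# four languages (placeholder texts for illustration only).
sample = {
    "en": {"article": "An English news article ...", "title": "An English title"},
    "de": {"article": "Ein deutscher Artikel ...", "title": "Ein deutscher Titel"},
    "fr": {"article": "Un article francais ...", "title": "Un titre francais"},
    "es": {"article": "Un articulo espanol ...", "title": "Un titulo espanol"},
}

def make_pairs(parallel_sample, src_field="article", tgt_field="title"):
    """Build every directed (source-language input, target-language output) pair."""
    return {
        (src, tgt): (parallel_sample[src][src_field], parallel_sample[tgt][tgt_field])
        for src, tgt in permutations(parallel_sample, 2)
    }

pairs = make_pairs(sample)
# With 4 languages this yields 4 * 3 = 12 directed cross-lingual pairs,
# matching the n * (n - 1) count for n = 4.
```

Each key is an ordered (source, target) language pair, so en->de and de->en are distinct training directions.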
Zero-shot transfer We also explore zero-shot generation on the three tasks. The model is finetuned on a specific task with English input and output, and then used to generate output in other languages given a language tag.
4.3 Evaluation Metrics
The quality of the generated texts is evaluated from different aspects to fully understand model performance. Moreover, we propose a new ensemble metric that has higher correlation with human annotation scores.
N-gram based Metrics N-gram-based metrics evaluate the text-overlapping score between the outputs and references. The following three metrics are used: (1) BLEU Papineni et al. (2002) is a popular metric that calculates the word-overlap scores between the generated texts and gold-standard ones. We use the BLEU-4, which is the average score for unigram, bigram, trigram, and 4-gram. (2) ROUGE Lin (2004) is a recall-oriented metric that counts the number of overlapping units such as n-gram, word sequences, and word pairs between the produced texts and gold-standard ones. ROUGE-L calculates the overlapping of the longest common subsequence between generated results and the references. (3) METEOR Banerjee and Lavie (2005) relies on semantic features to predict the similarity scores between system hypotheses and human references.
Embedding based Metrics The embedding-based metrics can, to a large extent, capture the semantic-level similarity between the generated texts and the ground truth. (1) BERTScore Zhang et al. (2019) computes the token similarity of candidates and references as a sum of cosine similarities between tokens using pre-trained BERT contextual embeddings. (2) BLEURT Sellam et al. (2020) is a metric learned from a diverse set of lexical- and semantic-level supervision via the BERT-base architecture. However, it currently only supports English.
Diversity Metrics We also employ the distinct metric Li et al. (2016), which calculates the proportion of the distinct n-grams in all the system hypotheses and can be used to evaluate the diversity of the generated texts. In this paper, we choose the distinct-1 for unigram diversity.
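The distinct-n computation described above can be sketched as follows; whitespace tokenization and lowercasing are assumptions for illustration:

```python
def distinct_n(hypotheses, n=1):
    """Ratio of unique n-grams to total n-grams over all system outputs.

    Higher values indicate more diverse generations; distinct-1 (n=1)
    measures unigram diversity as used in the paper.
    """
    total, unique = 0, set()
    for hyp in hypotheses:
        tokens = hyp.lower().split()
        grams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        total += len(grams)
        unique.update(grams)
    return len(unique) / total if total else 0.0
```

For example, the two outputs "the cat" and "the dog" contain 4 unigrams of which 3 are unique, giving a distinct-1 of 0.75.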
Human Evaluation Human evaluation is also leveraged to better understand the quality of model outputs. Specifically, cases are randomly sampled from the test set for each task and language and presented to human annotators together with the model outputs. The annotators evaluate each model output on three aspects (Grammar, Fluency, and Relevance) and give a score from 1 to 5 for each aspect. The detailed annotation rules are in the appendix.
Ensemble Metric For the samples with human annotations, their automatic metric scores (except for BLEURT, because it only supports English) are gathered as features, and the human-annotated scores serve as targets. All these data are split into training, development, and test sets. After comparing the performance of different regression models, as shown in Table 3, we finally choose a gradient boosting regression model as the ensemble metric.
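The ensemble metric can be sketched with scikit-learn's GradientBoostingRegressor. The synthetic features and targets below are illustrative stand-ins for the paper's actual automatic-metric scores and human annotations:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import train_test_split

# Synthetic stand-in data: each row holds automatic-metric scores for one
# sample (e.g. BLEU, ROUGE-L, METEOR, BERTScore, distinct-1), and y is a
# stand-in human score in [1, 5].
rng = np.random.default_rng(0)
X = rng.random((500, 5))
y = 1 + 4 * X.mean(axis=1)  # toy target correlated with the features

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = GradientBoostingRegressor(random_state=0).fit(X_train, y_train)
preds = model.predict(X_test)

# Pearson correlation between the ensemble metric's predictions and the
# (here synthetic) human scores on the held-out split.
corr = np.corrcoef(preds, y_test)[0, 1]
```

The regressor learns how to weight and combine the individual metrics so that its output tracks human judgments more closely than any single metric does.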
The correlations between the automatic metrics and human evaluation scores on the above test set are displayed in Figure 2. Our ensemble metric outperforms the other three metrics (comparisons with all other automatic metrics are included in the appendix; our ensemble metric still achieves the best performance there).
We finetune the state-of-the-art multilingual pre-trained models on the three tasks in MTG and evaluate their performance under different scenarios. Due to limited space, we list part of the results in the main body; the full results and experimental settings can be found in the appendix.
Monolingual and Multilingual
In most cases, multilingual training improves model performance. As shown in Figure 3, for M-BERT, mBART, and mT5, training with data in all languages improves performance on all tasks compared with monolingual training. In the multitask setting, multilingual training also brings performance improvements. Compared with the other models, multilingual training brings a larger performance gain for mT5, because mT5 has more parameters and performs better as the training data size increases.
Multitask and Single-task
Besides multilingual training, we also explore the influence of multitask training by training with all tasks' data. As shown in Figure 3, compared with single-task training on monolingual data, the multitask counterparts tend to boost performance. With multilingual data, however, adding the multitask training objective can sometimes cause a performance decline, especially for story generation. This is because the reference of story generation is less deterministic than those of title generation and question generation; the difference between story generation and the other two tasks causes the multitask training performance to drop.
In this paper, we make use of the multi-way parallel data for supervised cross-lingual training; e.g., for English-centric cross-lingual training, we take the English source as the input and the parallel German, French, and Spanish targets as the output. We then evaluate the model in the same settings (en->de, en->es, en->fr). Figure 4 contains the cross-lingual results centered on each of the four languages, along with the monolingual results. Compared with the other three models, XLM performs best in most cases (cells for XLM are closer to gray), because XLM makes use of parallel data for TLM during the pre-training phase, which gives it cross-lingual capabilities. Besides, mT5 outperforms mBART under the cross-lingual setting, although both use only monolingual corpora for pre-training. The reason is that mT5 is pre-trained on 101 languages, compared with 25 languages for mBART.
On the other hand, as displayed in Figure 4, it is much easier for models to transfer to English than to German (nearly all English columns are gray, while most cells in the German columns are red). This is because the word order of German differs more from that of the other three languages.
We also explore zero-shot transfer by training on English input and output and then generating output in other languages directly, given an English input and a language tag. Different from the cross-lingual setting, the model does not see en->x (x = de, es, fr) data during training. Table 19 indicates that XLM still outperforms the other models by a large margin on the SG and QG tasks. However, mT5 fails because it does not use language tags during pre-training: since we only use en->en data to finetune the models on the downstream tasks, mT5 has never seen the tags of the target languages and cannot generate the corresponding language.
Table 5 presents the human evaluation scores for TG in four evaluation scenarios. From the table, we find that XLM and mT5 usually obtain higher scores than M-BERT and mBART, because they are pre-trained with more languages and better pre-training tasks, which enables them to better fuse the semantic spaces of different languages.
Based on the analysis above, we can draw several conclusions: (1) Multilingual training boosts model performance in most cases. (2) German is harder for cross-lingual models to generate than the other three languages. (3) XLM has the best cross-lingual performance across all three tasks, and it maintains superior performance in the zero-shot setting.
Pseudo vs. Annotated
To answer the question of whether the 5k annotated examples help the model generate better text, we use the rough training data filtered by back-translation for the first finetuning stage and the annotated training data for the second stage. We conduct an ablation study of this two-stage finetuning on QG under all evaluation scenarios with mBART and present the results in Figure 5. To compare the performance of the two stages, we also conduct t-tests, which show that the improvement from the annotated training data is significant in nearly all settings (except for the Multilingual-Multitask setting; the average scores and p-values are displayed in the appendix). This illustrates that although the amount of annotated data is small, it can further improve performance. It also highlights the necessity of human-annotated multilingual data compared with pseudo-parallel data obtained via machine translation.
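The significance test can be sketched as a paired t-test over per-sample metric scores from the two finetuning stages. This is a generic illustration rather than the paper's exact test setup; in practice, scipy.stats.ttest_rel would also return the p-value:

```python
import math
from statistics import mean, stdev

def paired_t_statistic(stage1_scores, stage2_scores):
    """t statistic of a paired t-test over per-sample scores.

    A large positive value indicates that the second stage (annotated-data
    finetuning) scores significantly higher than the first stage.
    """
    diffs = [b - a for a, b in zip(stage1_scores, stage2_scores)]
    n = len(diffs)
    sd = stdev(diffs)  # sample standard deviation of the paired differences
    return mean(diffs) / (sd / math.sqrt(n))
```

The t statistic is then compared against the t-distribution with n − 1 degrees of freedom to obtain a p-value.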
Translation vs. Cross-lingual
Different from cross-lingual generation, we can also generate output in the same language as the input and then translate it into another language. Thus, we compare the results of cross-lingual generation with a generation-translation baseline, which uses the English generation model to produce English output and then translates it into German, French, and Spanish. The results are plotted in Figure 6. As the figure shows, the cross-lingual generation model outperforms the generation-translation baseline in almost all tasks and languages. This means that supervised cross-lingual generation is the better solution when the source and target are in different languages. Our multi-way parallel multilingual benchmark provides plenty of cross-lingual data in different directions, which will encourage research on cross-lingual generation.
5 Conclusion
In this paper, we propose a multilingual benchmark MTG for text generation. It contains three typical generation tasks: story, question, and title generation. The key feature of MTG is its multi-way parallel data across four languages: English, German, French, and Spanish. This enables the benchmark to support more evaluation scenarios, such as cross-lingual training and zero-shot transfer. We also benchmark state-of-the-art multilingual pre-trained models on MTG with different metrics to explore its features and challenges and to promote research and progress in multilingual text generation.
6 Ethics Considerations
Since we propose a new multilingual text generation benchmark MTG, we address some possible ethical considerations in this section.
We choose ROCStories, SQuAD 1.0, and ByteCup as the English datasets for the story, question, and title generation tasks. All of them are available for research use under their licenses, and they can be downloaded freely from their websites (ROCStories requires some necessary contact information). We ensure that these datasets are only used for academic research and that the dataset construction process is consistent with the intellectual property and privacy rights of the original authors.
As described in Section 3.4, we hire some full-time and part-time language experts to do the annotation and all of them are paid fairly. Their salary is higher than the local minimum average hourly wage. They are voluntary participants and are aware of any risks of harm associated with their participation. The annotation process is consistent with the intellectual property and privacy rights of the recruited annotators as well.
HUSH: a dataset and platform for human-in-the-loop story generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 6470–6484. Cited by: §1.
Massively multilingual neural machine translation in the wild: findings and challenges. arXiv preprint arXiv:1907.05019. Cited by: §1.
- On the cross-lingual transferability of monolingual representations. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Cited by: §1.
- METEOR: an automatic metric for mt evaluation with improved correlation with human judgments. In Proceedings of the acl workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization, pp. 65–72. Cited by: §4.3.
- Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116. Cited by: §2.2.
- Building natural language generation systems. Language 77 (3), pp. 611–612. Cited by: §1.
- BERT: pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Cited by: §1, §2.2, §4.1.
- Question generation for question answering. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 866–874. Cited by: §3.1.
- The gem benchmark: natural language generation, its evaluation and metrics. arXiv preprint arXiv:2102.01672. Cited by: §1, §2.1.
Story ending generation with incremental encoding and commonsense knowledge.
Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, pp. 6473–6480. Cited by: §3.1.
- Wiki-40b: multilingual language model dataset. In Proceedings of The 12th Language Resources and Evaluation Conference, pp. 2440–2452. Cited by: §2.1.
XTREME: a massively multilingual multi-task benchmark for evaluating cross-lingual generalisation.
International Conference on Machine Learning, pp. 4411–4421. Cited by: §2.1.
- Unicoder: a universal language encoder by pre-training with multiple cross-lingual tasks. arXiv preprint arXiv:1909.00964. Cited by: §2.2.
- X-factr: multilingual factual knowledge retrieval from pretrained language models. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 5943–5959. Cited by: §2.1.
- A new probabilistic model for title generation. In COLING 2002: The 19th International Conference on Computational Linguistics, Cited by: §3.1.
- The quantitative analysis of the dynamics and structure of terminologies. Vol. 15, John Benjamins Publishing. Cited by: §1.
- WikiLingua: a new benchmark dataset for cross-lingual abstractive summarization. arXiv preprint arXiv:2010.03093. Cited by: §1, §2.1.
- Cross-lingual language model pretraining. arXiv preprint arXiv:1901.07291. Cited by: §1, §2.2, §4.1.
- A diversity-promoting objective function for neural conversation models. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 110–119. Cited by: §4.3.
- XGLUE: a new benchmark dataset for cross-lingual pre-training, understanding and generation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 6008–6018. Cited by: §1, §2.1.
- Rouge: a package for automatic evaluation of summaries. In Text summarization branches out, pp. 74–81. Cited by: §4.3.
- Multilingual denoising pre-training for neural machine translation. Transactions of the Association for Computational Linguistics 8, pp. 726–742. Cited by: §1, §2.2, §4.1.
- MKQA: a linguistically diverse benchmark for multilingual open domain question answering. arXiv preprint arXiv:2007.15207. Cited by: §1, §2.1.
- A corpus and cloze evaluation for deeper understanding of commonsense stories. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 839–849. Cited by: §3.1.
- A general benchmarking framework for text generation. In Proceedings of the 3rd International Workshop on Natural Language Generation from the Semantic Web (WebNLG+)., pp. 27–33. Cited by: §2.1.
- Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting of the Association for Computational Linguistics, pp. 311–318. Cited by: §4.3.
- Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research 21 (140), pp. 1–67. Cited by: §4.1.
- Know what you don’t know: unanswerable questions for squad. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 784–789. Cited by: §1.
- SQuAD: 100,000+ questions for machine comprehension of text. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 2383–2392. Cited by: §3.1.
- Data-driven response generation in social media. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pp. 583–593. Cited by: §1.
- BLEURT: learning robust metrics for text generation. arXiv preprint arXiv:2004.04696. Cited by: §4.3.
- Neural machine translation of rare words with subword units. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1715–1725. Cited by: §4.1.
- MT5: a massively multilingual pre-trained text-to-text transformer. arXiv preprint arXiv:2010.11934. Cited by: §1, §4.1.
- Bertscore: evaluating text generation with bert. arXiv preprint arXiv:1904.09675. Cited by: §4.3.
Appendix A Experimental settings
We use the encoder-decoder architecture for our generation tasks. Among the models described above, mBART and mT5 are pre-trained for generation, whereas M-BERT and XLM-R are pre-trained only as encoders. We therefore initialize the decoder with the encoder parameters for M-BERT and XLM-R. Since M-BERT and mT5 use no language tags during the pre-training phase, we manually add a language tag at the beginning of both the source and the target for M-BERT, and add the target language tag to the beginning of the source for mT5.
We adjust the input format for each task. For QG, we append the answer to the passage and insert a special token to separate them. For SG, we take the first four sentences as the source and the last sentence as the target.
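As a concrete illustration, the input construction described above can be sketched as follows; the separator token and the language-tag format are illustrative assumptions, not the exact tokens used in our experiments.

```python
SEP = "<sep>"  # assumed separator token between passage and answer

def build_qg_source(passage, answer):
    """QG: append the answer to the passage, separated by a special token."""
    return f"{passage} {SEP} {answer}"

def build_sg_pair(story_sentences):
    """SG: first four sentences form the source, the last is the target."""
    source = " ".join(story_sentences[:4])
    target = story_sentences[-1]
    return source, target

def add_language_tags(source, target, model, src_lang, tgt_lang):
    """Prepend language tags for models pre-trained without them."""
    if model == "m-bert":   # tag both source and target
        return f"<{src_lang}> {source}", f"<{tgt_lang}> {target}"
    if model == "mt5":      # target-language tag on the source only
        return f"<{tgt_lang}> {source}", target
    return source, target   # models with their own tagging stay unchanged
```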
We adopt two-step finetuning to make full use of our MTG benchmark. We first train our models on the downstream tasks for 20 epochs using the large, roughly parallel training data, and then finetune them for 10 epochs on the small annotated training data to further improve generation performance. We evaluate the model every 2,000 steps and select the best checkpoint by the loss on the development set. The batch size is 32; the learning rate and optimizer parameters are kept at each model's defaults.
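Schematically, the two-step schedule can be expressed as below; `train` is a hypothetical helper standing in for each model's actual training loop, not the authors' code.

```python
def two_step_finetune(model, rough_parallel_data, annotated_data, dev_data, train):
    """Two-step finetuning: rough parallel data first, then annotated data."""
    # Step 1: 20 epochs on the large, roughly parallel training data,
    # evaluating every 2000 steps and keeping the best checkpoint by dev loss.
    model = train(model, rough_parallel_data, dev_data,
                  epochs=20, batch_size=32, eval_every_steps=2000)
    # Step 2: 10 more epochs on the small human-annotated training data.
    model = train(model, annotated_data, dev_data,
                  epochs=10, batch_size=32, eval_every_steps=2000)
    return model
```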
Appendix B Human evaluation
We use Grammar, Fluency, and Relevance to evaluate generation performance. Each score ranges from 1 to 5; the criterion for each level is given below, ordered from lowest to highest.

Grammar:
- Not the target language at all
- Is the target language, but has too many grammatical errors to convey the meaning
- Barely conveys the meaning of the sentence, but has many errors
- Grammatically correct overall, with a few errors

Fluency:
- Completely incomplete, with loose words
- Basically incomplete, with some phrases
- Barely formed sentences, but not fluent
- Basically complete, with a few flaws

Relevance:
- Not related at all
- A few words are relevant (e.g., character, place, time), but the overall narrative is not relevant
- Relevant, but not logical
- Basically reasonable, with a few irrelevant descriptions
Appendix C Back Translation Threshold Testing
The detailed data sizes of the back-translation-filtered dataset for the different tasks are presented in Table 6.
Appendix D Automated Metric Performances
The full comparison of the ensemble metric with the other automatic metrics is displayed in Figure 7. The ensemble metric outperforms all the other metrics.
Appendix E Significance Test Results
The average ensemble metric scores for stage 1 and stage 2 in question generation, together with the corresponding significance-test p-values, are displayed in Table 7. As the table shows, adding human-annotated training data consistently improves model performance. The improvements are significant in all settings except the Multilingual-Multitask setting.
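The exact statistical test is not fixed above; as one plausible sketch, a one-sided paired bootstrap test over per-example metric scores could be computed as follows (the function name and resampling setup are illustrative assumptions, not the authors' procedure).

```python
import random

def paired_bootstrap_p(scores_stage1, scores_stage2, n_resamples=10000, seed=0):
    """One-sided paired bootstrap: estimate the probability that stage 2
    does NOT outperform stage 1 on a resampled test set."""
    assert len(scores_stage1) == len(scores_stage2)
    rng = random.Random(seed)
    diffs = [b - a for a, b in zip(scores_stage1, scores_stage2)]
    n = len(diffs)
    wins = 0
    for _ in range(n_resamples):
        resample = [diffs[rng.randrange(n)] for _ in range(n)]
        if sum(resample) > 0:  # stage 2 beats stage 1 on this resample
            wins += 1
    return 1.0 - wins / n_resamples  # small p-value => significant gain
```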
Appendix F Human evaluation scores
Appendix G Experimental Results
Here we present the detailed experimental results of our four baseline models under four different evaluation settings.