Multilingual CheckList: Generation and Evaluation

03/24/2022
by Karthikeyan K, et al.

The recently proposed CheckList approach (Ribeiro et al., 2020) to evaluating NLP systems has revealed high failure rates on basic capabilities for multiple state-of-the-art and commercial models. However, creating a CheckList is a manual process, which is a bottleneck for building multilingual CheckLists that cater to hundreds of languages. In this work, we explore multiple approaches to generating Multilingual CheckLists and evaluating their quality. We devise an algorithm, Automated Multilingual Checklist Generation (AMCG), that automatically transfers a CheckList from a source language to a target language and relies on a reasonable machine translation system. We then compare the CheckList generated by AMCG with CheckLists generated with different levels of human intervention. Through in-depth crosslingual experiments between English and Hindi, and broad multilingual experiments spanning 11 languages, we show that the automatic approach can provide accurate estimates of a model's failure rates across capabilities, comparable to those of a human-verified CheckList and better than those of CheckLists generated by humans from scratch.
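
The comparison described above rests on per-capability failure rates. A minimal sketch of that bookkeeping follows. It is not the authors' AMCG implementation; the function names, the capability labels, and the toy pass/fail results are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def failure_rates(results):
    """Map each capability to the fraction of its test cases the model failed.

    `results` is an iterable of (capability, passed) pairs; this input
    format is an assumption made for the sketch.
    """
    totals = defaultdict(int)
    failures = defaultdict(int)
    for capability, passed in results:
        totals[capability] += 1
        if not passed:
            failures[capability] += 1
    return {cap: failures[cap] / totals[cap] for cap in totals}


def mean_abs_gap(rates_a, rates_b):
    """Average absolute difference in failure rate over shared capabilities."""
    shared = sorted(set(rates_a) & set(rates_b))
    if not shared:
        raise ValueError("the two CheckLists share no capabilities")
    return sum(abs(rates_a[c] - rates_b[c]) for c in shared) / len(shared)


# Toy, made-up outcomes for one model on two target-language CheckLists.
automatic = [("negation", True), ("negation", False),
             ("vocabulary", True), ("vocabulary", True)]
verified = [("negation", False), ("negation", False),
            ("vocabulary", True), ("vocabulary", False)]

auto_rates = failure_rates(automatic)
verified_rates = failure_rates(verified)
gap = mean_abs_gap(auto_rates, verified_rates)
```

A small gap would suggest that the automatically generated CheckList estimates failure rates about as well as the human-verified one. The mean absolute difference is only one possible summary and is an assumption here, not a metric taken from the paper.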

