MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation

07/24/2021
by   Ayush Garg, et al.
5

Code-mixing is a phenomenon of mixing words and phrases from two or more languages in a single utterance of speech and text. Due to the high linguistic diversity, code-mixing presents several challenges in evaluating standard natural language generation (NLG) tasks. Various widely popular metrics perform poorly with the code-mixed NLG tasks. To address this challenge, we present a metric independent evaluation pipeline MIPE that significantly improves the correlation between evaluation metrics and human judgments on the generated code-mixed text. As a use case, we demonstrate the performance of MIPE on the machine-generated Hinglish (code-mixing of Hindi and English languages) sentences from the HinGE corpus. We can extend the proposed evaluation strategy to other code-mixed language pairs, NLG tasks, and evaluation metrics with minimal to no effort.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2021

HinGE: A Dataset for Generation and Evaluation of Code-Mixed Hinglish Text

Text generation is a highly active area of research in the computational...
research
06/18/2021

Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text

Code-mixing is a frequent communication style among multilingual speaker...
research
11/13/2019

Prevalence of code mixing in semi-formal patient communication in low resource languages of South Africa

In this paper we address the problem of code-mixing in resource-poor lan...
research
04/10/2020

A New Dataset for Natural Language Inference from Code-mixed Conversations

Natural Language Inference (NLI) is the task of inferring the logical re...
research
01/30/2020

Harnessing Code Switching to Transcend the Linguistic Barrier

Code mixing (or code switching) is a common phenomenon observed in socia...
research
11/02/2022

Dialect-robust Evaluation of Generated Text

Evaluation metrics that are not robust to dialect variation make it impo...
research
07/16/1999

Mixing representation levels: The hybrid approach to automatic text generation

Natural language generation systems (NLG) map non-linguistic representat...

Please sign up or login with your details

Forgot password? Click here to reset