VALUE: Understanding Dialect Disparity in NLU

04/06/2022
by Caleb Ziems, et al.

English Natural Language Understanding (NLU) systems have achieved strong performance and have even outperformed humans on benchmarks such as GLUE and SuperGLUE. However, these benchmarks contain only textbook Standard American English (SAE), and other dialects have been largely overlooked in the NLP community. The result is biased and inequitable NLU systems that serve only a sub-population of speakers. To understand disparities in current models and to facilitate more dialect-competent NLU systems, we introduce the VernAcular Language Understanding Evaluation (VALUE) benchmark, a challenging variant of GLUE that we created with a set of lexical and morphosyntactic transformation rules. In this initial release (V.1), we construct rules for 11 features of African American Vernacular English (AAVE), and we recruit fluent AAVE speakers to validate each feature transformation via linguistic acceptability judgments in a participatory design manner. Experiments show that these new dialectal features can lead to a drop in model performance.
