CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment

03/11/2022
by Lutfi Kerem Senel, et al.

Pretrained language models (PLMs) have achieved superhuman performance on many benchmarks, creating a need for harder tasks. We introduce CoDA21 (Context Definition Alignment), a challenging benchmark that measures natural language understanding (NLU) capabilities of PLMs: Given a definition and a context each for k words, but not the words themselves, the task is to align the k definitions with the k contexts. CoDA21 requires a deep understanding of contexts and definitions, including complex inference and world knowledge. We find that there is a large gap between human and PLM performance, suggesting that CoDA21 measures an aspect of NLU that is not sufficiently covered in existing benchmarks.
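The alignment task described above can be illustrated with a minimal sketch. It assumes a pairwise scoring function between a definition and a context, and picks the one-to-one assignment with the highest total score. The names `overlap_score`, `best_alignment`, and `alignment_accuracy`, the toy data, and the word-overlap score are all hypothetical placeholders for illustration. They are not the scoring method or evaluation code used by the authors; in practice a PLM-based score would take the place of `overlap_score`.

```python
from itertools import permutations


def overlap_score(definition: str, context: str) -> float:
    """Toy placeholder score: Jaccard overlap of lowercased tokens.

    Assumption for illustration only; a PLM-based score would replace this.
    """
    d = set(definition.lower().split())
    c = set(context.lower().split())
    union = d | c
    return len(d & c) / len(union) if union else 0.0


def best_alignment(definitions, contexts, score=overlap_score):
    """Return a list p where definition i is aligned with context p[i].

    Exhaustively searches all k! one-to-one assignments, so it is only
    practical for small k.
    """
    k = len(definitions)
    if k != len(contexts):
        raise ValueError("need the same number of definitions and contexts")
    matrix = [[score(d, c) for c in contexts] for d in definitions]
    best = max(
        permutations(range(k)),
        key=lambda p: sum(matrix[i][p[i]] for i in range(k)),
    )
    return list(best)


def alignment_accuracy(predicted, gold):
    """Fraction of definitions aligned with their correct context."""
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)


# Toy instance with k = 3; the target words are hidden as "___".
definitions = [
    "a stringed instrument played with a bow",
    "a large body of salt water",
    "a baked food made from flour",
]
contexts = [
    "she sliced the ___ made from fresh flour",
    "he played the ___ with a bow",
    "ships crossed the ___ of salt water",
]
gold = [1, 2, 0]  # definition i belongs with context gold[i]

predicted = best_alignment(definitions, contexts)
print(predicted, alignment_accuracy(predicted, gold))
```

The toy data is deliberately easy because definitions and contexts share surface words. The abstract's point is that real CoDA21 instances require inference and world knowledge, which a surface-overlap score like this one does not capture.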

research
03/05/2021

Overcoming Poor Word Embeddings with Word Definitions

Modern natural language understanding models depend on pretrained subwor...
research
02/06/2021

Does He Wink or Does He Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models

Recent progress in pretraining language models on large corpora has resu...
research
04/07/2020

Evaluating Machines by their Real-World Language Use

There is a fundamental gap between how humans understand and use languag...
research
05/15/2023

What's the Meaning of Superhuman Performance in Today's NLU?

In the last five years, there has been a significant focus in Natural La...
research
06/29/2022

Is it possible not to cheat on the Turing Test: Exploring the potential and challenges for true natural language 'understanding' by computers

The increasing sophistication of NLP models has renewed optimism regardi...
research
09/21/2023

ContextRef: Evaluating Referenceless Metrics For Image Description Generation

Referenceless metrics (e.g., CLIPScore) use pretrained vision–language m...
research
11/01/2018

Learning to Describe Phrases with Local and Global Contexts

When reading a text, it is common to become stuck on unfamiliar words an...
