Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy

04/14/2023
by   David Schlangen, et al.
How does one measure "ability to understand language"? If it is a person's ability that is being measured, this is a question that almost never poses itself in an unqualified manner: Whatever formal test is applied, it takes place against the background of the person's language use in daily social practice, and what is measured is a specialised variety of language understanding (e.g., of a second language; or of written, technical language). Computer programs do not have this background. What does that mean for the applicability of formal tests of language understanding? I argue that such tests need to be complemented with tests of language use embedded in a practice, to arrive at a more comprehensive evaluation of "artificial language understanding". To do such tests systematically, I propose to use "Dialogue Games" – constructed activities that provide a situational embedding for language use. I describe a taxonomy of Dialogue Game types, linked to a model of the underlying capabilities that are tested, thereby giving an argument for the construct validity of the test. I close by showing how the internal structure of the taxonomy suggests an ordering from more specialised to more general situational language understanding, which can potentially provide some strategic guidance for development in this field.


