Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolution

09/27/2021
by Laura Aina, et al.

It is often posited that more predictable parts of a speaker's meaning tend to be made less explicit, for instance by using shorter, less informative words. Studying these dynamics in the domain of referring expressions has proven difficult, with existing studies, both psycholinguistic and corpus-based, providing contradictory results. We test the hypothesis that speakers produce less informative referring expressions (e.g., pronouns vs. full noun phrases) when the context is more informative about the referent, using novel computational estimates of referent predictability. We obtain these estimates by training an existing coreference resolution system for English on a new task, masked coreference resolution, which gives us a probability distribution over referents that is conditioned on the context but not on the referring expression. The resulting system retains standard coreference resolution performance while yielding a better estimate of human-derived referent predictability than previous attempts. A statistical analysis of the relationship between model output and mention form supports the hypothesis that predictability affects the form of a mention, both its morphosyntactic type and its length.
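To make the idea of a "probability distribution over referents" concrete, the following is a minimal sketch, not the authors' implementation. It assumes a model has already produced one raw score per candidate referent for a masked mention; the candidate names, the scores, and the helper functions `softmax`, `referent_surprisal`, and `entropy` are all hypothetical. Surprisal and entropy are used here as common information-theoretic summaries of predictability, which the abstract itself does not specify.

```python
import math


def softmax(scores):
    """Turn raw candidate scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def referent_surprisal(probs, true_index):
    """Surprisal (in bits) of the referent that was actually mentioned."""
    return -math.log2(probs[true_index])


def entropy(probs):
    """Uncertainty (in bits) over all candidate referents in this context."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


# Hypothetical scores for a masked mention; illustrative values only.
candidate_scores = {"Anna": 2.0, "the doctor": 0.5, "the clinic": -1.0}
names = list(candidate_scores)
probs = softmax(list(candidate_scores.values()))

distribution = dict(zip(names, probs))
surprisal_anna = referent_surprisal(probs, names.index("Anna"))
surprisal_clinic = referent_surprisal(probs, names.index("the clinic"))
context_entropy = entropy(probs)
```

Under this sketch, a low surprisal for the true referent means the context alone made it predictable. The hypothesis in the abstract would then expect such mentions to be realized more often as pronouns or shorter expressions.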


