Training language models for deeper understanding improves brain alignment

12/21/2022
by   Khai Loong Aw, et al.
0

Building systems that achieve a deeper understanding of language is one of the central goals of natural language processing (NLP). Towards this goal, recent works have begun to train language models on narrative datasets which require extracting the most critical information by integrating across long contexts. However, it is still an open question whether these models are learning a deeper understanding of the text, or if the models are simply learning a heuristic to complete the task. This work investigates this further by turning to the one language processing system that truly understands complex language: the human brain. We show that training language models for deeper narrative understanding results in richer representations that have improved alignment to human brain activity. We further find that the improvements in brain alignment are larger for character names than for other discourse features, which indicates that these models are learning important narrative elements. Taken together, these results suggest that this type of training can indeed lead to deeper language understanding. These findings have consequences both for cognitive neuroscience by revealing some of the significant factors behind brain-NLP alignment, and for NLP by highlighting that understanding of long-range context can be improved beyond language modeling.

READ FULL TEXT

page 21

page 24

page 25

page 26

page 27

page 28

page 29

page 30

research
12/15/2022

Joint processing of linguistic properties in brains and language models

Language models have been shown to be very effective in predicting brain...
research
12/01/2022

Language models and brain alignment: beyond word-level semantics and prediction

Pretrained language models that have been trained to predict the next wo...
research
10/29/2019

Inducing brain-relevant bias in natural language processing models

Progress in natural language processing (NLP) models that estimate repre...
research
11/28/2021

Long-range and hierarchical language predictions in brains and algorithms

Deep learning has recently made remarkable progress in natural language ...
research
04/22/2022

ChapterBreak: A Challenge Dataset for Long-Range Language Models

While numerous architectures for long-range language models (LRLMs) have...
research
05/28/2019

Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)

Neural network models for NLP are typically implemented without the expl...
research
04/20/2011

Understanding Exhaustive Pattern Learning

Pattern learning in an important problem in Natural Language Processing ...

Please sign up or login with your details

Forgot password? Click here to reset