Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection

06/04/2019
by Maria Corkery, et al.

The cognitive mechanisms needed to account for the English past tense have long been a subject of debate in linguistics and cognitive science. Neural network models were proposed early on, but were shown to have clear flaws. Recently, however, Kirov and Cotterell (2018) showed that modern encoder-decoder (ED) models overcome many of these flaws. They also presented evidence that ED models demonstrate humanlike performance in a nonce-word task. Here, we look more closely at the behaviour of their model in this task. We find that (1) the model's correlation with human data is unstable across multiple simulations, and (2) even when results are aggregated across simulations (treating each simulation as an individual human participant), the fit to the human data is not strong: it is worse than that of an older rule-based model. These findings hold up through several alternative training regimes and evaluation measures. Although other neural architectures might do better, we conclude that there is still insufficient evidence to claim that neural nets are a good cognitive model for this task.
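The two findings rest on comparing model output with human judgements, once per simulation and once after aggregation. The sketch below illustrates that kind of comparison only; it is not the authors' evaluation code. It assumes Spearman rank correlation as the measure and a per-item mean as the aggregation step, neither of which is specified in the abstract. All data are synthetic, and every name (`human`, `simulations`, `spearman`) is hypothetical.

```python
import random
from statistics import mean


def ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of items tied with the item at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on ranks."""
    return pearson(ranks(x), ranks(y))


# Synthetic stand-ins (assumption): one human score per nonce-word item,
# and one noisy score per item from each of 10 simulated training runs.
rng = random.Random(0)
human = [rng.random() for _ in range(20)]
simulations = [[h + rng.gauss(0, 0.5) for h in human] for _ in range(10)]

# (1) Instability: the correlation can vary from one simulation to the next.
per_simulation = [spearman(scores, human) for scores in simulations]

# (2) Aggregation: average each item's score over simulations, then correlate.
aggregated = [mean(item_scores) for item_scores in zip(*simulations)]
aggregated_rho = spearman(aggregated, human)

print("per-simulation range:", min(per_simulation), max(per_simulation))
print("aggregated:", aggregated_rho)
```

The spread between the minimum and maximum per-simulation values is what makes a result from a single run hard to interpret, which is why the aggregated figure is reported separately.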

research
10/22/2022

A Comprehensive Comparison of Neural Networks as Cognitive Models of Inflection

Neural networks have long been at the center of a debate around the cogn...
research
07/12/2018

Recurrent Neural Networks in Linguistic Theory: Revisiting Pinker and Prince (1988) and the Past Tense Debate

Can advances in NLP help advance cognitive modeling? We examine the role...
research
01/19/2022

Investigating cognitive ability using action-based models of structural brain networks

Recent developments in network neuroscience have highlighted the importa...
research
05/18/2020

Inflecting when there's no majority: Limitations of encoder-decoder neural networks as cognitive models for German plurals

Can artificial neural networks learn to represent inflectional morpholog...
research
10/17/2022

How do we get there? Evaluating transformer neural networks as cognitive models for English past tense inflection

There is an ongoing debate on whether neural networks can grasp the quas...
research
05/03/2017

A Rule-Based Computational Model of Cognitive Arithmetic

Cognitive arithmetic studies the mental processes used in solving math p...
research
11/30/2018

Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger

We formulate the problem of defogging as state estimation and future sta...
