Blackbird's language matrices (BLMs): a new benchmark to investigate disentangled generalisation in neural networks

05/22/2022
by   Paola Merlo, et al.
0

Current successes of machine learning architectures are based on computationally expensive algorithms and prohibitively large amounts of data. We need to develop tasks and data to train networks to reach more complex and more compositional skills. In this paper, we illustrate Blackbird's language matrices (BLMs), a novel grammatical dataset developed to test a linguistic variant of Raven's progressive matrices, an intelligence test usually based on visual stimuli. The dataset consists of 44800 sentences, generatively constructed to support investigations of current models' linguistic mastery of grammatical agreement rules and their ability to generalise them. We present the logic of the dataset, the method to automatically construct data on a large scale and the architecture to learn them. Through error analysis and several experiments on variations of the dataset, we demonstrate that this language task and the data that instantiate it provide a new challenging testbed to understand generalisation and abstraction.

READ FULL TEXT
research
02/05/2020

Solving Raven's Progressive Matrices with Neural Networks

Raven's Progressive Matrices (RPM) have been widely used for Intelligenc...
research
06/11/2021

Sample-efficient Linguistic Generalizations through Program Synthesis: Experiments with Phonology Problems

Neural models excel at extracting statistical patterns from large amount...
research
05/23/2023

Assessing Linguistic Generalisation in Language Models: A Dataset for Brazilian Portuguese

Much recent effort has been devoted to creating large-scale language mod...
research
06/20/2023

Blackbird language matrices (BLM), a new task for rule-like generalization in neural networks: Motivations and Formal Specifications

We motivate and formally define a new task for fine-tuning rule-like gen...
research
07/29/2019

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

One of the primary challenges faced by deep learning is the degree to wh...
research
03/25/2020

Solving Raven's Progressive Matrices with Multi-Layer Relation Networks

Raven's Progressive Matrices are a benchmark originally designed to test...
research
02/09/2018

Two Is Harder To Recognize Than Tom: the Challenge of Visual Numerosity for Deep Learning

In the spirit of Turing test, we design and conduct a set of visual nume...

Please sign up or login with your details

Forgot password? Click here to reset