D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

09/15/2023
by Amir Rahimi, et al.

Systematic generalization, a crucial aspect of intelligence, is the ability to generalize to novel tasks by combining known subtasks and concepts. One critical factor shown to influence systematic generalization is the diversity of the training data. However, because data have many factors of variation, diversity can be defined in various ways, and a more granular understanding of how different aspects of data diversity affect systematic generalization is lacking. We present new evidence from Visual Question Answering (VQA) showing that the diversity of simple tasks (i.e., tasks formed by a few subtasks and concepts) plays a key role in achieving systematic generalization. This implies that it may not be essential to gather a large and varied set of complex tasks, which could be costly to obtain. We demonstrate that this result is independent of the similarity between the training and testing data and that it holds for two well-known families of neural network architectures for VQA (i.e., monolithic architectures and neural module networks). Additionally, we observe that neural module networks leverage all forms of data diversity we evaluated, while monolithic architectures require larger amounts of data to do so. These findings provide a first step towards understanding the interactions between data diversity design, neural network architectures, and systematic generalization capabilities.


research
06/15/2021

How Modular Should Neural Module Networks Be for Systematic Generalization?

Neural Module Networks (NMNs) aim at Visual Question Answering (VQA) via...
research
01/27/2022

Transformer Module Networks for Systematic Generalization in Visual Question Answering

Transformer-based models achieve great performance on Visual Question An...
research
07/19/2021

Separating Skills and Concepts for Novel Visual Question Answering

Generalization to out-of-distribution data has been a problem for Visual...
research
02/22/2022

Improving Systematic Generalization Through Modularity and Augmentation

Systematic generalization is the ability to combine known parts into nov...
research
12/23/2021

Equivariance and generalization in neural networks

The crucial role played by the underlying symmetries of high energy phys...
research
07/08/2017

Effective Approaches to Batch Parallelization for Dynamic Neural Network Architectures

We present a simple dynamic batching approach applicable to a large clas...
research
10/12/2021

Dynamic Inference with Neural Interpreters

Modern neural network architectures can leverage large amounts of data t...
