How Modular Should Neural Module Networks Be for Systematic Generalization?

06/15/2021
by   Vanessa D'Amario, et al.
0

Neural Module Networks (NMNs) aim at Visual Question Answering (VQA) via composition of modules that tackle a sub-task. NMNs are a promising strategy to achieve systematic generalization, i.e. overcoming biasing factors in the training distribution. However, the aspects of NMNs that facilitate systematic generalization are not fully understood. In this paper, we demonstrate that the stage and the degree at which modularity is defined has large influence on systematic generalization. In a series of experiments on three VQA datasets (MNIST with multiple attributes, SQOOP, and CLEVR-CoGenT), our results reveal that tuning the degree of modularity in the network, especially at the image encoder stage, reaches substantially higher systematic generalization. These findings lead to new NMN architectures that outperform previous ones in terms of systematic generalization.

READ FULL TEXT

page 2

page 5

page 7

page 12

page 17

research
01/27/2022

Transformer Module Networks for Systematic Generalization in Visual Question Answering

Transformer-based models achieve great performance on Visual Question An...
research
09/15/2023

D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

Systematic generalization is a crucial aspect of intelligence, which ref...
research
11/30/2018

Systematic Generalization: What Is Required and Can It Be Learned?

Numerous models for grounded language understanding have been recently p...
research
11/22/2022

A Short Survey of Systematic Generalization

This survey includes systematic generalization and a history of how mach...
research
05/03/2021

Iterated learning for emergent systematicity in VQA

Although neural module networks have an architectural bias towards compo...
research
08/24/2022

On a Built-in Conflict between Deep Learning and Systematic Generalization

In this paper, we hypothesize that internal function sharing is one of t...
research
07/21/2022

Semantic-aware Modular Capsule Routing for Visual Question Answering

Visual Question Answering (VQA) is fundamentally compositional in nature...

Please sign up or login with your details

Forgot password? Click here to reset