DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer

03/30/2018
by   Joseph Suarez, et al.
0

We present a novel Dynamic Differentiable Reasoning (DDR) framework for jointly learning branching programs and the functions composing them; this resolves a significant nondifferentiability inhibiting recent dynamic architectures. We apply our framework to two settings in two highly compact and data efficient architectures: DDRprog for CLEVR Visual Question Answering and DDRstack for reverse Polish notation expression evaluation. DDRprog uses a recurrent controller to jointly predict and execute modular neural programs that directly correspond to the underlying question logic; it explicitly forks subprocesses to handle logical branching. By effectively leveraging additional structural supervision, we achieve a large improvement over previous approaches in subtask consistency and a small improvement in overall accuracy. We further demonstrate the benefits of structural supervision in the RPN setting: the inclusion of a stack assumption in DDRstack allows our approach to generalize to long expressions where an LSTM fails the task.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 11

page 12

research
06/20/2020

Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

Visual reasoning tasks such as visual question answering (VQA) require a...
research
04/30/2020

Dynamic Language Binding in Relational Visual Reasoning

We present Language-binding Object Graph Network, the first neural reaso...
research
06/08/2023

Modular Visual Question Answering via Code Generation

We present a framework that formulates visual question answering as modu...
research
04/08/2017

An Empirical Evaluation of Visual Question Answering for Novel Objects

We study the problem of answering questions about images in the harder s...
research
06/22/2016

Semantic Parsing to Probabilistic Programs for Situated Question Answering

Situated question answering is the problem of answering questions about ...
research
07/08/2017

Effective Approaches to Batch Parallelization for Dynamic Neural Network Architectures

We present a simple dynamic batching approach applicable to a large clas...

Please sign up or login with your details

Forgot password? Click here to reset