Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction

11/18/2015
by   Hyeonwoo Noh, et al.
0

We tackle image question answering (ImageQA) problem by learning a convolutional neural network (CNN) with a dynamic parameter layer whose weights are determined adaptively based on questions. For the adaptive parameter prediction, we employ a separate parameter prediction network, which consists of gated recurrent unit (GRU) taking a question as its input and a fully-connected layer generating a set of candidate weights as its output. However, it is challenging to construct a parameter prediction network for a large number of parameters in the fully-connected dynamic parameter layer of the CNN. We reduce the complexity of this problem by incorporating a hashing technique, where the candidate weights given by the parameter prediction network are selected using a predefined hash function to determine individual weights in the dynamic parameter layer. The proposed network---joint network with the CNN for ImageQA and the parameter prediction network---is trained end-to-end through back-propagation, where its weights are initialized using a pre-trained CNN and GRU. The proposed algorithm illustrates the state-of-the-art performance on all available public ImageQA benchmarks.

READ FULL TEXT

page 1

page 8

research
06/01/2015

Learning to Answer Questions From Image Using Convolutional Neural Network

In this paper, we propose to employ the convolutional neural network (CN...
research
04/17/2015

Color Constancy Using CNNs

In this work we describe a Convolutional Neural Network (CNN) to accurat...
research
02/01/2017

Design, Analysis and Application of A Volumetric Convolutional Neural Network

The design, analysis and application of a volumetric convolutional neura...
research
11/15/2018

Improving Skin Condition Classification with a Question Answering Model

We present a skin condition classification methodology based on a sequen...
research
12/10/2015

Neural Self Talk: Image Understanding via Continuous Questioning and Answering

In this paper we consider the problem of continuously discovering image ...
research
04/28/2015

Speeding Up Neural Networks for Large Scale Classification using WTA Hashing

In this paper we propose to use the Winner Takes All hashing technique t...
research
11/06/2019

Doppler Spectrum Classification with CNNs via Heatmap Location Encoding and a Multi-head Output Layer

Spectral Doppler measurements are an important part of the standard echo...

Please sign up or login with your details

Forgot password? Click here to reset