VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering

12/12/2016
by   Marc Bolaños, et al.
0

In this paper, we address the problem of visual question answering by proposing a novel model, called VIBIKNet. Our model is based on integrating Kernelized Convolutional Neural Networks and Long-Short Term Memory units to generate an answer given a question about an image. We prove that VIBIKNet is an optimal trade-off between accuracy and computational load, in terms of memory and time consumption. We validate our method on the VQA challenge dataset and compare it to the top performing methods in order to illustrate its performance and speed.

READ FULL TEXT
research
07/23/2018

Question Relevance in Visual Question Answering

Free-form and open-ended Visual Question Answering systems solve the pro...
research
10/09/2016

Open-Ended Visual Question-Answering

This thesis report studies methods to solve Visual Question-Answering (V...
research
06/22/2015

Answer Sequence Learning with Neural Networks for Answer Selection in Community Question Answering

In this paper, the answer selection problem in community question answer...
research
07/17/2017

Visual Question Answering with Memory-Augmented Networks

This paper exploits a memory-augmented neural network to predict accurat...
research
07/08/2021

A Long Short-Term Memory for AI Applications in Spike-based Neuromorphic Hardware

In spite of intensive efforts it has remained an open problem to what ex...
research
09/03/2021

Contextualized Embeddings based Convolutional Neural Networks for Duplicate Question Identification

Question Paraphrase Identification (QPI) is a critical task for large-sc...
research
02/15/2022

Privacy Preserving Visual Question Answering

We introduce a novel privacy-preserving methodology for performing Visua...

Please sign up or login with your details

Forgot password? Click here to reset