Recurrent Models of Visual Attention

06/24/2014
by   Volodymyr Mnih, et al.
0

Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of extracting information from an image or video by adaptively selecting a sequence of regions or locations and only processing the selected regions at high resolution. Like convolutional neural networks, the proposed model has a degree of translation invariance built-in, but the amount of computation it performs can be controlled independently of the input image size. While the model is non-differentiable, it can be trained using reinforcement learning methods to learn task-specific policies. We evaluate our model on several image classification tasks, where it significantly outperforms a convolutional neural network baseline on cluttered images, and on a dynamic visual control problem, where it learns to track a simple object without an explicit training signal for doing so.

READ FULL TEXT

page 6

page 7

page 9

page 10

page 11

research
11/13/2021

Where to Look: A Unified Attention Model for Visual Recognition with Reinforcement Learning

The idea of using the recurrent neural network for visual attention has ...
research
12/24/2014

Multiple Object Recognition with Visual Attention

We present an attention-based model for recognizing multiple objects in ...
research
08/24/2023

Data-Side Efficiencies for Lightweight Convolutional Neural Networks

We examine how the choice of data-side attributes for two important visu...
research
04/26/2019

Perceptual Attention-based Predictive Control

In this paper, we present a novel information processing architecture fo...
research
10/10/2019

NEURO-DRAM: a 3D recurrent visual attention model for interpretable neuroimaging classification

Deep learning is attracting significant interest in the neuroimaging com...
research
01/23/2017

Learning what to look in chest X-rays with a recurrent visual attention model

X-rays are commonly performed imaging tests that use small amounts of ra...
research
01/24/2017

Learning an attention model in an artificial visual system

The Human visual perception of the world is of a large fixed image that ...

Please sign up or login with your details

Forgot password? Click here to reset