Learning where to Attend with Deep Architectures for Image Tracking

09/16/2011
by   Misha Denil, et al.
0

We discuss an attentional model for simultaneous object tracking and recognition that is driven by gaze data. Motivated by theories of perception, the model consists of two interacting pathways: identity and control, intended to mirror the what and where pathways in neuroscience models. The identity pathway models object appearance and performs classification using deep (factored)-Restricted Boltzmann Machines. At each point in time the observations consist of foveated images, with decaying resolution toward the periphery of the gaze. The control pathway models the location, orientation, scale and speed of the attended object. The posterior distribution of these states is estimated with particle filtering. Deeper in the control pathway, we encounter an attentional mechanism that learns to select gazes so as to minimize tracking uncertainty. Unlike in our previous work, we introduce gaze selection strategies which operate in the presence of partial information and on a continuous action space. We show that a straightforward extension of the existing approach to the partial information setting results in poor performance, and we propose an alternative method based on modeling the reward surface as a Gaussian Process. This approach gives good performance in the presence of partial information and allows us to expand the action space from a small, discrete set of fixation points to a continuous domain.

READ FULL TEXT

page 3

page 5

page 7

page 8

page 10

page 22

page 24

page 25

research
10/26/2022

End-to-end Tracking with a Multi-query Transformer

Multiple-object tracking (MOT) is a challenging task that requires simul...
research
05/31/2020

In the Eye of the Beholder: Gaze and Actions in First Person Video

We address the task of jointly determining what a person is doing and wh...
research
11/08/2020

Integrating Human Gaze into Attention for Egocentric Activity Recognition

It is well known that human gaze carries significant information about v...
research
08/24/2022

Active Gaze Control for Foveal Scene Exploration

Active perception and foveal vision are the foundations of the human vis...
research
02/06/2022

On Smart Gaze based Annotation of Histopathology Images for Training of Deep Convolutional Neural Networks

Unavailability of large training datasets is a bottleneck that needs to ...
research
10/11/2020

Towards Hardware-Agnostic Gaze-Trackers

Gaze-tracking is a novel way of interacting with computers which allows ...

Please sign up or login with your details

Forgot password? Click here to reset