Hierarchical Attentive Recurrent Tracking

06/28/2017
by   Adam R. Kosiorek, et al.
0

Class-agnostic object tracking is particularly difficult in cluttered environments as target specific discriminative models cannot be learned a priori. Inspired by how the human visual cortex employs spatial attention and separate "where" and "what" processing pathways to actively suppress irrelevant visual features, this work develops a hierarchical attentive recurrent model for single object tracking in videos. The first layer of attention discards the majority of background by selecting a region containing the object of interest, while the subsequent layers tune in on visual features particular to the tracked object. This framework is fully differentiable and can be trained in a purely data driven fashion by gradient methods. To improve training convergence, we augment the loss function with terms for a number of auxiliary tasks relevant for tracking. Evaluation of the proposed model is performed on two datasets: pedestrian tracking on the KTH activity recognition dataset and the more difficult KITTI object tracking dataset.

READ FULL TEXT

page 2

page 6

page 8

research
01/23/2022

Visual Object Tracking on Multi-modal RGB-D Videos: A Review

The development of visual object tracking has continued for decades. Rec...
research
01/09/2017

Visual Multiple-Object Tracking for Unknown Clutter Rate

In multi-object tracking applications, model parameter tuning is a prere...
research
01/06/2018

Learning Hierarchical Features for Visual Object Tracking with Recursive Neural Networks

Recently, deep learning has achieved very promising results in visual ob...
research
10/07/2022

Trans2k: Unlocking the Power of Deep Models for Transparent Object Tracking

Visual object tracking has focused predominantly on opaque objects, whil...
research
04/03/2021

Learning Mobile CNN Feature Extraction Toward Fast Computation of Visual Object Tracking

In this paper, we construct a lightweight, high-precision and high-speed...
research
09/10/2018

Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers

Online Multi-Object Tracking (MOT) from videos is a challenging computer...
research
10/15/2020

An Empirical Analysis of Visual Features for Multiple Object Tracking in Urban Scenes

This paper addresses the problem of selecting appearance features for mu...

Please sign up or login with your details

Forgot password? Click here to reset