Robust Visual Tracking via Convolutional Networks

01/19/2015
by   Kaihua Zhang, et al.

Deep networks have been successfully applied to visual tracking by learning a generic representation offline from numerous training images. However, offline training is time-consuming, and the learned generic representation may be less discriminative for tracking specific objects. In this paper we show that, even without offline training on a large amount of auxiliary data, simple two-layer convolutional networks can be powerful enough to develop a robust representation for visual tracking. In the first frame, we use the k-means algorithm to extract a set of normalized patches from the target region as fixed filters. Together with a series of adaptive contextual filters drawn from the region surrounding the target, these define a set of feature maps in subsequent frames. The maps measure similarities between each filter and the useful local intensity patterns across the target, thereby encoding its local structural information. Furthermore, all the maps together form a global representation built on mid-level features, which remains close to image-level information and therefore preserves the inner geometric layout of the target. A simple soft shrinkage method with an adaptive threshold is used to de-noise the global representation, resulting in a robust sparse representation. The representation is updated via a simple and effective online strategy, allowing it to adapt robustly to target appearance variations. Our convolutional networks have a surprisingly lightweight structure, yet perform favorably against several state-of-the-art methods on the CVPR2013 tracking benchmark dataset with 50 challenging videos.
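The pipeline described in the abstract (k-means filters from normalized patches, filter-response feature maps, soft shrinkage) can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the patch size `w`, the number of filters `k`, the plain Lloyd-style k-means loop, and the choice of threshold are all hypothetical, and the contextual filters and the online update are omitted.

```python
import numpy as np

def extract_patches(img, w):
    """Dense w x w patches of a 2-D image as rows, mean-subtracted and L2-normalized."""
    patches = np.lib.stride_tricks.sliding_window_view(img, (w, w)).reshape(-1, w * w)
    patches = patches - patches.mean(axis=1, keepdims=True)
    norms = np.linalg.norm(patches, axis=1, keepdims=True)
    return patches / np.maximum(norms, 1e-8)

def kmeans_filters(patches, k, iters=10, seed=0):
    """Plain Lloyd-style k-means (an assumed variant); the k centers act as fixed filters."""
    rng = np.random.default_rng(seed)
    centers = patches[rng.choice(len(patches), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distance from every patch to every center, shape (n_patches, k).
        dists = ((patches[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = patches[labels == j]
            if len(members):  # leave an empty cluster's center unchanged
                centers[j] = members.mean(axis=0)
    return centers

def feature_maps(img, filters, w):
    """Response of each filter at each valid location, shape (H-w+1, W-w+1, k)."""
    windows = np.lib.stride_tricks.sliding_window_view(img, (w, w))
    flat = windows.reshape(windows.shape[0], windows.shape[1], w * w)
    return flat @ filters.T

def soft_shrink(x, theta):
    """Soft shrinkage: pull every value toward zero by theta, zeroing small ones."""
    return np.sign(x) * np.maximum(np.abs(x) - theta, 0.0)
```

A usage pass over a target crop `target` (a 2-D float array) would call `filters = kmeans_filters(extract_patches(target, w), k)` once in the first frame, then `soft_shrink(feature_maps(crop, filters, w), theta)` on candidate crops in later frames. Setting `theta` to the median absolute response is one possible adaptive choice, assumed here for illustration only.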

research
10/28/2018

Object Tracking in Hyperspectral Videos with Convolutional Features and Kernelized Correlation Filter

Target tracking in hyperspectral videos is a new research topic. In this...
research
10/30/2016

Visual Tracking via Boolean Map Representations

In this paper, we present a simple yet effective Boolean map based repre...
research
12/19/2016

Dual Deep Network for Visual Tracking

Visual tracking addresses the problem of identifying and localizing an u...
research
11/06/2018

DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

In recent years, Discriminative Correlation Filter (DCF) based tracking ...
research
04/03/2020

Effective Fusion of Deep Multitasking Representations for Robust Visual Tracking

Visual object tracking remains an active research field in computer visi...
research
02/18/2019

Robust Structured Group Local Sparse Tracker Using Deep Features

Sparse representation has recently been successfully applied in visual t...
research
04/21/2015

Adaptive Compressive Tracking via Online Vector Boosting Feature Selection

Recently, the compressive tracking (CT) method has attracted much attent...
