Tell Me What to Track

07/26/2019
by   Qi Feng, et al.
2

In recent years, deep-learning-based visual object trackers have been studied thoroughly, but handling occlusions and/or rapid motion of the target remains challenging. In this work, we argue that conditioning on the natural language (NL) description of a target provides information for longer-term invariance, and thus helps cope with typical tracking challenges. However, deriving a formulation to combine the strengths of appearance-based tracking with the language modality is not straightforward. We propose a novel deep tracking-by-detection formulation that can take advantage of NL descriptions. Regions that are related to the given NL description are generated by a proposal network during the detection phase of the tracker. Our LSTM based tracker then predicts the update of the target from regions proposed by the NL based detection phase. In benchmarks, our method is competitive with state of the art trackers, while it outperforms all other trackers on targets with unambiguous and precise language annotations. It also beats the state-of-the-art NL tracker when initializing without a bounding box. Our method runs at over 30 fps on a single GPU.

READ FULL TEXT
research
12/04/2019

Robust Visual Object Tracking with Natural Language Region Proposal Network

Tracking with natural-language (NL) specification is a powerful new para...
research
10/25/2021

TAPL: Dynamic Part-based Visual Tracking via Attention-guided Part Localization

Holistic object representation-based trackers suffer from performance dr...
research
08/21/2020

Blending of Learning-based Tracking and Object Detection for Monocular Camera-based Target Following

Deep learning has recently started being applied to visual tracking of g...
research
07/03/2020

Accurate Bounding-box Regression with Distance-IoU Loss for Visual Tracking

Most existing tracking methods are based on using a classifier and multi...
research
04/17/2022

In Defense of Subspace Tracker: Orthogonal Embedding for Visual Tracking

The paper focuses on a classical tracking model, subspace learning, grou...
research
03/21/2020

Cooling-Shrinking Attack: Blinding the Tracker with Imperceptible Noises

Adversarial attack of CNN aims at deceiving models to misbehave by addin...
research
03/27/2020

Probabilistic Regression for Visual Tracking

Visual tracking is fundamentally the problem of regressing the state of ...

Please sign up or login with your details

Forgot password? Click here to reset