Robust Visual Object Tracking with Natural Language Region Proposal Network

12/04/2019
by   Qi Feng, et al.
28

Tracking with natural-language (NL) specification is a powerful new paradigm to yield trackers that initialize without a manually-specified bounding box, stay on target in spite of occlusions, and auto-recover when diverged. These advantages stem in part from visual appearance and NL having distinct and complementary invariance properties. However, realizing these advantages is technically challenging: the two modalities have incompatible representations. In this paper, we present the first practical and competitive solution to the challenge of tracking with NL specification. Our first novelty is an NL region proposal network (NL-RPN) that transforms an NL description into a convolutional kernel and shares the search branch with siamese trackers; the combined network can be trained end-to-end. Secondly, we propose a novel formulation to represent the history of past visual exemplars and use those exemplars to automatically reset the tracker together with our NL-RPN. Empirical results over tracking benchmarks with NL annotations demonstrate the effectiveness of our approach.

READ FULL TEXT

page 1

page 3

page 4

page 5

research
07/26/2019

Tell Me What to Track

In recent years, deep-learning-based visual object trackers have been st...
research
07/10/2022

SRRT: Search Region Regulation Tracking

Dominant trackers generate a fixed-size rectangular region based on the ...
research
11/17/2019

SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking

By decomposing the visual tracking task into two subproblems as classifi...
research
06/13/2020

Accurate Anchor Free Tracking

Visual object tracking is an important application of computer vision. R...
research
05/24/2021

SiamRCR: Reciprocal Classification and Regression for Visual Object Tracking

Recently, most siamese network based trackers locate targets via object ...
research
05/30/2017

Robust Tracking Using Region Proposal Networks

Recent advances in visual tracking showed that deep Convolutional Neural...
research
11/21/2022

PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework

Visual object tracking is an essential capability of intelligent robots....

Please sign up or login with your details

Forgot password? Click here to reset