Multi-modal Visual Tracking: Review and Experimental Comparison

12/08/2020
by   Pengyu Zhang, et al.
0

Visual object tracking, as a fundamental task in computer vision, has drawn much attention in recent years. To extend trackers to a wider range of applications, researchers have introduced information from multiple modalities to handle specific scenes, which is a promising research prospect with emerging methods and benchmarks. To provide a thorough review of multi-modal track-ing, we summarize the multi-modal tracking algorithms, especially visible-depth (RGB-D) tracking and visible-thermal (RGB-T) tracking in a unified taxonomy from different aspects. Second, we provide a detailed description of the related benchmarks and challenges. Furthermore, we conduct extensive experiments to analyze the effectiveness of trackers on five datasets: PTB, VOT19-RGBD, GTOT, RGBT234, and VOT19-RGBT. Finally, we discuss various future directions from different perspectives, including model design and dataset construction for further research.

READ FULL TEXT

page 7

page 11

page 12

page 13

page 16

page 18

page 22

page 26

research
01/23/2022

Visual Object Tracking on Multi-modal RGB-D Videos: A Review

The development of visual object tracking has continued for decades. Rec...
research
04/08/2022

Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline

With the popularity of multi-modal sensors, visible-thermal (RGB-T) obje...
research
08/31/2023

RGB-T Tracking via Multi-Modal Mutual Prompt Learning

Object tracking based on the fusion of visible and thermal im-ages, know...
research
01/22/2022

Temporal Aggregation for Adaptive RGBT Tracking

Visual object tracking with RGB and thermal infrared (TIR) spectra avail...
research
11/08/2021

Cross-Modal Object Tracking: Modality-Aware Representations and A Unified Benchmark

In many visual systems, visual tracking often bases on RGB image sequenc...
research
09/26/2014

Multiple Object Tracking: A Literature Review

Multiple Object Tracking (MOT) is an important computer vision problem w...
research
03/17/2020

M^5L: Multi-Modal Multi-Margin Metric Learning for RGBT Tracking

Classifying the confusing samples in the course of RGBT tracking is a qu...

Please sign up or login with your details

Forgot password? Click here to reset