Is First Person Vision Challenging for Object Tracking? The TREK-100 Benchmark Dataset

by Matteo Dunnhofer, et al.

Understanding human-object interactions is fundamental in First Person Vision (FPV). Tracking algorithms that follow the objects manipulated by the camera wearer can provide useful information for modeling such interactions. Despite a few previous attempts to exploit trackers in FPV applications, a systematic analysis of the performance of state-of-the-art trackers in this domain is still missing. Meanwhile, the visual tracking solutions available in the computer vision literature have improved significantly in recent years across a large variety of target objects and tracking scenarios. To fill this gap, we present TREK-100, the first benchmark dataset for visual object tracking in FPV. The dataset is composed of 100 video sequences densely annotated with 60K bounding boxes, 17 sequence attributes, 13 action verb attributes, and 29 target object attributes. Along with the dataset, we present an extensive analysis of the performance of 30 of the best and most recent visual trackers. Our results show that object tracking in FPV is challenging, which suggests that more research effort should be devoted to this problem.



360VOT: A New Benchmark Dataset for Omnidirectional Visual Object Tracking

360 images can provide an omnidirectional field of view which is importa...

Need for Speed: A Benchmark for Higher Frame Rate Object Tracking

In this paper, we propose the first higher frame rate video dataset (cal...

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility

One of the key factors behind the recent success in visual tracking is t...

AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes

Multi-object tracking (MOT) is a fundamental problem in computer vision ...

3D-ZeF: A 3D Zebrafish Tracking Benchmark Dataset

In this work we present a novel publicly available stereo based 3D RGB d...

Trans2k: Unlocking the Power of Deep Models for Transparent Object Tracking

Visual object tracking has focused predominantly on opaque objects, whil...
