ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data

03/24/2023
by Haojie Zhao, et al.

Compared with traditional RGB-only visual tracking, few datasets have been constructed for RGB-D tracking. In this paper, we propose ARKitTrack, a new RGB-D tracking dataset for both static and dynamic scenes, captured by the consumer-grade LiDAR scanners built into Apple's iPhone and iPad. ARKitTrack contains 300 RGB-D sequences, 455 targets, and 229.7K video frames in total. Along with bounding box annotations and frame-level attributes, we also annotate the dataset with 123.9K pixel-level target masks. The camera intrinsics and camera pose of each frame are also provided to support future work. To demonstrate the potential usefulness of this dataset, we further present a unified baseline for both box-level and pixel-level tracking, which integrates RGB features with bird's-eye-view representations to better explore cross-modality 3D geometry. In-depth empirical analysis verifies that the ARKitTrack dataset can significantly facilitate RGB-D tracking and that the proposed baseline method compares favorably against state-of-the-art methods. The code and dataset are available at https://arkittrack.github.io.
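To give a sense of why per-frame camera intrinsics matter for a bird's-eye-view representation, the sketch below lifts a depth map to 3D points with the standard pinhole model and bins them on the ground plane. It is only an illustration under stated assumptions and is not the authors' baseline: the function names (`backproject`, `bev_occupancy`), the plain list-of-rows depth format, and the grid parameters are all hypothetical, and the actual method operates on learned features rather than raw point counts.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]


def backproject(depth: List[List[float]], fx: float, fy: float,
                cx: float, cy: float) -> List[Point]:
    """Lift a depth map (rows of metric depths) to camera-frame 3D points.

    Uses the pinhole model: x = (u - cx) * z / fx, y = (v - cy) * z / fy.
    Pixels with non-positive depth are treated as invalid and skipped.
    """
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:
                continue
            points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points


def bev_occupancy(points: List[Point], x_range: Tuple[float, float],
                  z_range: Tuple[float, float], cell: float) -> List[List[int]]:
    """Count points per cell on the x-z plane, discarding the height axis y.

    Rows index depth (z) and columns index lateral position (x).
    Points outside the given ranges are ignored.
    """
    cols = int((x_range[1] - x_range[0]) / cell)
    rows = int((z_range[1] - z_range[0]) / cell)
    grid = [[0] * cols for _ in range(rows)]
    for x, _, z in points:
        c = int((x - x_range[0]) // cell)
        r = int((z - z_range[0]) // cell)
        if 0 <= r < rows and 0 <= c < cols:
            grid[r][c] += 1
    return grid


if __name__ == "__main__":
    # Toy 2x2 depth map with one invalid pixel; intrinsics are made up.
    depth = [[1.0, 0.0], [2.0, 1.0]]
    pts = backproject(depth, fx=1.0, fy=1.0, cx=0.0, cy=0.0)
    for row in bev_occupancy(pts, (-1.0, 2.0), (0.0, 3.0), cell=1.0):
        print(row)
```

In a real pipeline the camera pose would additionally be needed to align the ground plane across frames in dynamic scenes; that step is omitted here.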


