So you think you can track?

09/13/2023
by Derek Gloudemans, et al.

This work introduces a multi-camera tracking dataset consisting of 234 hours of video data recorded concurrently from 234 overlapping HD cameras covering a 4.2-mile stretch of 8-10 lane interstate highway near Nashville, TN. The video is recorded during a period of high traffic density, with 500+ objects typically visible within the scene and typical object longevities of 3-15 minutes. GPS trajectories from 270 vehicle passes through the scene are manually corrected in the video data to provide a set of ground-truth trajectories for recall-oriented tracking metrics, and object detections are provided for each camera in the scene (159 million total before cross-camera fusion). Initial benchmarking of tracking-by-detection algorithms is performed against the GPS trajectories, and a best HOTA of only 9.5 is obtained (IOU 0.1, 47.9 average IDs per ground-truth object), indicating that the benchmarked trackers do not perform sufficiently well at the long temporal and spatial durations required for traffic scene understanding.
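To make the recall-oriented figures quoted above more concrete, the following is a minimal sketch of how recall at an IOU threshold of 0.1 and the number of distinct track IDs matched to one ground-truth object could be computed. It is not the authors' evaluation code: the axis-aligned 2D box format, the per-frame greedy matching, and the names `iou` and `recall_and_id_count` are all assumptions made for illustration, and HOTA itself is not computed here.

```python
from typing import Dict, List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max), axis-aligned.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def recall_and_id_count(
    gt: List[Box],
    preds: List[Dict[int, Box]],
    iou_threshold: float = 0.1,
) -> Tuple[float, int]:
    """Score one ground-truth trajectory against per-frame predictions.

    gt holds one box per frame; preds holds, for each frame, a mapping
    {track_id: box}. A frame counts as recalled when some predicted box
    overlaps the ground-truth box with IOU >= iou_threshold. Returns
    (recall, number of distinct track ids matched to this object).
    """
    matched_frames = 0
    matched_ids = set()
    for gt_box, frame_preds in zip(gt, preds):
        best_id, best_iou = None, iou_threshold
        for track_id, box in frame_preds.items():
            score = iou(gt_box, box)
            if score >= best_iou:
                best_id, best_iou = track_id, score
        if best_id is not None:
            matched_frames += 1
            matched_ids.add(best_id)
    recall = matched_frames / len(gt) if gt else 0.0
    return recall, len(matched_ids)


# Hypothetical three-frame example: two frames matched by two different ids.
example_gt = [(0, 0, 10, 10)] * 3
example_preds = [
    {1: (0, 0, 10, 10)},
    {2: (5, 0, 15, 10)},
    {3: (100, 100, 110, 110)},
]
example_recall, example_ids = recall_and_id_count(example_gt, example_preds)
```

Under this sketch, a tracker that follows an object well but keeps switching identities would score high recall together with a high ID count, which is the failure pattern that a large average number of IDs per ground-truth object points to.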


