RGB-D Railway Platform Monitoring and Scene Understanding for Enhanced Passenger Safety

02/23/2021
by Marco Wallner, et al.

Automated monitoring and analysis of passenger movement in safety-critical parts of transport infrastructure is an important visual surveillance task. Recent breakthroughs in visual representation learning and spatial sensing have opened up new possibilities for detecting and tracking humans and objects within a 3D spatial context. This paper proposes a flexible analysis scheme and a thorough evaluation of various processing pipelines that detect and track humans on a ground plane, which is calibrated automatically via stereo depth and pedestrian detection. We consider multiple combinations within a set of RGB- and depth-based detection and tracking modalities. We exploit the modular concepts of Meshroom [2] and demonstrate its use as a generic vision processing pipeline and scalable evaluation framework. Furthermore, we introduce a novel open RGB-D railway platform dataset with annotations to support research in automated RGB-D surveillance. We present quantitative multiple-object detection and tracking results for various algorithmic combinations on our dataset. The results indicate that combining depth-based spatial information with learned representations yields substantially higher detection and tracking accuracy. These improvements are especially pronounced in adverse situations, namely when occlusions are present or when objects appear that the learned representations do not capture.
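To make the idea of a ground plane calibrated from depth and pedestrian detections more concrete, the following is a minimal sketch and not the paper's actual calibration procedure. It assumes that each pedestrian detection has already been turned into a 3D foot point in camera coordinates (x right, y down, z forward, in metres) using stereo depth, and that the ground can be written as y = a*x + b*z + c. The function names `fit_ground_plane` and `height_above_ground` are hypothetical and chosen for illustration only.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]  # (x, y, z) in camera coordinates, metres


def det3(m: Sequence[Sequence[float]]) -> float:
    """Determinant of a 3x3 matrix given as nested sequences."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def fit_ground_plane(foot_points: List[Point]) -> Tuple[float, float, float]:
    """Least-squares fit of y = a*x + b*z + c to 3D foot points.

    Assumption: the ground is not parallel to the camera's y axis,
    so it can be expressed as a function of x and z.
    """
    sxx = sxz = szz = sx = sz = sxy = szy = sy = 0.0
    for x, y, z in foot_points:
        sxx += x * x
        sxz += x * z
        szz += z * z
        sx += x
        sz += z
        sxy += x * y
        szy += z * y
        sy += y
    n = float(len(foot_points))

    # Normal equations A * [a, b, c]^T = rhs for the least-squares problem.
    A = [[sxx, sxz, sx], [sxz, szz, sz], [sx, sz, n]]
    rhs = [sxy, szy, sy]

    d = det3(A)
    if abs(d) < 1e-12:
        raise ValueError("foot points are degenerate (too few or collinear)")

    # Solve the 3x3 system with Cramer's rule.
    coeffs = []
    for col in range(3):
        m = [row[:] for row in A]
        for r in range(3):
            m[r][col] = rhs[r]
        coeffs.append(det3(m) / d)
    return coeffs[0], coeffs[1], coeffs[2]


def height_above_ground(p: Point, plane: Tuple[float, float, float]) -> float:
    """Signed perpendicular distance of p from the plane.

    Positive values lie above the ground, because the camera y axis
    is assumed to point downwards.
    """
    a, b, c = plane
    x, y, z = p
    return (a * x + b * z + c - y) / (a * a + b * b + 1.0) ** 0.5
```

With a plane estimated this way, a detection can be checked for plausibility (for example, whether a head point lies at a realistic height above the ground) or projected onto the plane for tracking in ground coordinates. A production system would likely add robust estimation such as RANSAC to cope with noisy depth, which this sketch omits.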


