Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World

03/22/2018
by   Matteo Fabbri, et al.

Multi-people tracking in an open-world setting demands particularly precise detection. Moreover, temporal continuity in the detection phase becomes more important when scene clutter introduces the challenging problem of occluded targets. To this end, we propose a deep network architecture that jointly extracts people's body parts and associates them across short temporal spans. Our model explicitly deals with occluded body parts by hallucinating plausible solutions for joints that are not visible. We propose a new end-to-end architecture composed of four branches (visible heatmaps, occluded heatmaps, part affinity fields and temporal affinity fields) fed by a time linker feature extractor. To overcome the lack of surveillance data with tracking, body part and occlusion annotations, we created a computer graphics dataset for people tracking in urban scenarios by exploiting a photorealistic videogame. To date, it is the largest dataset of human body parts for people tracking in urban scenarios (about 500,000 frames and more than 10 million body poses). Our architecture, trained on virtual data, also generalizes well to public real-world tracking benchmarks when image resolution and sharpness are high enough, producing reliable tracklets useful for further batch data association or re-id modules.

research
10/21/2020

Mutual-Supervised Feature Modulation Network for Occluded Pedestrian Detection

State-of-the-art pedestrian detectors have achieved significant progress...
research
01/23/2019

Can Adversarial Networks Hallucinate Occluded People With a Plausible Aspect?

When you see a person in a crowd, occluded by other persons, you miss vi...
research
07/19/2017

Detecting Parts for Action Localization

In this paper, we propose a new framework for action localization that t...
research
05/16/2021

Neighbourhood-guided Feature Reconstruction for Occluded Person Re-Identification

Person images captured by surveillance cameras are often occluded by var...
research
11/29/2018

Efficient Online Multi-Person 2D Pose Tracking with Recurrent Spatio-Temporal Affinity Fields

We present an online approach to efficiently and simultaneously detect a...
research
11/24/2016

Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields

We present an approach to efficiently detect the 2D pose of multiple peo...
research
03/03/2021

OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association

Many image-based perception tasks can be formulated as detecting, associ...
