Detect-and-Track: Efficient Pose Estimation in Videos

12/26/2017
by   Rohit Girdhar, et al.
0

This paper addresses the problem of estimating and tracking human body keypoints in complex, multi-person video. We propose an extremely lightweight yet highly effective approach that builds upon the latest advancements in human detection and video understanding. Our method operates in two-stages: keypoint estimation in frames or short clips, followed by lightweight tracking to generate keypoint predictions linked over the entire video. For frame-level pose estimation we experiment with Mask R-CNN, as well as our own proposed 3D extension of this model, which leverages temporal information over small clips to generate more robust frame predictions. We conduct extensive ablative experiments on the newly released multi-person video pose estimation benchmark, PoseTrack, to validate various design choices of our model. Our approach achieves an accuracy of 55.2 the Multi-Object Tracking Accuracy (MOTA) metric, and achieves state of the art performance on the ICCV 2017 PoseTrack keypoint tracking challenge.

READ FULL TEXT

page 1

page 4

page 8

research
04/27/2020

Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos

Video annotation is expensive and time consuming. Consequently, datasets...
research
01/23/2019

A Top-down Approach to Articulated Human Pose Estimation and Tracking

Both the tasks of multi-person human pose estimation and pose tracking i...
research
12/05/2019

15 Keypoints Is All You Need

Pose tracking is an important problem that requires identifying unique h...
research
02/04/2020

Combining 3D Model Contour Energy and Keypoints for Object Tracking

We present a new combined approach for monocular model-based 3D tracking...
research
08/15/2019

FastPose: Towards Real-time Pose Estimation and Tracking via Scale-normalized Multi-task Networks

Both accuracy and efficiency are significant for pose estimation and tra...
research
02/28/2023

Video Pose Track with Graph-Guided Sparse Motion Estimation

In this paper, we propose a novel framework for multi-person pose estima...
research
10/02/2022

Fast and Robust Video-Based Exercise Classification via Body Pose Tracking and Scalable Multivariate Time Series Classifiers

Technological advancements have spurred the usage of machine learning ba...

Please sign up or login with your details

Forgot password? Click here to reset