Deep Dual Consecutive Network for Human Pose Estimation

03/12/2021
by   Zhenguang Liu, et al.
6

Multi-frame human pose estimation in complicated situations is challenging. Although state-of-the-art human joints detectors have demonstrated remarkable results for static images, their performances come short when we apply these models to video sequences. Prevalent shortcomings include the failure to handle motion blur, video defocus, or pose occlusions, arising from the inability in capturing the temporal dependency among video frames. On the other hand, directly employing conventional recurrent neural networks incurs empirical difficulties in modeling spatial contexts, especially for dealing with pose occlusions. In this paper, we propose a novel multi-frame human pose estimation framework, leveraging abundant temporal cues between video frames to facilitate keypoint detection. Three modular components are designed in our framework. A Pose Temporal Merger encodes keypoint spatiotemporal context to generate effective searching scopes while a Pose Residual Fusion module computes weighted pose residuals in dual directions. These are then processed via our Pose Correction Network for efficient refining of pose estimations. Our method ranks No.1 in the Multi-frame Person Pose Estimation Challenge on the large-scale benchmark datasets PoseTrack2017 and PoseTrack2018. We have released our code, hoping to inspire future research.

READ FULL TEXT

page 2

page 3

page 7

page 8

research
04/27/2020

Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos

Video annotation is expensive and time consuming. Consequently, datasets...
research
07/31/2023

DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation

Denoising diffusion probabilistic models that were initially proposed fo...
research
03/24/2022

AIMusicGuru: Music Assisted Human Pose Correction

Pose Estimation techniques rely on visual cues available through observa...
research
03/15/2023

Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video

Temporal modeling is crucial for multi-frame human pose estimation. Most...
research
04/16/2021

T-LEAP: occlusion-robust pose estimation of walking cows using temporal information

As herd size on dairy farms continue to increase, automatic health monit...
research
12/18/2017

LSTM Pose Machines

We observed that recent state-of-the-art results on single image human p...
research
03/29/2022

Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation

Multi-frame human pose estimation has long been a compelling and fundame...

Please sign up or login with your details

Forgot password? Click here to reset