Towards Accurate Human Pose Estimation in Videos of Crowded Scenes

10/16/2020
by   Li Yuan, et al.
1

Video-based human pose estimation in crowded scenes is a challenging problem due to occlusion, motion blur, scale variation and viewpoint change, etc. Prior approaches always fail to deal with this problem because of (1) lacking of usage of temporal information; (2) lacking of training data in crowded scenes. In this paper, we focus on improving human pose estimation in videos of crowded scenes from the perspectives of exploiting temporal context and collecting new data. In particular, we first follow the top-down strategy to detect persons and perform single-person pose estimation for each frame. Then, we refine the frame-based pose estimation with temporal contexts deriving from the optical-flow. Specifically, for one frame, we forward the historical poses from the previous frames and backward the future poses from the subsequent frames to current frame, leading to stable and accurate human pose estimation in videos. In addition, we mine new data of similar scenes to HIE dataset from the Internet for improving the diversity of training set. In this way, our model achieves best performance on 7 out of 13 videos and 56.33 average w_AP on test dataset of HIE challenge.

READ FULL TEXT
research
11/04/2020

Leveraging Temporal Joint Depths for Improving 3D Human Pose Estimation in Video

The effectiveness of the approaches to predict 3D poses from 2D poses es...
research
10/16/2020

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes

Detecting and recognizing human action in videos with crowded scenes is ...
research
05/10/2019

Exploiting temporal context for 3D human pose estimation in the wild

We present a bundle-adjustment-based algorithm for recovering accurate 3...
research
06/07/2021

Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking

Multi-person pose estimation and tracking serve as crucial steps for vid...
research
08/13/2022

A new way of video compression via forward-referencing using deep learning

To exploit high temporal correlations in video frames of the same scene,...
research
04/16/2021

T-LEAP: occlusion-robust pose estimation of walking cows using temporal information

As herd size on dairy farms continue to increase, automatic health monit...
research
07/30/2020

Key Frame Proposal Network for Efficient Pose Estimation in Videos

Human pose estimation in video relies on local information by either est...

Please sign up or login with your details

Forgot password? Click here to reset