OTPose: Occlusion-Aware Transformer for Pose Estimation in Sparsely-Labeled Videos

07/20/2022
by   Kyung-Min Jin, et al.
0

Although many approaches for multi-human pose estimation in videos have shown profound results, they require densely annotated data which entails excessive man labor. Furthermore, there exists occlusion and motion blur that inevitably lead to poor estimation performance. To address these problems, we propose a method that leverages an attention mask for occluded joints and encodes temporal dependency between frames using transformers. First, our framework composes different combinations of sparsely annotated frames that denote the track of the overall joint movement. We propose an occlusion attention mask from these combinations that enable encoding occlusion-aware heatmaps as a semi-supervised task. Second, the proposed temporal encoder employs transformer architecture to effectively aggregate the temporal relationship and keypoint-wise attention from each time step and accurately refines the target frame's final pose estimation. We achieve state-of-the-art pose estimation results for PoseTrack2017 and PoseTrack2018 datasets and demonstrate the robustness of our approach to occlusion and motion blur in sparsely annotated video data.

READ FULL TEXT

page 1

page 2

page 3

page 5

research
11/29/2022

Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos

Previous video-based human pose estimation methods have shown promising ...
research
03/09/2023

Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation

Accurately estimating 3D hand pose is crucial for understanding how huma...
research
06/24/2020

3D Pose Detection in Videos: Focusing on Occlusion

In this work, we build upon existing methods for occlusion-aware 3D pose...
research
10/13/2020

Multi-Scale Networks for 3D Human Pose Estimation with Inference Stage Optimization

Estimating 3D human poses from a monocular video is still a challenging ...
research
09/02/2023

Mitigating Motion Blur for Robust 3D Baseball Player Pose Modeling for Pitch Analysis

Using videos to analyze pitchers in baseball can play a vital role in st...
research
07/30/2020

Key Frame Proposal Network for Efficient Pose Estimation in Videos

Human pose estimation in video relies on local information by either est...
research
07/22/2022

Learning Human Kinematics by Modeling Temporal Correlations between Joints for Video-based Human Pose Estimation

Estimating human poses from videos is critical in human-computer interac...

Please sign up or login with your details

Forgot password? Click here to reset