Multi-Hypothesis Pose Networks: Rethinking Top-Down Pose Estimation

01/27/2021
by   Rawal Khirodkar, et al.
8

A key assumption of top-down human pose estimation approaches is their expectation of having a single person present in the input bounding box. This often leads to failures in crowded scenes with occlusions. We propose a novel solution to overcome the limitations of this fundamental assumption. Our Multi-Hypothesis Pose Network (MHPNet) allows for predicting multiple 2D poses within a given bounding box. We introduce a Multi-Hypothesis Attention Block (MHAB) that can adaptively modulate channel-wise feature responses for each hypothesis and is parameter efficient. We demonstrate the efficacy of our approach by evaluating on COCO, CrowdPose, and OCHuman datasets. Specifically, we achieve 70.0 AP on CrowdPose and 42.5 AP on OCHuman test sets, a significant improvement of 2.4 AP and 6.5 AP over the prior art, respectively. When using ground truth bounding boxes for inference, MHPNet achieves an improvement of 0.7 AP on COCO, 0.9 AP on CrowdPose, and 9.1 AP on OCHuman validation sets compared to HRNet. Interestingly, when fewer, high confidence bounding boxes are used, HRNet's performance degrades (by 5 AP) on OCHuman, whereas MHPNet maintains a relatively stable performance (a drop of 1 AP) for the same inputs.

READ FULL TEXT

page 1

page 2

page 3

page 7

page 8

page 13

page 14

research
08/25/2022

Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale Persons

In multi-person 2D pose estimation, the bottom-up methods simultaneously...
research
03/24/2022

Occluded Human Mesh Recovery

Top-down methods for monocular human mesh recovery have two stages: (1) ...
research
06/13/2023

Rethinking pose estimation in crowds: overcoming the detection information-bottleneck and ambiguity

Frequent interactions between individuals are a fundamental challenge fo...
research
07/26/2018

Bottom-up Pose Estimation of Multiple Person with Bounding Box Constraint

In this work, we propose a new method for multi-person pose estimation w...
research
03/04/2020

HintPose

Most of the top-down pose estimation models assume that there exists onl...
research
06/15/2022

LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection

The popular object detection metric 3D Average Precision (3D AP) relies ...
research
05/14/2019

Improving Head Pose Estimation with a Combined Loss and Bounding Box Margin Adjustment

We address a problem of estimating pose of a person's head from its RGB ...

Please sign up or login with your details

Forgot password? Click here to reset