Towards Accurate Multi-person Pose Estimation in the Wild

01/06/2017
by   George Papandreou, et al.
1

We propose a method for multi-person detection and 2-D pose estimation that achieves state-of-art results on the challenging COCO keypoints task. It is a simple, yet powerful, top-down approach consisting of two stages. In the first stage, we predict the location and scale of boxes which are likely to contain people; for this we use the Faster RCNN detector. In the second stage, we estimate the keypoints of the person potentially contained in each proposed bounding box. For each keypoint type we predict dense heatmaps and offsets using a fully convolutional ResNet. To combine these outputs we introduce a novel aggregation procedure to obtain highly localized keypoint predictions. We also use a novel form of keypoint-based Non-Maximum-Suppression (NMS), instead of the cruder box-level NMS, and a novel form of keypoint-based confidence score estimation, instead of box-level scoring. Trained on COCO data alone, our final system achieves average precision of 0.649 on the COCO test-dev set and the 0.643 test-standard sets, outperforming the winner of the 2016 COCO keypoints challenge and other recent state-of-art. Further, by using additional in-house labeled data we obtain an even higher average precision of 0.685 on the test-dev set and 0.673 on the test-standard set, more than 5 method on the same dataset.

READ FULL TEXT

page 3

page 4

page 5

page 6

research
11/20/2017

Cascaded Pyramid Network for Multi-Person Pose Estimation

The topic of multi-person pose estimation has been largely improved rece...
research
03/09/2020

Learning Delicate Local Representations for Multi-Person Pose Estimation

In this paper, we propose a novel method called Residual Steps Network (...
research
03/04/2020

HintPose

Most of the top-down pose estimation models assume that there exists onl...
research
05/02/2018

Bi-directional Graph Structure Information Model for Multi-Person Pose Estimation

In this paper, we propose a novel multi-stage network architecture with ...
research
03/22/2018

PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model

We present a box-free bottom-up approach for the tasks of pose estimatio...
research
01/04/2022

Learning Quality-aware Representation for Multi-person Pose Regression

Off-the-shelf single-stage multi-person pose regression methods generall...
research
06/19/2014

R-CNNs for Pose Estimation and Action Detection

We present convolutional neural networks for the tasks of keypoint (pose...

Please sign up or login with your details

Forgot password? Click here to reset