Multi-scale Aggregation R-CNN for 2D Multi-person Pose Estimation

05/10/2019
by   Gyeongsik Moon, et al.
4

Multi-person pose estimation from a 2D image is challenging because it requires not only keypoint localization but also human detection. In state-of-the-art top-down methods, multi-scale information is a crucial factor for the accurate pose estimation because it contains both of local information around the keypoints and global information of the entire person. Although multi-scale information allows these methods to achieve the state-of-the-art performance, the top-down methods still require a huge amount of computation because they need to use an additional human detector to feed the cropped human image to their pose estimation model. To effectively utilize multi-scale information with the smaller computation, we propose a multi-scale aggregation R-CNN (MSA R-CNN). It consists of multi-scale RoIAlign block (MS-RoIAlign) and multi-scale keypoint head network (MS-KpsNet) which are designed to effectively utilize multi-scale information. Also, in contrast to previous top-down methods, the MSA R-CNN performs human detection and keypoint localization in a single model, which results in reduced computation. The proposed model achieved the best performance among single model-based methods and its results are comparable to those of separated model-based methods with a smaller amount of computation on the publicly available 2D multi-person keypoint localization dataset.

READ FULL TEXT

page 2

page 3

page 4

page 8

research
08/05/2018

Multi-Scale Supervised Network for Human Pose Estimation

Human pose estimation is an important topic in computer vision with many...
research
03/27/2018

Multi-Scale Structure-Aware Network for Human Pose Estimation

We develop a robust multi-scale structure-aware neural network for human...
research
03/18/2021

OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation

We propose OmniPose, a single-pass, end-to-end trainable framework, that...
research
11/25/2022

MS-PS: A Multi-Scale Network for Photometric Stereo With a New Comprehensive Training Dataset

The photometric stereo (PS) problem consists in reconstructing the 3D-su...
research
02/17/2023

MDPose: Real-Time Multi-Person Pose Estimation via Mixture Density Model

One of the major challenges in multi-person pose estimation is instance-...
research
03/25/2018

Detecting Heads using Feature Refine Net and Cascaded Multi-scale Architecture

This paper presents a method that can accurately detect heads especially...
research
11/26/2019

Multi-Level Network for High-Speed Multi-Person Pose Estimation

In multi-person pose estimation, the left/right joint type discriminatio...

Please sign up or login with your details

Forgot password? Click here to reset