Adaptive Dilated Convolution For Human Pose Estimation

07/22/2021
by   Zhengxiong Luo, et al.
0

Most existing human pose estimation (HPE) methods exploit multi-scale information by fusing feature maps of four different spatial sizes, 1/4, 1/8, 1/16, and 1/32 of the input image. There are two drawbacks of this strategy: 1) feature maps of different spatial sizes may be not well aligned spatially, which potentially hurts the accuracy of keypoint location; 2) these scales are fixed and inflexible, which may restrict the generalization ability over various human sizes. Towards these issues, we propose an adaptive dilated convolution (ADC). It can generate and fuse multi-scale features of the same spatial sizes by setting different dilation rates for different channels. More importantly, these dilation rates are generated by a regression module. It enables ADC to adaptively adjust the fused scales and thus ADC may generalize better to various human sizes. ADC can be end-to-end trained and easily plugged into existing methods. Extensive experiments show that ADC can bring consistent improvements to various HPE methods. The source codes will be released for further research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2019

Bottom-up Higher-Resolution Networks for Multi-Person Pose Estimation

In this paper, we are interested in bottom-up multi-person human pose es...
research
12/30/2020

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation

Heatmap regression has become the most prevalent choice for nowadays hum...
research
03/30/2016

Structured Feature Learning for Pose Estimation

In this paper, we propose a structured feature learning framework to rea...
research
03/27/2018

Multi-Scale Structure-Aware Network for Human Pose Estimation

We develop a robust multi-scale structure-aware neural network for human...
research
12/13/2020

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations

In this paper, we propose an efficient human pose estimation network (DA...
research
07/05/2018

A Single Shot Text Detector with Scale-adaptive Anchors

Currently, most top-performing text detection networks tend to employ fi...
research
09/04/2020

SSP-Net: Scalable Sequential Pyramid Networks for Real-Time 3D Human Pose Regression

In this paper we propose a highly scalable convolutional neural network,...

Please sign up or login with your details

Forgot password? Click here to reset