Multi-Person Pose Estimation with Enhanced Channel-wise and Spatial Information

05/09/2019
by   Kai Su, et al.
0

Multi-person pose estimation is an important but challenging problem in computer vision. Although current approaches have achieved significant progress by fusing the multi-scale feature maps, they pay little attention to enhancing the channel-wise and spatial information of the feature maps. In this paper, we propose two novel modules to perform the enhancement of the information for the multi-person pose estimation. First, a Channel Shuffle Module (CSM) is proposed to adopt the channel shuffle operation on the feature maps with different levels, promoting cross-channel information communication among the pyramid feature maps. Second, a Spatial, Channel-wise Attention Residual Bottleneck (SCARB) is designed to boost the original residual unit with attention mechanism, adaptively highlighting the information of the feature maps both in the spatial and channel-wise context. The effectiveness of our proposed modules is evaluated on the COCO keypoint benchmark, and experimental results show that our approach achieves the state-of-the-art results.

READ FULL TEXT

page 1

page 2

page 3

page 6

page 8

research
03/27/2023

Global Relation Modeling and Refinement for Bottom-Up Human Pose Estimation

In this paper, we concern on the bottom-up paradigm in multi-person pose...
research
10/07/2020

Channel Recurrent Attention Networks for Video Pedestrian Retrieval

Full attention, which generates an attention value per element of the in...
research
03/17/2020

Augmented Parallel-Pyramid Net for Attention Guided Pose-Estimation

The target of human pose estimation is to determine body part or joint l...
research
03/19/2021

CE-FPN: Enhancing Channel Information for Object Detection

Feature pyramid network (FPN) has been an effective framework to extract...
research
12/23/2018

Chinese Herbal Recognition based on Competitive Attentional Fusion of Multi-hierarchies Pyramid Features

Convolution neural netwotks (CNNs) are successfully applied in image rec...
research
12/29/2019

Infant brain MRI segmentation with dilated convolution pyramid downsampling and self-attention

In this paper, we propose a dual aggregation network to adaptively aggre...
research
12/20/2021

BAPose: Bottom-Up Pose Estimation with Disentangled Waterfall Representations

We propose BAPose, a novel bottom-up approach that achieves state-of-the...

Please sign up or login with your details

Forgot password? Click here to reset