Lite-HRNet: A Lightweight High-Resolution Network

04/13/2021
by   Changqian Yu, et al.

We present an efficient high-resolution network, Lite-HRNet, for human pose estimation. We start by simply applying the efficient shuffle block in ShuffleNet to HRNet (high-resolution network), yielding stronger performance than popular lightweight networks, such as MobileNet, ShuffleNet, and Small HRNet. We find that the heavily used pointwise (1x1) convolutions in shuffle blocks become the computational bottleneck. We introduce a lightweight unit, conditional channel weighting, to replace the costly pointwise (1x1) convolutions in shuffle blocks. The complexity of channel weighting is linear w.r.t. the number of channels, which is lower than the quadratic time complexity of pointwise convolutions. Our solution learns the weights from all the channels and over multiple resolutions that are readily available in the parallel branches of HRNet. It uses the weights as the bridge to exchange information across channels and resolutions, compensating for the role played by the pointwise (1x1) convolution. Lite-HRNet demonstrates superior results on human pose estimation over popular lightweight networks. Moreover, Lite-HRNet can be easily applied to the semantic segmentation task in the same lightweight manner. The code and models are publicly available at https://github.com/HRNet/Lite-HRNet.
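The complexity claim in the abstract can be illustrated with a rough operation count. The sketch below is not the authors' implementation: the function names are hypothetical, the 64x48 feature-map size is an arbitrary choice, and the weighting is simplified to one scalar per channel. It also counts only the element-wise scaling and ignores the cost of computing the weights themselves.

```python
def pointwise_conv_macs(height, width, channels):
    """Multiply-accumulates for a 1x1 convolution with `channels` inputs and outputs.

    Every output channel at every position mixes all input channels,
    so the cost grows with channels squared.
    """
    return height * width * channels * channels


def channel_weighting_macs(height, width, channels):
    """Multiplications for scaling every position of every channel by a weight.

    Assumption: only the element-wise scaling is counted; computing the
    weights is left out of this sketch.
    """
    return height * width * channels


def apply_channel_weights(feature_map, weights):
    """Scale a [channels][height][width] nested list by per-channel weights.

    Assumption: one scalar weight per channel, a simplification chosen
    for illustration.
    """
    return [
        [[value * w for value in row] for row in channel]
        for channel, w in zip(feature_map, weights)
    ]


if __name__ == "__main__":
    # Compare the two costs on a hypothetical 64x48 feature map.
    for c in (32, 64, 128):
        conv = pointwise_conv_macs(64, 48, c)
        weighting = channel_weighting_macs(64, 48, c)
        print(f"channels={c}: 1x1 conv={conv}, weighting={weighting}, "
              f"ratio={conv // weighting}")
```

Under these assumptions the ratio between the two counts equals the number of channels, which is the gap between quadratic and linear growth that the abstract describes.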

research
04/22/2022

Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation

A high-resolution network exhibits remarkable capability in extracting m...
research
07/27/2022

Lightweight and Progressively-Scalable Networks for Semantic Segmentation

Multi-scale learning frameworks have been regarded as a capable class of...
research
04/09/2019

High-Resolution Representations for Labeling Pixels and Regions

High-resolution representation learning plays an essential role in many ...
research
02/09/2023

To Perceive or Not to Perceive: Lightweight Stacked Hourglass Network

Human pose estimation (HPE) is a classical task in computer vision that ...
research
07/16/2020

EfficientHRNet: Efficient Scaling for Lightweight High-Resolution Multi-Person Pose Estimation

Recent years have brought great advancement in 2D human pose estimation....
research
03/21/2023

Human Pose as Compositional Tokens

Human pose is typically represented by a coordinate vector of body joint...
research
12/06/2019

Dynamic Convolutions: Exploiting Spatial Sparsity for Faster Inference

Modern convolutional neural networks apply the same operations on every ...
