Deep High-Resolution Representation Learning for Visual Recognition

08/20/2019
by   Jingdong Wang, et al.
0

High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named as High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) Connect the high-to-low resolution convolution streams in parallel; (ii) Repeatedly exchange the information across resolutions. The benefit is that the resulting representation is semantically richer and spatially more precise. We show the superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, suggesting that the HRNet is a stronger backbone for computer vision problems. All the codes are available at <https://github.com/HRNet>.

READ FULL TEXT

page 2

page 5

page 7

page 8

page 10

research
04/09/2019

High-Resolution Representations for Labeling Pixels and Regions

High-resolution representation learning plays an essential role in many ...
research
02/25/2019

Deep High-Resolution Representation Learning for Human Pose Estimation

This is an official pytorch implementation of Deep High-Resolution Repre...
research
09/15/2022

A Robotic Visual Grasping Design: Rethinking Convolution Neural Network with High-Resolutions

High-resolution representations are important for vision-based robotic g...
research
07/27/2020

3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning

3D human shape and pose estimation from monocular images has been an act...
research
10/24/2018

Fast and accurate object detection in high resolution 4K and 8K video using GPUs

Machine learning has celebrated a lot of achievements on computer vision...
research
07/02/2020

JUMPS: Joints Upsampling Method for Pose Sequences

Human Pose Estimation is a low-level task useful for surveillance, human...
research
02/20/2020

Generalized sampling with functional principal components for high-resolution random field estimation

In this paper, we take a statistical approach to the problem of recoveri...

Please sign up or login with your details

Forgot password? Click here to reset