Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection

12/19/2021
by   Renjie Li, et al.
7

Keypoint detection plays an important role in a wide range of applications. However, predicting keypoints of small objects such as human hands is a challenging problem. Recent works fuse feature maps of deep Convolutional Neural Networks (CNNs), either via multi-level feature integration or multi-resolution aggregation. Despite achieving some success, the feature fusion approaches increase the complexity and the opacity of CNNs. To address this issue, we propose a novel CNN model named Multi-Scale Deep Supervision Network (P-MSDSNet) that learns feature maps at different scales with deep supervisions to produce attention maps for adaptive feature propagation from layers to layers. P-MSDSNet has a multi-stage architecture which makes it scalable while its deep supervision with spatial attention improves transparency to the feature learning at each stage. We show that P-MSDSNet outperforms the state-of-the-art approaches on benchmark datasets while requiring fewer number of parameters. We also show the application of P-MSDSNet to quantify finger tapping hand movements in a neuroscience study.

READ FULL TEXT

page 6

page 8

research
01/01/2018

Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction

Recent works have shown that exploiting multi-scale representations deep...
research
05/18/2018

MDSSD: Multi-scale Deconvolutional Single Shot Detector for small objects

In order to improve the detection accuracy for objects at different scal...
research
10/02/2018

Multi-scale Convolution Aggregation and Stochastic Feature Reuse for DenseNets

Recently, Convolution Neural Networks (CNNs) obtained huge success in nu...
research
09/29/2020

Attentional Feature Fusion

Feature fusion, the combination of features from different layers or bra...
research
05/07/2017

Deep Visual Attention Prediction

Deep Convolutional Neural Networks (CNNs) have made substantial improvem...
research
10/31/2017

Clothing Retrieval with Visual Attention Model

Clothing retrieval is a challenging problem in computer vision. With the...
research
08/10/2023

Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention

Convolutional neural networks (CNNs) and vision transformers (ViTs) have...

Please sign up or login with your details

Forgot password? Click here to reset