Multi-Scale Dual-Branch Fully Convolutional Network for Hand Parsing

05/24/2019
by Yang Lu, et al.

Recently, fully convolutional neural networks (FCNs) have achieved strong performance in image parsing, including scene parsing and object parsing. Unlike generic object parsing, hand parsing is more challenging because hands are small, structurally complex, heavily self-occluded, and ambiguous in texture. In this paper, we propose a novel parsing framework, the Multi-Scale Dual-Branch Fully Convolutional Network (MSDB-FCN), for hand parsing. Our network employs a Dual-Branch architecture to extract features of the hand area, focusing on the hand itself. These features are then used to generate multi-scale features with a pyramid pooling strategy. To better encode the multi-scale features, we design a Deconvolution and Bilinear Interpolation Block (DB-Block) that upsamples and merges features of different scales. Data imbalance is a common problem in many computer vision tasks, hand parsing included. To address it, we propose a generalization of Focal Loss to multi-class classification, namely Multi-Class Balanced Focal Loss. Extensive experiments on the RHD-PARSING dataset demonstrate that MSDB-FCN achieves state-of-the-art performance for hand parsing.
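The abstract does not give the formula for Multi-Class Balanced Focal Loss, so the following is only a minimal sketch of one plausible reading: the standard focal loss term, -alpha_t * (1 - p_t)^gamma * log(p_t), applied to softmax probabilities with one weight per class. The function names, the inverse-frequency weighting in `inverse_frequency_weights`, and the default `gamma=2.0` are assumptions made for illustration and may differ from the paper's actual definition.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one pixel's class logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def inverse_frequency_weights(class_counts: Sequence[int]) -> List[float]:
    """Hypothetical balancing scheme (an assumption, not from the paper):
    weights proportional to 1 / count, rescaled so their mean is 1."""
    inv = [1.0 / c for c in class_counts]
    scale = len(inv) / sum(inv)
    return [w * scale for w in inv]


def multiclass_focal_loss(
    logits: Sequence[float],
    target: int,
    alpha: Sequence[float],
    gamma: float = 2.0,
) -> float:
    """Focal loss for a single pixel with a per-class weight alpha[target].

    With gamma = 0 and all alpha equal to 1 this reduces to ordinary
    softmax cross-entropy.
    """
    p_t = softmax(logits)[target]
    p_t = max(p_t, 1e-12)  # guard against log(0)
    return -alpha[target] * (1.0 - p_t) ** gamma * math.log(p_t)


if __name__ == "__main__":
    # Illustrative pixel counts: background dominates, two small hand parts.
    alpha = inverse_frequency_weights([9000, 600, 400])
    easy = multiclass_focal_loss([4.0, 0.0, 0.0], target=0, alpha=alpha)
    hard = multiclass_focal_loss([4.0, 0.0, 0.0], target=2, alpha=alpha)
    print(f"well-classified background pixel: {easy:.6f}")
    print(f"misclassified rare-class pixel:   {hard:.6f}")
```

In this sketch the modulating factor (1 - p_t)^gamma shrinks the contribution of pixels the network already classifies confidently, while the per-class weight raises the contribution of rare classes, which is the general idea behind using a focal-style loss for imbalanced parsing labels.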
