Hierarchical binary CNNs for landmark localization with limited resources

08/14/2018
by   Adrian Bulat, et al.
8

Our goal is to design architectures that retain the groundbreaking performance of Convolutional Neural Networks (CNNs) for landmark localization and at the same time are lightweight, compact and suitable for applications with limited computational resources. To this end, we make the following contributions: (a) we are the first to study the effect of neural network binarization on localization tasks, namely human pose estimation and face alignment. We exhaustively evaluate various design choices, identify performance bottlenecks, and more importantly propose multiple orthogonal ways to boost performance. (b) Based on our analysis, we propose a novel hierarchical, parallel and multi-scale residual architecture that yields large performance improvement over the standard bottleneck block while having the same number of parameters, thus bridging the gap between the original network and its binarized counterpart. (c) We perform a large number of ablation studies that shed light on the properties and the performance of the proposed block. (d) We present results for experiments on the most challenging datasets for human pose estimation and face alignment, reporting in many cases state-of-the-art performance. (e) We further provide additional results for the problem of facial part segmentation. Code can be downloaded from https://www.adrianbulat.com/binary-cnn-landmark

READ FULL TEXT

page 4

page 5

page 11

page 13

page 14

research
03/02/2017

Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources

Our goal is to design architectures that retain the groundbreaking perfo...
research
04/22/2022

Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation

A high-resolution network exhibits remarkable capability in extracting m...
research
02/27/2021

Deep Active Shape Model for Face Alignment and Pose Estimation

Active Shape Model (ASM) is a statistical model of object shapes that re...
research
08/07/2018

Quantized Densely Connected U-Nets for Efficient Landmark Localization

In this paper, we propose quantized densely connected U-Nets for efficie...
research
06/05/2019

Lightweight Real-time Makeup Try-on in Mobile Browsers with Tiny CNN Models for Facial Tracking

Recent works on convolutional neural networks (CNNs) for facial alignmen...
research
12/02/2019

Efficient Convolutional Neural Networks for Depth-Based Multi-Person Pose Estimation

Achieving robust multi-person 2D body landmark localization and pose est...
research
02/19/2018

Disentangling 3D Pose in A Dendritic CNN for Unconstrained 2D Face Alignment

Heatmap regression has been used for landmark localization for quite a w...

Please sign up or login with your details

Forgot password? Click here to reset