Deep Convolutional Neural Network-based Bernoulli Heatmap for Head Pose Estimation

by   Zhongxu Hu, et al.

Head pose estimation is a crucial problem for many tasks, such as driver attention, fatigue detection, and human behaviour analysis. It is well known that neural networks are better at handling classification problems than regression problems. It is an extremely nonlinear process to let the network output the angle value directly for optimization learning, and the weight constraint of the loss function will be relatively weak. This paper proposes a novel Bernoulli heatmap for head pose estimation from a single RGB image. Our method can achieve the positioning of the head area while estimating the angles of the head. The Bernoulli heatmap makes it possible to construct fully convolutional neural networks without fully connected layers and provides a new idea for the output form of head pose estimation. A deep convolutional neural network (CNN) structure with multiscale representations is adopted to maintain high-resolution information and low-resolution information in parallel. This kind of structure can maintain rich, high-resolution representations. In addition, channelwise fusion is adopted to make the fusion weights learnable instead of simple addition with equal weights. As a result, the estimation is spatially more precise and potentially more accurate. The effectiveness of the proposed method is empirically demonstrated by comparing it with other state-of-the-art methods on public datasets.


page 1

page 2

page 3

page 4

page 6

page 7

page 8


From Depth Data to Head Pose Estimation: a Siamese approach

The correct estimation of the head pose is a problem of the great import...

A Vector-based Representation to Enhance Head Pose Estimation

This paper proposes to use the three vectors in a rotation matrix as the...

POSEidon: Face-from-Depth for Driver Pose Estimation

Fast and accurate upper-body and head pose estimation is a key task for ...

Towards Real-Time Head Pose Estimation: Exploring Parameter-Reduced Residual Networks on In-the-wild Datasets

Head poses are a key component of human bodily communication and thus a ...

Numerical Coordinate Regression with Convolutional Neural Networks

We study deep learning approaches to inferring numerical coordinates for...

A Simple Nadaraya-Watson Head can offer Explainable and Calibrated Classification

In this paper, we empirically analyze a simple, non-learnable, and nonpa...

Multi-camera Torso Pose Estimation using Graph Neural Networks

Estimating the location and orientation of humans is an essential skill ...

Please sign up or login with your details

Forgot password? Click here to reset