Multi-modal Face Pose Estimation with Multi-task Manifold Deep Learning

12/18/2017
by   Chaoqun Hong, et al.

Human face pose estimation aims to estimate gaze direction or head posture from 2D images. It provides important information for applications such as communicative gestures and saliency detection, and has attracted considerable attention recently. However, it is challenging because of complex backgrounds, varied orientations, and varying visibility of facial appearance. A descriptive representation of face images, and a mapping from that representation to poses, are therefore critical. In this paper, we make use of multi-modal data and propose a novel face pose estimation method built on a deep learning framework named Multi-task Manifold Deep Learning (M^2DL). It is based on feature extraction with improved deep neural networks and on learning the multi-modal mapping relationship with multi-task learning. In the proposed framework, Manifold Regularized Convolutional Layers (MRCL) improve traditional convolutional layers by learning the relationship among the outputs of neurons. In the proposed mapping relationship learning method, different modalities of face representations are naturally combined to learn the mapping function from face images to poses, which improves the computed multi-task mapping model. Experimental results on three challenging benchmark datasets, DPOSE, HPID and BKHPD, demonstrate the outstanding performance of M^2DL.
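The abstract does not spell out how MRCL is formulated, so the following is only a generic illustration of manifold regularization, not the authors' method. It assumes a common graph-Laplacian smoothness penalty: outputs of samples that are close in input space are encouraged to stay close. The function names, the Gaussian affinity, and the `sigma` parameter are all hypothetical choices made for this sketch.

```python
import math


def gaussian_affinity(points, sigma=1.0):
    """Symmetric affinity matrix W (assumed Gaussian kernel, zero diagonal)."""
    n = len(points)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d2 = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
            W[i][j] = W[j][i] = math.exp(-d2 / (2.0 * sigma ** 2))
    return W


def graph_laplacian(W):
    """Unnormalized graph Laplacian L = D - W, where D is the degree matrix."""
    n = len(W)
    return [[(sum(W[i]) if i == j else 0.0) - W[i][j] for j in range(n)]
            for i in range(n)]


def manifold_penalty(outputs, W):
    """Pairwise form: 0.5 * sum_ij W_ij * ||f_i - f_j||^2."""
    total = 0.0
    for i, f_i in enumerate(outputs):
        for j, f_j in enumerate(outputs):
            d2 = sum((a - b) ** 2 for a, b in zip(f_i, f_j))
            total += W[i][j] * d2
    return 0.5 * total


def trace_penalty(outputs, L):
    """Equivalent matrix form: tr(F^T L F), with the rows of F being outputs."""
    n, dim = len(outputs), len(outputs[0])
    return sum(outputs[i][k] * L[i][j] * outputs[j][k]
               for k in range(dim) for i in range(n) for j in range(n))


# Toy data (made up): three inputs and the outputs some layer produced for them.
inputs = [[0.0, 0.0], [0.1, 0.0], [3.0, 3.0]]
outputs = [[1.0, 0.5], [0.9, 0.4], [-2.0, 1.0]]

W = gaussian_affinity(inputs)
penalty = manifold_penalty(outputs, W)
```

In a training loop, a term like `penalty` would typically be scaled by a weight and added to the task loss; how M^2DL actually combines its terms is not stated in the abstract.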


