A neural network based post-filter for speech-driven head motion synthesis

07/24/2019
by JinHong Lu, et al.

Although neural networks are widely used for speech-driven head motion synthesis, their output is known to be noisy or discontinuous because of the limited capability of deep neural networks in predicting human motion. Post-processing is therefore required to obtain smooth head motion trajectories for animation. Common post-processing approaches apply a linear filter or use keyframes. However, neither approach is optimal, as there is always a trade-off between smoothness and accuracy. To overcome this limitation, we propose employing a neural network trained to reconstruct head motions. In the objective evaluation, this filter proves effective at de-noising data corrupted by dropout or Gaussian noise. Objective metrics also show that the proposed filter improves the smoothness of joined head motion. A detailed analysis reveals that the proposed filter learns the characteristics of head motion. The subjective evaluation shows that participants were unable to distinguish head motion synthesised with the proposed filter from ground truth, and that they preferred it over motion processed with a Gaussian filter or a moving average.
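The smoothness versus accuracy trade-off of the linear baselines named in the abstract can be illustrated with a minimal sketch. It covers only the moving average and Gaussian filter baselines and the two corruption types (dropout and Gaussian noise); it is not the proposed neural post-filter. The synthetic three-channel trajectory, the function names, the window and sigma values, and the two metrics (mean squared error and mean squared second difference) are all assumptions made for illustration and do not come from the paper.

```python
import numpy as np


def _smooth(x, kernel):
    """Convolve each channel of x (frames, channels) with an odd-length kernel.

    Edge padding keeps the output the same length as the input.
    """
    pad = len(kernel) // 2
    padded = np.pad(x, ((pad, pad), (0, 0)), mode="edge")
    return np.stack(
        [np.convolve(padded[:, c], kernel, mode="valid") for c in range(x.shape[1])],
        axis=1,
    )


def moving_average(x, window=5):
    """Centred moving average; window is assumed to be odd."""
    return _smooth(x, np.ones(window) / window)


def gaussian_filter(x, sigma=2.0):
    """Gaussian smoothing with a kernel truncated at about three sigma."""
    radius = max(1, int(round(3 * sigma)))
    t = np.arange(-radius, radius + 1)
    kernel = np.exp(-0.5 * (t / sigma) ** 2)
    return _smooth(x, kernel / kernel.sum())


def corrupt(x, rng, noise_std=0.0, dropout_rate=0.0):
    """Add Gaussian noise, then zero out random entries (dropout)."""
    noisy = x + rng.normal(0.0, noise_std, size=x.shape)
    keep = rng.random(x.shape) >= dropout_rate
    return noisy * keep


def mse(a, b):
    """Accuracy proxy: mean squared error between two trajectories."""
    return float(np.mean((a - b) ** 2))


def roughness(x):
    """Smoothness proxy: mean squared second difference over time."""
    return float(np.mean(np.diff(x, n=2, axis=0) ** 2))


# Hypothetical head motion: three rotation channels over 200 frames.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 4.0 * np.pi, 200)
clean = np.stack([np.sin(t), 0.5 * np.cos(t), 0.2 * np.sin(2.0 * t)], axis=1)
noisy = corrupt(clean, rng, noise_std=0.1, dropout_rate=0.05)

for name, filtered in [
    ("moving average", moving_average(noisy, window=5)),
    ("gaussian", gaussian_filter(noisy, sigma=2.0)),
]:
    print(f"{name}: mse={mse(filtered, clean):.4f}, roughness={roughness(filtered):.6f}")
```

Widening the window or increasing sigma lowers the roughness measure but eventually flattens genuine motion peaks, which raises the error against the clean trajectory. This is the trade-off the abstract attributes to linear post-filters.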

