AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

03/20/2021
by   Yudong Guo, et al.

Generating high-fidelity talking-head video that matches an input audio sequence is a challenging problem that has received considerable attention recently. In this paper, we address this problem with the aid of neural scene representation networks. Our method differs fundamentally from existing methods, which rely on intermediate representations such as 2D landmarks or 3D face models to bridge the gap between audio input and video output. Specifically, the features of the input audio signal are fed directly into a conditional implicit function to generate a dynamic neural radiance field, from which a high-fidelity talking-head video corresponding to the audio signal is synthesized using volume rendering. Another advantage of our framework is that it synthesizes not only the head (with hair) region, as previous methods did, but also the upper body, using two individual neural radiance fields. Experimental results demonstrate that our novel framework can (1) produce high-fidelity and natural results, and (2) support free adjustment of audio signals, viewing directions, and background images.
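
To make the pipeline in the abstract more concrete, the following is a minimal sketch of an audio-conditioned radiance field rendered by standard volume-rendering quadrature. It is not the authors' implementation: `toy_conditional_field` is a hypothetical hand-written stand-in for the learned conditional implicit function, and the names `render_ray`, `near`, `far`, `n_samples`, and `background`, along with the choice to composite leftover transmittance onto a background colour, are assumptions made for illustration only.

```python
import math


def toy_conditional_field(audio_feat, view_dir, point):
    """Hypothetical stand-in for a learned conditional implicit function.

    Maps (audio feature, view direction, 3D point) to (rgb, density).
    A real system would use a trained network; this is a toy formula.
    """
    a = sum(audio_feat) / len(audio_feat)
    # Density: a soft blob at the origin whose width depends on the audio.
    width = 0.5 + 0.1 * math.tanh(a)  # always within [0.4, 0.6]
    r2 = sum(p * p for p in point)
    sigma = 5.0 * math.exp(-r2 / width)
    # Colour: depends on both the view direction and the audio feature.
    rgb = tuple(0.5 + 0.5 * math.tanh(a + d) for d in view_dir)
    return rgb, sigma


def render_ray(origin, direction, audio_feat, near=0.0, far=4.0,
               n_samples=64, background=(0.0, 0.0, 0.0)):
    """Accumulate colour along one ray with volume-rendering quadrature."""
    delta = (far - near) / n_samples
    transmittance = 1.0  # fraction of light not yet absorbed
    color = [0.0, 0.0, 0.0]
    for i in range(n_samples):
        t = near + (i + 0.5) * delta  # midpoint of the i-th interval
        point = tuple(o + t * d for o, d in zip(origin, direction))
        rgb, sigma = toy_conditional_field(audio_feat, direction, point)
        alpha = 1.0 - math.exp(-sigma * delta)  # opacity of this interval
        weight = transmittance * alpha
        for c in range(3):
            color[c] += weight * rgb[c]
        transmittance *= 1.0 - alpha
    # Assumption: whatever light passes through shows the background colour.
    return tuple(c + transmittance * b for c, b in zip(color, background))


# Same ray, two different audio features: the rendered pixel changes.
pixel_a = render_ray((0.0, 0.0, -2.0), (0.0, 0.0, 1.0), [0.0, 0.0])
pixel_b = render_ray((0.0, 0.0, -2.0), (0.0, 0.0, 1.0), [1.0, 1.0])
```

In this sketch, changing `audio_feat`, `direction`, or `background` alters the rendered pixel independently, which mirrors the three kinds of adjustment the abstract lists. A ray that misses the density blob simply returns the background colour.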


research
01/31/2023

GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis

Generating photo-realistic video portrait with arbitrary speech audio is...
research
12/10/2021

HeadNeRF: A Real-time NeRF-based Parametric Head Model

In this paper, we propose HeadNeRF, a novel NeRF-based parametric head m...
research
07/19/2023

MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions

Audio-driven portrait animation aims to synthesize portrait videos that ...
research
12/17/2020

Learning Compositional Radiance Fields of Dynamic Human Heads

Photorealistic rendering of dynamic humans is an important ability for t...
research
11/22/2022

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

While dynamic Neural Radiance Fields (NeRF) have shown success in high-f...
research
06/13/2023

Parametric Implicit Face Representation for Audio-Driven Facial Reenactment

Audio-driven facial reenactment is a crucial technique that has a range ...
research
03/29/2023

HyperDiffusion: Generating Implicit Neural Fields with Weight-Space Diffusion

Implicit neural fields, typically encoded by a multilayer perceptron (ML...
