From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications

08/30/2023
by   Shreyank N Gowda, et al.
0

Recent advancements in deep learning and computer vision have led to a surge of interest in generating realistic talking heads. This paper presents a comprehensive survey of state-of-the-art methods for talking head generation. We systematically categorises them into four main approaches: image-driven, audio-driven, video-driven and others (including neural radiance fields (NeRF), and 3D-based methods). We provide an in-depth analysis of each method, highlighting their unique contributions, strengths, and limitations. Furthermore, we thoroughly compare publicly available models, evaluating them on key aspects such as inference time and human-rated quality of the generated outputs. Our aim is to provide a clear and concise overview of the current landscape in talking head generation, elucidating the relationships between different approaches and identifying promising directions for future research. This survey will serve as a valuable reference for researchers and practitioners interested in this rapidly evolving field.

READ FULL TEXT

page 23

page 24

page 25

research
05/07/2020

What comprises a good talking-head video generation?: A Survey and Benchmark

Over the years, performance evaluation has become essential in computer ...
research
10/06/2021

Deep Neural Networks and Tabular Data: A Survey

Heterogeneous tabular data are the most commonly used form of data and a...
research
07/20/2023

Human Motion Generation: A Survey

Human motion generation aims to generate natural human pose sequences an...
research
08/25/2020

Image Colorization: A Survey and Dataset

Image colorization is an essential image processing and computer vision ...
research
10/04/2016

Image Aesthetic Assessment: An Experimental Survey

This survey aims at reviewing recent computer vision techniques used in ...
research
06/14/2023

Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey

Speaker-independent VSR is a complex task that involves identifying spok...
research
07/07/2023

A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision

Deep learning has the potential to revolutionize sports performance, wit...

Please sign up or login with your details

Forgot password? Click here to reset