A Hybrid Deep Animation Codec for Low-bitrate Video Conferencing

07/27/2022
by Goluck Konuko, et al.

Deep generative models, and particularly facial animation schemes, can be used in video conferencing applications to efficiently compress a video through a sparse set of keypoints, without the need to transmit dense motion vectors. While these schemes bring significant coding gains over conventional video codecs at low bitrates, their performance saturates quickly when the available bandwidth increases. In this paper, we propose a layered, hybrid coding scheme to overcome this limitation. Specifically, we extend a codec based on facial animation by adding an auxiliary stream consisting of a very low bitrate version of the video, obtained through a conventional video codec (e.g., HEVC). The animated and auxiliary videos are combined through a novel fusion module. Our results show consistent average BD-Rate gains in excess of -30% on a dataset of video conferencing sequences, extending the operational range of bitrates beyond that of a facial animation codec alone.
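For readers unfamiliar with the BD-Rate figure quoted in the abstract, the following is a minimal sketch of how a Bjontegaard delta rate is commonly computed. It is not the authors' evaluation code: the function name `bd_rate`, the cubic fit of log-rate against quality, and the sample numbers are all assumptions made for illustration. A negative result means the test codec needs less bitrate than the anchor for the same quality.

```python
import numpy as np


def bd_rate(rate_anchor, quality_anchor, rate_test, quality_test):
    """Approximate Bjontegaard delta rate (in percent) of a test codec
    relative to an anchor codec. Hypothetical helper, for illustration only."""
    log_ra = np.log(np.asarray(rate_anchor, dtype=float))
    log_rt = np.log(np.asarray(rate_test, dtype=float))
    qa = np.asarray(quality_anchor, dtype=float)
    qt = np.asarray(quality_test, dtype=float)

    # Fit log-rate as a cubic polynomial of quality for each codec.
    poly_a = np.polyfit(qa, log_ra, 3)
    poly_t = np.polyfit(qt, log_rt, 3)

    # Integrate both fits over the quality interval covered by both curves.
    lo = max(qa.min(), qt.min())
    hi = min(qa.max(), qt.max())
    int_a = np.polyval(np.polyint(poly_a), hi) - np.polyval(np.polyint(poly_a), lo)
    int_t = np.polyval(np.polyint(poly_t), hi) - np.polyval(np.polyint(poly_t), lo)

    # Average log-rate difference, converted back to a percentage.
    avg_diff = (int_t - int_a) / (hi - lo)
    return (np.exp(avg_diff) - 1.0) * 100.0


if __name__ == "__main__":
    # Made-up rate/quality points (kbps, dB); not data from the paper.
    anchor_rates = [100.0, 200.0, 400.0, 800.0]
    qualities = [30.0, 33.0, 36.0, 39.0]
    test_rates = [0.7 * r for r in anchor_rates]
    print(f"BD-Rate: {bd_rate(anchor_rates, qualities, test_rates, qualities):.2f}%")
```

In this made-up example the test codec uses 70% of the anchor's bitrate at every quality point, so the computed BD-Rate is approximately -30%.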


