VCT: A Video Compression Transformer

06/15/2022
by Fabian Mentzer, et al.

We show how transformers can be used to vastly simplify neural video compression. Previous methods have relied on an increasing number of architectural biases and priors, including motion prediction and warping operations, resulting in complex models. Instead, we independently map input frames to representations and use a transformer to model their dependencies, letting it predict the distribution of future representations given the past. The resulting video compression transformer outperforms previous methods on standard video compression data sets. Experiments on synthetic data show that our model learns to handle complex motion patterns such as panning, blurring and fading purely from data. Our approach is easy to implement, and we release code to facilitate future research.
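To make the central idea concrete, the following minimal sketch shows why predicting the distribution of the next frame's representation from the past helps compression: an ideal entropy coder spends about -log2 p bits on a symbol that the model assigned probability p. This is not the released VCT code. The token values, the alphabet size, and the function `toy_temporal_model` are hypothetical, and the toy predictor (which simply assumes each token tends to repeat from the previous frame) is only a stand-in for the transformer described in the abstract.

```python
import math

def ideal_bits(symbols, probs):
    """Ideal entropy-coded length in bits: sum of -log2 p(symbol)."""
    return sum(-math.log2(p[s]) for s, p in zip(symbols, probs))

def uniform_model(num_symbols, alphabet_size):
    """Baseline that ignores the past: every symbol is equally likely."""
    p = {s: 1.0 / alphabet_size for s in range(alphabet_size)}
    return [p] * num_symbols

def toy_temporal_model(previous, alphabet_size, stay_prob=0.9):
    """Hypothetical stand-in for the transformer (an assumption, not VCT):
    each token most likely repeats the co-located token of the previous frame."""
    rest = (1.0 - stay_prob) / (alphabet_size - 1)
    return [
        {s: (stay_prob if s == prev else rest) for s in range(alphabet_size)}
        for prev in previous
    ]

# Two made-up frames, each already mapped independently to quantized tokens.
ALPHABET = 8
frame_prev = [3, 3, 5, 1, 0, 7, 2, 2]
frame_next = [3, 3, 5, 1, 0, 7, 2, 4]  # only the last token changed

bits_uniform = ideal_bits(frame_next, uniform_model(len(frame_next), ALPHABET))
bits_temporal = ideal_bits(frame_next, toy_temporal_model(frame_prev, ALPHABET))

print(f"no temporal model: {bits_uniform:.2f} bits")
print(f"conditioned on the previous frame: {bits_temporal:.2f} bits")
```

In this toy setting the conditional model needs far fewer bits than the uniform baseline because most tokens are predicted with high probability. A learned model would aim to achieve the same effect on real video without hand-coded assumptions about motion.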

research
07/12/2023

AICT: An Adaptive Image Compression Transformer

Motivated by the efficiency investigation of the Transformer-based transf...
research
07/05/2023

Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression

Recently, the performance of neural image compression (NIC) has steadily...
research
12/11/2019

Deep motion estimation for parallel inter-frame prediction in video compression

Standard video codecs rely on optical flow to guide inter-frame predicti...
research
08/30/2023

MMVP: Motion-Matrix-based Video Prediction

A central challenge of video prediction lies where the system has to rea...
research
06/24/2019

LMVP: Video Predictor with Leaked Motion Information

We propose a Leaked Motion Video Predictor (LMVP) to predict future fram...
research
04/04/2023

Blockwise Compression of Transformer-based Models without Retraining

Transformer-based models, represented by GPT-3, ChatGPT, and GPT-4, have...
research
08/14/2023

Neural radiance fields in the industrial and robotics domain: applications, research opportunities and use cases

The proliferation of technologies, such as extended reality (XR), has in...
