The First Principles of Deep Learning and Compression

04/04/2022
by   Max Ehrlich, et al.
0

The deep learning revolution incited by the 2012 Alexnet paper has been transformative for the field of computer vision. Many problems which were severely limited using classical solutions are now seeing unprecedented success. The rapid proliferation of deep learning methods has led to a sharp increase in their use in consumer and embedded applications. One consequence of consumer and embedded applications is lossy multimedia compression which is required to engineer the efficient storage and transmission of data in these real-world scenarios. As such, there has been increased interest in a deep learning solution for multimedia compression which would allow for higher compression ratios and increased visual quality. The deep learning approach to multimedia compression, so called Learned Multimedia Compression, involves computing a compressed representation of an image or video using a deep network for the encoder and the decoder. While these techniques have enjoyed impressive academic success, their industry adoption has been essentially non-existent. Classical compression techniques like JPEG and MPEG are too entrenched in modern computing to be easily replaced. This dissertation takes an orthogonal approach and leverages deep learning to improve the compression fidelity of these classical algorithms. This allows the incredible advances in deep learning to be used for multimedia compression without threatening the ubiquity of the classical methods. The key insight of this work is that methods which are motivated by first principles, i.e., the underlying engineering decisions that were made when the compression algorithms were developed, are more effective than general methods. By encoding prior knowledge into the design of the algorithm, the flexibility, performance, and/or accuracy are improved at the cost of generality...

READ FULL TEXT
research
11/17/2020

Analyzing and Mitigating Compression Defects in Deep Learning

With the proliferation of deep learning methods, many computer vision pr...
research
12/30/2020

Towards Robust Data Hiding Against (JPEG) Compression: A Pseudo-Differentiable Deep Learning Approach

Data hiding is one widely used approach for protecting authentication an...
research
01/31/2022

Leveraging Bitstream Metadata for Fast and Accurate Video Compression Correction

Video compression is a central feature of the modern internet powering t...
research
04/03/2021

A Deep Learning Scheme for Efficient Multimedia IoT Data Compression

Given the voluminous nature of the multimedia sensed data, the Multimedi...
research
07/10/2018

The Helmholtz Method: Using Perceptual Compression to Reduce Machine Learning Complexity

This paper proposes a fundamental answer to a frequently asked question ...
research
04/06/2021

A Decade of Research for Image Compression In Multimedia Laboratory

With the advancement of technology, we have supercomputers with high pro...
research
05/26/2021

Towards Transparent Application of Machine Learning in Video Processing

Machine learning techniques for more efficient video compression and vid...

Please sign up or login with your details

Forgot password? Click here to reset