DeepAI AI Chat
Log In Sign Up

Object Detection-Based Variable Quantization Processing

by   Likun Liu, et al.

In this paper, we propose a preprocessing method for conventional image and video encoders that can make these existing encoders content-aware. By going through our process, a higher quality parameter could be set on a traditional encoder without increasing the output size. A still frame or an image will firstly go through an object detector. Either the properties of the detection result will decide the parameters of the following procedures, or the system will be bypassed if no object is detected in the given frame. The processing method utilizes an adaptive quantization process to determine the portion of data to be dropped. This method is primarily based on the JPEG compression theory and is optimum for JPEG-based encoders such as JPEG encoders and the Motion JPEG encoders. However, other DCT-based encoders like MPEG part 2, H.264, etc. can also benefit from this method. In the experiments, we compare the MS-SSIM under the same bitrate as well as similar MS-SSIM but enhanced bitrate. As this method is based on human perception, even with similar MS-SSIM, the overall watching experience will be better than the direct encoded ones.


page 2

page 9

page 12


Video Time: Properties, Encoders and Evaluation

Time-aware encoding of frame sequences in a video is a fundamental probl...

Rate-Perception Optimized Preprocessing for Video Coding

In the past decades, lots of progress have been done in the video compre...

User Generated HDR Gaming Video Streaming: Dataset, Codec Comparison and Challenges

Gaming video streaming services have grown tremendously in the past few ...

On the advantages of stochastic encoders

Stochastic encoders have been used in rate-distortion theory and neural ...

Non-Autoregressive ASR with Self-Conditioned Folded Encoders

This paper proposes CTC-based non-autoregressive ASR with self-condition...

Transformer-based encoder-encoder architecture for Spoken Term Detection

The paper presents a method for spoken term detection based on the Trans...

Calibrated steganalysis of mp3stego in multi-encoder scenario

Comparing popularity of mp3 and wave with the amount of works published ...