Object Detection-Based Variable Quantization Processing

09/01/2020
by Likun Liu, et al.

In this paper, we propose a preprocessing method that makes conventional image and video encoders content-aware. After our preprocessing, a traditional encoder can be run at a higher quality parameter without increasing the output size. Each image or still frame first passes through an object detector. If objects are detected, the properties of the detection result determine the parameters of the subsequent steps; if no object is detected in the frame, the system is bypassed. The method uses an adaptive quantization process to decide which portion of the data to drop. It is based primarily on JPEG compression theory and is best suited to JPEG-based encoders such as JPEG and Motion JPEG. However, other DCT-based encoders, such as MPEG part 2 and H.264, can also benefit from it. In the experiments, we compare MS-SSIM at the same bitrate, and bitrate at similar MS-SSIM. Because the method is based on human perception, the overall viewing experience is better than with direct encoding, even at similar MS-SSIM.
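The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a grayscale `uint8` image, takes the detector output as a ready-made list of `(x0, y0, x1, y1)` boxes, and replaces a JPEG quantization table with a single uniform step per block. The names `preprocess`, `step_fg`, and `step_bg` are hypothetical.

```python
import numpy as np

N = 8  # JPEG-style block size


def dct_matrix(n=N):
    """Orthonormal DCT-II basis matrix C, so that C @ x is the 1-D DCT of x."""
    k = np.arange(n).reshape(-1, 1)
    i = np.arange(n).reshape(1, -1)
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    c[0, :] = np.sqrt(1.0 / n)
    return c


C = dct_matrix()


def block_overlaps(y, x, boxes):
    """True if the N x N block with top-left corner (x, y) intersects any box."""
    for (x0, y0, x1, y1) in boxes:
        if x < x1 and x + N > x0 and y < y1 and y + N > y0:
            return True
    return False


def preprocess(gray, boxes, step_fg=2.0, step_bg=40.0):
    """Quantize DCT coefficients finely inside detected boxes, coarsely elsewhere.

    Assumption: `boxes` comes from some object detector (not shown here).
    If no object was detected, the frame is passed through unchanged.
    """
    if not boxes:
        return gray.copy()  # bypass: nothing detected in this frame
    h, w = gray.shape
    out = gray.astype(np.float64).copy()
    # Only full N x N blocks are processed; edge remainders are left as-is.
    for y in range(0, h - h % N, N):
        for x in range(0, w - w % N, N):
            block = out[y:y + N, x:x + N] - 128.0        # level shift
            coeffs = C @ block @ C.T                     # forward 2-D DCT
            step = step_fg if block_overlaps(y, x, boxes) else step_bg
            coeffs = np.round(coeffs / step) * step      # quantize, dequantize
            out[y:y + N, x:x + N] = C.T @ coeffs @ C + 128.0  # inverse DCT
    return np.clip(np.round(out), 0, 255).astype(gray.dtype)
```

The output would then be handed to an unmodified encoder. Because background blocks already carry less high-frequency detail, the encoder may spend fewer bits on them, which is the effect the abstract relies on; the actual step sizes and their dependence on the detection result are choices made here for illustration only.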

research
07/18/2018

Video Time: Properties, Encoders and Evaluation

Time-aware encoding of frame sequences in a video is a fundamental probl...
research
01/25/2023

Rate-Perception Optimized Preprocessing for Video Coding

In the past decades, lots of progress have been done in the video compre...
research
03/03/2021

User Generated HDR Gaming Video Streaming: Dataset, Codec Comparison and Challenges

Gaming video streaming services have grown tremendously in the past few ...
research
02/18/2021

On the advantages of stochastic encoders

Stochastic encoders have been used in rate-distortion theory and neural ...
research
02/17/2022

Non-Autoregressive ASR with Self-Conditioned Folded Encoders

This paper proposes CTC-based non-autoregressive ASR with self-condition...
research
11/02/2022

Transformer-based encoder-encoder architecture for Spoken Term Detection

The paper presents a method for spoken term detection based on the Trans...
research
09/23/2017

Calibrated steganalysis of mp3stego in multi-encoder scenario

Comparing popularity of mp3 and wave with the amount of works published ...
