Saliency-Driven Versatile Video Coding for Neural Object Detection

03/11/2022
by   Kristian Fischer, et al.

Saliency-driven image and video coding for humans has gained importance in the recent past. In this paper, we propose such a saliency-driven coding framework for the video coding for machines task using the latest video coding standard, Versatile Video Coding (VVC). To determine the salient regions before encoding, we employ the real-time-capable object detection network You Only Look Once (YOLO) in combination with a novel decision criterion. To measure the coding quality for a machine, the state-of-the-art object segmentation network Mask R-CNN is applied to the decoded frame. From extensive simulations we find that, compared to the reference VVC with a constant quality, up to 29% of the bitrate can be saved at the same detection accuracy at the decoder side by applying the proposed saliency-driven framework. In addition, we compare YOLO against other, more traditional saliency detection methods.
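
The idea of turning detector output into a block-level quality map can be illustrated with a minimal sketch. The abstract does not specify the decision criterion, so the rule used here (a coding tree unit is salient if any bounding box overlaps it) is only an assumption, as are the function name `build_ctu_qp_map` and the two QP values. The CTU size of 128 matches the largest CTU size VVC allows.

```python
import math

def build_ctu_qp_map(boxes, frame_w, frame_h, ctu=128,
                     qp_salient=27, qp_background=37):
    """Return a per-CTU QP grid (list of rows) for one frame.

    boxes: iterable of (x0, y0, x1, y1) pixel boxes, e.g. from a detector.
    The overlap rule and both QP values are illustrative assumptions,
    not the criterion or settings used in the paper.
    """
    cols = math.ceil(frame_w / ctu)
    rows = math.ceil(frame_h / ctu)
    # Start with every CTU coded at the coarser background QP.
    qp = [[qp_background] * cols for _ in range(rows)]
    for x0, y0, x1, y1 in boxes:
        # Clip the box to the frame and skip empty boxes.
        x0, y0 = max(0, x0), max(0, y0)
        x1, y1 = min(frame_w, x1), min(frame_h, y1)
        if x1 <= x0 or y1 <= y0:
            continue
        # Range of CTUs that the box overlaps (end index exclusive).
        c0, c1 = int(x0 // ctu), math.ceil(x1 / ctu)
        r0, r1 = int(y0 // ctu), math.ceil(y1 / ctu)
        for r in range(r0, r1):
            for c in range(c0, c1):
                qp[r][c] = qp_salient  # finer quantization on salient CTUs
    return qp

# Hypothetical detection in a 1920x1080 frame.
qp_map = build_ctu_qp_map([(100, 100, 300, 200)], 1920, 1080)
```

A lower QP means finer quantization, so the salient CTUs keep more detail while the background is compressed harder. How such a map is passed to an encoder depends on the encoder and is not covered here.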

