Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features

04/03/2023
by   Takahiro Shindo, et al.
0

With advances in image recognition technology based on deep learning, automatic video analysis by Artificial Intelligence is becoming more widespread. As the amount of video used for image recognition increases, efficient compression methods for such video data are necessary. In general, when the image quality deteriorates due to image encoding, the image recognition accuracy also falls. Therefore, in this paper, we propose a neural-network-based approach to improve image recognition accuracy, especially the object detection accuracy by applying post-processing to the encoded video. Versatile Video Coding (VVC) will be used for the video compression method, since it is the latest video coding method with the best encoding performance. The neural network is trained using the features of YOLO-v7, the latest object detection model. By using VVC as the video coding method and YOLO-v7 as the detection model, high object detection accuracy is achieved even at low bit rates. Experimental results show that the combination of the proposed method and VVC achieves better coding performance than regular VVC in object detection accuracy.

READ FULL TEXT

page 3

page 4

research
05/30/2023

VVC Extension Scheme for Object Detection Using Contrast Reduction

In recent years, video analysis using Artificial Intelligence (AI) has b...
research
08/27/2023

Image Coding for Machines with Object Region Learning

Compression technology is essential for efficient image transmission and...
research
05/21/2018

DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices Compressed with Quantization and Tensorization

As it requires a huge number of parameters when exposed to high dimensio...
research
04/07/2021

An Object Detection based Solver for Google's Image reCAPTCHA v2

Previous work showed that reCAPTCHA v2's image challenges could be solve...
research
04/08/2021

Multi-Density Attention Network for Loop Filtering in Video Compression

Video compression is a basic requirement for consumer and professional v...
research
12/14/2022

Improving Warped Planar Object Detection Network For Automatic License Plate Recognition

This paper aims to improve the Warping Planer Object Detection Network (...
research
10/22/2019

Towards best practice in explaining neural network decisions with LRP

Within the last decade, neural network based predictors have demonstrate...

Please sign up or login with your details

Forgot password? Click here to reset