Towards Modality Transferable Visual Information Representation with Optimal Model Compression

08/13/2020
by Rongqun Lin, et al.

Compactly representing visual signals is of fundamental importance in various image- and video-centered applications. Although numerous approaches have been developed to improve image and video coding performance by removing redundancies within visual signals, much less work has been dedicated to transforming the visual signals into another well-established modality for better representation capability. In this paper, we propose a new scheme for visual signal representation that leverages the philosophy of transferable modality. In particular, the deep learning model, which characterizes and absorbs the statistics of the input scene through online training, can be efficiently represented in the sense of rate-utility optimization to serve as the enhancement layer in the bitstream. As such, the overall performance can be further guaranteed by optimizing the newly incorporated modality. The proposed framework is implemented on the state-of-the-art video coding standard (i.e., Versatile Video Coding), and significantly better representation capability has been observed in extensive evaluations.
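To make the phrase "rate-utility optimization" concrete, the following is a minimal sketch of a generic Lagrangian selection among compressed versions of a model. It is not the authors' algorithm: the `Candidate` type, the `select_candidate` function, the multiplier `lam`, and all numbers are hypothetical and chosen only for illustration. The assumption is that each candidate carries a bit cost for signalling the model in the bitstream and a utility (for example, a quality gain) obtained by using it.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Candidate:
    """One hypothetical compressed version of the online-trained model."""
    name: str
    rate_bits: float  # bits needed to signal this model in the bitstream
    utility: float    # benefit of using it, e.g. quality gain in dB (illustrative)


def select_candidate(candidates: Iterable[Candidate], lam: float) -> Candidate:
    """Return the candidate that maximises utility - lam * rate_bits.

    lam is an assumed Lagrange multiplier trading bits against utility;
    a larger lam penalises rate more heavily.
    """
    return max(candidates, key=lambda c: c.utility - lam * c.rate_bits)


# Illustrative numbers only; they are not taken from the paper.
candidates = [
    Candidate("no_model", 0.0, 0.0),
    Candidate("8bit_weights", 40_000.0, 0.50),
    Candidate("4bit_weights", 20_000.0, 0.40),
    Candidate("2bit_weights", 10_000.0, 0.15),
]

best = select_candidate(candidates, lam=1e-5)
print(best.name)
```

With these illustrative numbers and `lam=1e-5`, the 4-bit candidate is selected; setting `lam` to zero favours the most accurate model regardless of its size, while a large `lam` falls back to sending no model at all.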

research
04/21/2020

Towards Analysis-friendly Face Representation with Scalable Feature and Texture Compression

It plays a fundamental role to compactly represent the visual informatio...
research
03/14/2019

Scalable Facial Image Compression with Deep Feature Reconstruction

In this paper, we propose a scalable image compression scheme, including...
research
06/21/2022

Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive Learning

Video highlight detection is a crucial yet challenging problem that aims...
research
04/07/2019

Image and Video Compression with Neural Networks: A Review

In recent years, the image and video coding technologies have advanced b...
research
07/18/2023

Learned Scalable Video Coding For Humans and Machines

Video coding has traditionally been developed to support services such a...
research
08/23/2021

Learned Image Coding for Machines: A Content-Adaptive Approach

Today, according to the Cisco Annual Internet Report (2018-2023), the fa...
research
11/14/2007

On the Information Rates of the Plenoptic Function

The plenoptic function (Adelson and Bergen, 91) describes the visual in...
