Just Noticeable Visual Redundancy Forecasting: A Deep Multimodal-driven Approach

03/18/2023
by   Wuyuan Xie, et al.
0

Just noticeable difference (JND) refers to the maximum visual change that human eyes cannot perceive, and it has a wide range of applications in multimedia systems. However, most existing JND approaches only focus on a single modality, and rarely consider the complementary effects of multimodal information. In this article, we investigate the JND modeling from an end-to-end homologous multimodal perspective, namely hmJND-Net. Specifically, we explore three important visually sensitive modalities, including saliency, depth, and segmentation. To better utilize homologous multimodal information, we establish an effective fusion method via summation enhancement and subtractive offset, and align homologous multimodal features based on a self-attention driven encoder-decoder paradigm. Extensive experimental results on eight different benchmark datasets validate the superiority of our hmJND-Net over eight representative methods.

READ FULL TEXT

page 1

page 2

page 5

page 6

page 7

research
05/03/2022

i-Code: An Integrative and Composable Multimodal Learning Framework

Human intelligence is multimodal; we integrate visual, linguistic, and a...
research
12/03/2021

LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences

Learning modality-fused representations and processing unaligned multimo...
research
11/23/2021

Sparse Fusion for Multimodal Transformers

Multimodal classification is a core task in human-centric machine learni...
research
03/13/2023

Mirror U-Net: Marrying Multimodal Fission with Multi-task Learning for Semantic Segmentation in Medical Imaging

Positron Emission Tomography (PET) and Computer Tomography (CT) are rout...
research
03/31/2022

Dynamic Multimodal Fusion

Deep multimodal learning has achieved great progress in recent years. Ho...
research
12/20/2021

Multimodal Adversarially Learned Inference with Factorized Discriminators

Learning from multimodal data is an important research topic in machine ...
research
12/12/2020

Multimodal In-bed Pose and Shape Estimation under the Blankets

Humans spend vast hours in bed – about one-third of the lifetime on aver...

Please sign up or login with your details

Forgot password? Click here to reset