TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation

01/11/2023
by Feiyan Hu, et al.

Video saliency prediction has recently attracted the attention of the research community, as it is an upstream task for several practical applications. However, current solutions are computationally demanding, especially due to the wide use of spatio-temporal 3D convolutions. We observe that, while different model architectures achieve similar performance on benchmarks, the visual variations between their predicted saliency maps are still significant. Inspired by this observation, we propose a lightweight model that employs multiple simple heterogeneous decoders and adopts several practical approaches to improve accuracy while keeping computational costs low, such as hierarchical multi-map knowledge distillation, multi-output saliency prediction, unlabeled auxiliary datasets, and channel reduction with teacher assistant supervision. Our approach achieves saliency prediction accuracy on par with or better than state-of-the-art methods on the DHF1K, UCF-Sports and Hollywood2 benchmarks, while significantly improving the efficiency of the model. Code is available at https://github.com/feiyanhu/tinyHD
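
To make the idea of multi-map knowledge distillation more concrete, the following is a minimal sketch, not the authors' implementation. It assumes that a student and a teacher each produce the same number of saliency maps, and that the student is trained to match each teacher map through a KL divergence, a loss commonly used in saliency prediction. The function names, the per-map weights and the choice of KL divergence are all assumptions made for illustration; the abstract does not specify the actual loss.

```python
import math


def normalize(saliency_map, eps=1e-8):
    """Scale a 2D map of non-negative values so that it sums to (about) 1."""
    total = sum(sum(row) for row in saliency_map) + eps
    return [[value / total for value in row] for row in saliency_map]


def kl_divergence(target, pred, eps=1e-8):
    """KL(target || pred) after normalizing both maps to distributions."""
    p = normalize(target)
    q = normalize(pred)
    return sum(
        pv * math.log((pv + eps) / (qv + eps))
        for p_row, q_row in zip(p, q)
        for pv, qv in zip(p_row, q_row)
    )


def multi_map_distillation_loss(student_maps, teacher_maps, weights=None):
    """Weighted sum of per-map KL divergences (hypothetical formulation).

    student_maps and teacher_maps are lists of 2D maps of equal length,
    for example one map per decoder or per hierarchy level.
    """
    if len(student_maps) != len(teacher_maps):
        raise ValueError("student and teacher must provide the same number of maps")
    if weights is None:
        weights = [1.0] * len(student_maps)  # assumption: equal weighting
    return sum(
        w * kl_divergence(t, s)
        for w, t, s in zip(weights, teacher_maps, student_maps)
    )


# Toy example: one map the student matches exactly, one it gets wrong.
teacher = [[[1.0, 0.0], [0.0, 0.0]], [[0.5, 0.5], [0.0, 0.0]]]
student = [[[0.25, 0.25], [0.25, 0.25]], [[0.5, 0.5], [0.0, 0.0]]]
loss = multi_map_distillation_loss(student, teacher)
```

In this toy case only the first map contributes to the loss, since the second student map already equals its teacher map. A real training setup would compute the same quantity on tensors inside a deep learning framework.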


research
07/05/2022

SESS: Saliency Enhancing with Scaling and Sliding

High-quality saliency maps are essential in several machine learning app...
research
10/02/2020

Video Saliency Detection with Domain Adaptation using Hierarchical Gradient Reversal Layers

In this work, we propose a 3D fully convolutional architecture for video...
research
03/11/2020

Unified Image and Video Saliency Modeling

Visual saliency modeling for images and videos is treated as two indepen...
research
08/25/2020

FastSal: a Computationally Efficient Network for Visual Saliency Prediction

This paper focuses on the problem of visual saliency prediction, predict...
research
03/15/2023

PoseRAC: Pose Saliency Transformer for Repetitive Action Counting

This paper presents a significant contribution to the field of repetitiv...
research
06/11/2020

JIT-Masker: Efficient Online Distillation for Background Matting

We design a real-time portrait matting pipeline for everyday use, partic...
research
12/14/2020

FasteNet: A Fast Railway Fastener Detector

In this work, a novel high-speed railway fastener detector is introduced...
