A Gated Fusion Network for Dynamic Saliency Prediction

02/15/2021
by   Aysun Kocak, et al.
9

Predicting saliency in videos is a challenging problem due to complex modeling of interactions between spatial and temporal information, especially when ever-changing, dynamic nature of videos is considered. Recently, researchers have proposed large-scale datasets and models that take advantage of deep learning as a way to understand what's important for video saliency. These approaches, however, learn to combine spatial and temporal features in a static manner and do not adapt themselves much to the changes in the video content. In this paper, we introduce Gated Fusion Network for dynamic saliency (GFSalNet), the first deep saliency model capable of making predictions in a dynamic way via gated fusion mechanism. Moreover, our model also exploits spatial and channel-wise attention within a multi-scale architecture that further allows for highly accurate predictions. We evaluate the proposed approach on a number of datasets, and our experimental analysis demonstrates that it outperforms or is highly competitive with the state of the art. Importantly, we show that it has a good generalization ability, and moreover, exploits temporal information more effectively via its adaptive fusion scheme.

READ FULL TEXT

page 1

page 4

page 6

page 10

page 11

research
02/02/2017

Video Salient Object Detection via Fully Convolutional Networks

This paper proposes a deep learning model to efficiently detect salient ...
research
03/13/2018

A Learning-Based Visual Saliency Fusion Model for High Dynamic Range Video (LBVS-HDR)

Saliency prediction for Standard Dynamic Range (SDR) videos has been wel...
research
12/09/2020

DS-Net: Dynamic Spatiotemporal Network for Video Salient Object Detection

As moving objects always draw more attention of human eyes, the temporal...
research
07/25/2017

Graph-Theoretic Spatiotemporal Context Modeling for Video Saliency Detection

As an important and challenging problem in computer vision, video salien...
research
01/23/2018

Revisiting Video Saliency: A Large-scale Benchmark and a New Model

In this work, we contribute to video saliency research in two ways. Firs...
research
08/13/2023

Video Captioning with Aggregated Features Based on Dual Graphs and Gated Fusion

The application of video captioning models aims at translating the conte...
research
04/10/2019

Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency

The performance of video saliency estimation techniques has achieved sig...

Please sign up or login with your details

Forgot password? Click here to reset