SG-FCN: A Motion and Memory-Based Deep Learning Model for Video Saliency Detection

09/21/2018
by   Meijun Sun, et al.
14

Data-driven saliency detection has attracted strong interest as a result of applying convolutional neural networks to the detection of eye fixations. Although a number of imagebased salient object and fixation detection models have been proposed, video fixation detection still requires more exploration. Different from image analysis, motion and temporal information is a crucial factor affecting human attention when viewing video sequences. Although existing models based on local contrast and low-level features have been extensively researched, they failed to simultaneously consider interframe motion and temporal information across neighboring video frames, leading to unsatisfactory performance when handling complex scenes. To this end, we propose a novel and efficient video eye fixation detection model to improve the saliency detection performance. By simulating the memory mechanism and visual attention mechanism of human beings when watching a video, we propose a step-gained fully convolutional network by combining the memory information on the time axis with the motion information on the space axis while storing the saliency information of the current frame. The model is obtained through hierarchical training, which ensures the accuracy of the detection. Extensive experiments in comparison with 11 state-of-the-art methods are carried out, and the results show that our proposed model outperforms all 11 methods across a number of publicly available datasets.

READ FULL TEXT

page 2

page 5

page 6

page 9

page 12

research
07/12/2018

Video Saliency Detection by 3D Convolutional Neural Networks

Different from salient object detection methods for still images, a key ...
research
08/15/2019

TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection

TASED-Net is a 3D fully-convolutional network architecture for video sal...
research
09/19/2017

Predicting Video Saliency with Object-to-Motion CNN and Two-layer Convolutional LSTM

Over the past few years, deep neural networks (DNNs) have exhibited grea...
research
01/23/2018

Revisiting Video Saliency: A Large-scale Benchmark and a New Model

In this work, we contribute to video saliency research in two ways. Firs...
research
02/27/2020

A Neuromorphic Proto-Object Based Dynamic Visual Saliency Model with an FPGA Implementation

The ability to attend to salient regions of a visual scene is an innate ...
research
09/17/2023

Detection and Localization of Firearm Carriers in Complex Scenes for Improved Safety Measures

Detecting firearms and accurately localizing individuals carrying them i...
research
03/31/2017

Semantic-driven Generation of Hyperlapse from 360^∘ Video

We present a system for converting a fully panoramic (360^∘) video into ...

Please sign up or login with your details

Forgot password? Click here to reset