Video Salient Object Detection via Fully Convolutional Networks

02/02/2017
by   Wenguan Wang, et al.

This paper proposes a deep learning model to efficiently detect salient regions in videos. It addresses two important issues: (1) training a deep video saliency model in the absence of sufficiently large, pixel-wise annotated video data, and (2) fast video saliency training and detection. The proposed deep video saliency network consists of two modules that capture spatial and temporal saliency information, respectively. The dynamic saliency model explicitly incorporates saliency estimates from the static saliency model and directly produces spatiotemporal saliency inference without time-consuming optical flow computation. We further propose a novel data augmentation technique that simulates video training data from existing annotated image datasets, which enables our network to learn diverse saliency information and prevents overfitting to the limited number of training videos. Leveraging our synthetic video data (150K video sequences) and real videos, our deep video saliency model successfully learns both spatial and temporal saliency cues, thus producing accurate spatiotemporal saliency estimates. We advance the state of the art on the DAVIS dataset (MAE of 0.06) and the FBMS dataset (MAE of 0.07), and do so with much improved speed (2 fps with all steps).
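The results above are reported as MAE (mean absolute error). As a minimal sketch, assuming the usual definition of MAE for saliency maps (the average per-pixel absolute difference between a predicted map and a binary ground-truth mask, both scaled to [0, 1]), the metric can be computed as follows. The function name `mean_absolute_error` and the 2x2 toy maps are illustrative assumptions, not taken from the paper.

```python
def mean_absolute_error(saliency, ground_truth):
    """Average per-pixel absolute difference between two 2D maps.

    Both arguments are lists of rows with values in [0, 1] and must
    have identical shapes.
    """
    if len(saliency) != len(ground_truth):
        raise ValueError("maps must have the same number of rows")
    total = 0.0
    count = 0
    for pred_row, gt_row in zip(saliency, ground_truth):
        if len(pred_row) != len(gt_row):
            raise ValueError("rows must have the same length")
        for pred, gt in zip(pred_row, gt_row):
            total += abs(pred - gt)
            count += 1
    if count == 0:
        raise ValueError("maps must not be empty")
    return total / count


# Toy example (hypothetical values): a 2x2 predicted saliency map
# compared against a binary ground-truth mask.
pred = [[0.9, 0.1],
        [0.2, 0.8]]
gt = [[1.0, 0.0],
      [0.0, 1.0]]
mae = mean_absolute_error(pred, gt)
print(f"MAE = {mae:.2f}")
```

Lower is better: a score of 0 means the predicted map matches the mask exactly. A dataset-level score is typically the mean of this value over all evaluated frames.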


research
07/12/2018

Video Saliency Detection by 3D Convolutional Neural Networks

Different from salient object detection methods for still images, a key ...
research
08/07/2020

Exploring Rich and Efficient Spatial Temporal Interactions for Real Time Video Salient Object Detection

The current main stream methods formulate their video saliency mainly fr...
research
02/15/2021

A Gated Fusion Network for Dynamic Saliency Prediction

Predicting saliency in videos is a challenging problem due to complex mo...
research
12/09/2020

DS-Net: Dynamic Spatiotemporal Network for Video Salient Object Detection

As moving objects always draw more attention of human eyes, the temporal...
research
01/29/2022

Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

Field of view (FoV) prediction is critical in 360-degree video multicast...
research
04/10/2019

Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency

The performance of video saliency estimation techniques has achieved sig...
research
11/19/2018

Global and Local Sensitivity Guided Key Salient Object Re-augmentation for Video Saliency Detection

The existing still-static deep learning based saliency researches do not...
