Video Salient Object Detection Using Spatiotemporal Deep Features

08/04/2017
by Trung-Nghia Le, et al.

This paper presents a method for detecting salient objects in videos that takes full account of temporal information in addition to spatial information. Following recent reports on the advantage of deep features over conventional hand-crafted features, we propose the SpatioTemporal Deep (STD) feature, which utilizes local and global contexts over frames. We also propose the SpatioTemporal Conditional Random Field (STCRF) to compute saliency from STD features. STCRF extends the CRF to the temporal domain and formulates the relationship between neighboring regions both within a frame and across frames. It yields temporally consistent saliency maps, which contributes to accurate detection of salient object boundaries and reduces noise in detection. The proposed method first segments an input video at multiple scales and then computes a saliency map at each scale level using STD features with STCRF. The final saliency map is computed by fusing the saliency maps from the different scale levels. Extensive experiments on publicly available benchmark datasets confirm that the proposed method significantly outperforms state-of-the-art methods. We also applied our saliency computation to the video object segmentation task, showing that our method outperforms existing video object segmentation methods.
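
The final fusion step can be illustrated with a minimal sketch. The abstract does not say how the per-scale saliency maps are combined, so the sketch below assumes the simplest option: min-max normalize each map and average them pixel-wise. The names `normalize` and `fuse_scales` are hypothetical, and this is not the paper's fusion rule.

```python
from typing import List, Sequence

SaliencyMap = List[List[float]]  # one frame, indexed as [row][column]


def normalize(saliency: SaliencyMap) -> SaliencyMap:
    """Min-max rescale a map to [0, 1]; a constant map becomes all zeros."""
    values = [v for row in saliency for v in row]
    lo, hi = min(values), max(values)
    if hi == lo:
        return [[0.0 for _ in row] for row in saliency]
    return [[(v - lo) / (hi - lo) for v in row] for row in saliency]


def fuse_scales(maps: Sequence[SaliencyMap]) -> SaliencyMap:
    """Fuse per-scale maps of identical size by pixel-wise averaging.

    ASSUMPTION: plain averaging is used for illustration only; the
    source does not specify the fusion rule.
    """
    if not maps:
        raise ValueError("at least one saliency map is required")
    normed = [normalize(m) for m in maps]
    height, width = len(normed[0]), len(normed[0][0])
    for m in normed:
        if len(m) != height or any(len(row) != width for row in m):
            raise ValueError("all saliency maps must have the same size")
    return [
        [sum(m[y][x] for m in normed) / len(normed) for x in range(width)]
        for y in range(height)
    ]


if __name__ == "__main__":
    coarse = [[0.0, 2.0], [4.0, 8.0]]  # toy map from a coarse scale
    fine = [[0.0, 1.0], [1.0, 1.0]]    # toy map from a fine scale
    print(fuse_scales([coarse, fine]))
```

Normalizing before averaging keeps a scale with a larger numeric range from dominating the result. A weighted or learned combination would fit the same interface.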

research
08/04/2017

Region-Based Multiscale Spatiotemporal Saliency for Video

Detecting salient objects from a video requires exploiting both spatial ...
research
04/18/2019

Salient Object Detection: A Distinctive Feature Integration Model

We propose a novel method for salient object detection in different imag...
research
08/15/2019

TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection

TASED-Net is a 3D fully-convolutional network architecture for video sal...
research
01/11/2023

VS-Net: Multiscale Spatiotemporal Features for Lightweight Video Salient Document Detection

Video Salient Document Detection (VSDD) is an essential task of practica...
research
07/13/2021

Developmental Stage Classification of Embryos Using Two-Stream Neural Network with Linear-Chain Conditional Random Field

The developmental process of embryos follows a monotonic order. An embry...
research
10/13/2021

Saliency Detection via Global Context Enhanced Feature Fusion and Edge Weighted Loss

UNet-based methods have shown outstanding performance in salient object ...
research
07/20/2020

TENet: Triple Excitation Network for Video Salient Object Detection

In this paper, we propose a simple yet effective approach, named Triple ...
