Saliency-guided video classification via adaptively weighted learning

03/23/2017
by Yunzhen Zhao, et al.

Video classification is useful in many practical applications, and recent advances in deep learning have greatly improved its accuracy. However, existing works often model video frames indiscriminately, even though, from the viewpoint of motion, video frames decompose naturally into salient and non-salient areas. Salient and non-salient areas should be modeled with different networks, because the former present both appearance and motion information, while the latter present static background information. To address this problem, we first predict video saliency from optical flow without supervision. Two 3D CNN streams are then trained individually on raw frames and on optical flow in salient areas, and a 2D CNN is trained on raw frames in non-salient areas. Because these three streams play different roles for each class, the weight of each stream is learned adaptively per class. Experimental results show that saliency-guided modeling and adaptively weighted learning reinforce each other, and we achieve state-of-the-art results.
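As a rough illustration of the per-class weighting idea, the sketch below fuses class scores from three streams using one weight per stream and per class. It is a minimal sketch under stated assumptions, not the authors' method: the stream names, the helper functions `normalize_class_weights` and `fuse_scores`, the normalization step, and the hand-set weights are all hypothetical, and the abstract does not say how the weights are actually learned.

```python
from typing import Dict, List

# Hypothetical stream names for the three networks described in the abstract.
STREAMS = ("salient_rgb_3d", "salient_flow_3d", "nonsalient_rgb_2d")


def normalize_class_weights(raw: Dict[str, List[float]]) -> Dict[str, List[float]]:
    """Rescale weights so that, for every class, the stream weights sum to 1.

    This normalization is an assumption made for the sketch only.
    """
    num_classes = len(next(iter(raw.values())))
    normalized = {name: [0.0] * num_classes for name in raw}
    for c in range(num_classes):
        total = sum(raw[name][c] for name in raw)
        if total <= 0:
            raise ValueError(f"weights for class {c} must sum to a positive value")
        for name in raw:
            normalized[name][c] = raw[name][c] / total
    return normalized


def fuse_scores(scores: Dict[str, List[float]],
                weights: Dict[str, List[float]]) -> List[float]:
    """Return one fused score per class: sum over streams of weight * score."""
    num_classes = len(next(iter(scores.values())))
    return [
        sum(weights[name][c] * scores[name][c] for name in scores)
        for c in range(num_classes)
    ]


# Toy example with two classes; all numbers are placeholders, not results.
scores = {
    "salient_rgb_3d": [0.6, 0.4],
    "salient_flow_3d": [0.2, 0.8],
    "nonsalient_rgb_2d": [0.5, 0.5],
}
raw_weights = {
    "salient_rgb_3d": [2.0, 1.0],     # class 0 leans on appearance
    "salient_flow_3d": [1.0, 2.0],    # class 1 leans on motion
    "nonsalient_rgb_2d": [1.0, 1.0],  # background contributes equally
}
weights = normalize_class_weights(raw_weights)
fused = fuse_scores(scores, weights)
predicted_class = max(range(len(fused)), key=fused.__getitem__)
```

In this toy setup the motion stream receives a larger weight for class 1, so its high score there dominates the fused result. In a trained system the weights would come from an optimization step rather than being set by hand.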

research
09/16/2019

Motion Guided Attention for Video Salient Object Detection

Video salient object detection aims at discovering the most visually dis...
research
03/12/2019

Unsupervised motion saliency map estimation based on optical flow inpainting

The paper addresses the problem of motion saliency in videos, that is, i...
research
05/12/2017

Single Image Action Recognition by Predicting Space-Time Saliency

We propose a novel approach based on deep Convolutional Neural Networks ...
research
11/09/2017

Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification

Video classification is highly important with wide applications, such as...
research
04/27/2016

Deep Learning for Saliency Prediction in Natural Video

The purpose of this paper is the detection of salient areas in natural v...
research
05/19/2023

ViDaS Video Depth-aware Saliency Network

We introduce ViDaS, a two-stream, fully convolutional Video, Depth-Aware...
research
03/31/2017

Semantic-driven Generation of Hyperlapse from 360° Video

We present a system for converting a fully panoramic (360°) video into ...
