Dynamic texture and scene classification by transferring deep image features

02/01/2015
by   Xianbiao Qi, et al.
0

Dynamic texture and scene classification are two fundamental problems in understanding natural video content. Extracting robust and effective features is a crucial step towards solving these problems. However the existing approaches suffer from the sensitivity to either varying illumination, or viewpoint changing, or even camera motion, and/or the lack of spatial information. Inspired by the success of deep structures in image classification, we attempt to leverage a deep structure to extract feature for dynamic texture and scene classification. To tackle with the challenges in training a deep structure, we propose to transfer some prior knowledge from image domain to video domain. To be specific, we propose to apply a well-trained Convolutional Neural Network (ConvNet) as a mid-level feature extractor to extract features from each frame, and then form a representation of a video by concatenating the first and the second order statistics over the mid-level features. We term this two-level feature extraction scheme as a Transferred ConvNet Feature (TCoF). Moreover we explore two different implementations of the TCoF scheme, i.e., the spatial TCoF and the temporal TCoF, in which the mean-removed frames and the difference between two adjacent frames are used as the inputs of the ConvNet, respectively. We evaluate systematically the proposed spatial TCoF and the temporal TCoF schemes on three benchmark data sets, including DynTex, YUPENN, and Maryland, and demonstrate that the proposed approach yields superior performance.

READ FULL TEXT

page 3

page 8

page 9

page 10

page 11

research
09/11/2018

Temporal-Spatial Mapping for Action Recognition

Deep learning models have enjoyed great success for image related comput...
research
02/17/2015

SA-CNN: Dynamic Scene Classification using Convolutional Neural Networks

The task of classifying videos of natural dynamic scenes into appropriat...
research
06/09/2017

Manifold Regularized Slow Feature Analysis for Dynamic Texture Recognition

Dynamic textures exist in various forms, e.g., fire, smoke, and traffic ...
research
08/22/2022

Multilayer deep feature extraction for visual texture recognition

Convolutional neural networks have shown successful results in image cla...
research
07/22/2018

Deep Discriminative Model for Video Classification

This paper presents a new deep learning approach for video-based scene c...
research
03/22/2016

Knowledge Transfer for Scene-specific Motion Prediction

When given a single frame of the video, humans can not only interpret th...
research
02/10/2021

A Topological Approach for Motion Track Discrimination

Detecting small targets at range is difficult because there is not enoug...

Please sign up or login with your details

Forgot password? Click here to reset