Convolutional Drift Networks for Video Classification

11/03/2017
by   Dillon Graham, et al.
0

Analyzing spatio-temporal data like video is a challenging task that requires processing visual and temporal information effectively. Convolutional Neural Networks have shown promise as baseline fixed feature extractors through transfer learning, a technique that helps minimize the training cost on visual information. Temporal information is often handled using hand-crafted features or Recurrent Neural Networks, but this can be overly specific or prohibitively complex. Building a fully trainable system that can efficiently analyze spatio-temporal data without hand-crafted features or complex training is an open challenge. We present a new neural network architecture to address this challenge, the Convolutional Drift Network (CDN). Our CDN architecture combines the visual feature extraction power of deep Convolutional Neural Networks with the intrinsically efficient temporal processing provided by Reservoir Computing. In this introductory paper on the CDN, we provide a very simple baseline implementation tested on two egocentric (first-person) video activity datasets.We achieve video-level activity classification results on-par with state-of-the art methods. Notably, performance on this complex spatio-temporal task was produced by only training a single feed-forward layer in the CDN.

READ FULL TEXT
research
03/25/2015

Initialization Strategies of Spatio-Temporal Convolutional Neural Networks

We propose a new way of incorporating temporal information present in vi...
research
08/13/2017

Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-temporal Path Proposals

Vehicle re-identification is an important problem and has many applicati...
research
05/24/2017

Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit Training for Contextual Video Recognition

Based on the progress of image recognition, video recognition has been e...
research
09/26/2019

Deep Video Deblurring: The Devil is in the Details

Video deblurring for hand-held cameras is a challenging task, since the ...
research
06/15/2020

On the Preservation of Spatio-temporal Information in Machine Learning Applications

In conventional machine learning applications, each data attribute is as...
research
08/29/2016

Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks

This thesis explore different approaches using Convolutional and Recurre...
research
02/04/2019

Saliency Tubes: Visual Explanations for Spatio-Temporal Convolutions

Deep learning approaches have been established as the main methodology f...

Please sign up or login with your details

Forgot password? Click here to reset