Beyond Short Snippets: Deep Networks for Video Classification

03/31/2015
by   Joe Yue-Hei Ng, et al.

Convolutional neural networks (CNNs) have been extensively applied to image recognition problems, giving state-of-the-art results on recognition, detection, segmentation and retrieval. In this work we propose and evaluate several deep neural network architectures to combine image information across a video over longer time periods than previously attempted. We propose two methods capable of handling full-length videos. The first method explores various convolutional temporal feature pooling architectures, examining the design choices that need to be made when adapting a CNN for this task. The second method explicitly models the video as an ordered sequence of frames. For this purpose we employ a recurrent neural network that uses Long Short-Term Memory (LSTM) cells, which are connected to the output of the underlying CNN. Our best networks exhibit significant performance improvements over previously published results on the Sports 1 million dataset (73.1 [...]) [...] datasets with (88.6 [...]) [...] (82.6 [...]).
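
The contrast between the two methods can be illustrated with a minimal sketch. It is not the authors' implementation: the element-wise max as the pooling operation, the single-layer LSTM, the toy feature vectors standing in for per-frame CNN outputs, and all function names (`temporal_max_pool`, `lstm_step`, `lstm_encode`, `init_params`) are assumptions made for illustration only.

```python
import math
import random


def temporal_max_pool(frame_features):
    """Element-wise max over time: one fixed-size vector per video.

    The result does not depend on the order of the frames.
    """
    return [max(values) for values in zip(*frame_features)]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h, c, weights, biases):
    """One step of a standard LSTM cell on the concatenated input [x, h]."""
    z = x + h  # list concatenation of frame features and previous hidden state
    pre = {
        gate: [sum(w * v for w, v in zip(row, z)) + b
               for row, b in zip(weights[gate], biases[gate])]
        for gate in "ifog"
    }
    i = [sigmoid(v) for v in pre["i"]]       # input gate
    f = [sigmoid(v) for v in pre["f"]]       # forget gate
    o = [sigmoid(v) for v in pre["o"]]       # output gate
    cand = [math.tanh(v) for v in pre["g"]]  # candidate cell update
    c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, cand)]
    h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
    return h_new, c_new


def lstm_encode(frame_features, weights, biases, hidden_size):
    """Run the LSTM over the frames in order and return the last hidden state."""
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    for x in frame_features:
        h, c = lstm_step(list(x), h, c, weights, biases)
    return h


def init_params(input_size, hidden_size, seed=0):
    """Small random weights (hypothetical initialisation, untrained)."""
    rng = random.Random(seed)
    weights = {
        gate: [[rng.uniform(-0.1, 0.1) for _ in range(input_size + hidden_size)]
               for _ in range(hidden_size)]
        for gate in "ifog"
    }
    biases = {gate: [0.0] * hidden_size for gate in "ifog"}
    return weights, biases


# Toy stand-in for per-frame CNN features: 3 frames, 4 dimensions each.
frames = [[0.1, 0.9, 0.0, 0.3],
          [0.5, 0.2, 0.4, 0.3],
          [0.2, 0.1, 0.8, 0.0]]

pooled = temporal_max_pool(frames)  # order-invariant video descriptor
weights, biases = init_params(input_size=4, hidden_size=3)
final_state = lstm_encode(frames, weights, biases, hidden_size=3)  # order-aware
```

In this sketch, pooling collapses any number of frames into one vector regardless of frame order, whereas the recurrent encoder consumes the frames one at a time, so its final state generally changes when the frame order changes. In either case the resulting vector would be passed to a classifier, which is omitted here.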

research
10/11/2016

Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition

Long short-term memory (LSTM) recurrent neural networks (RNNs) have been...
research
06/14/2017

Large-Scale YouTube-8M Video Understanding with Deep Neural Networks

Video classification problem has been studied many years. The success of...
research
06/16/2016

Convolutional Residual Memory Networks

Very deep convolutional neural networks (CNNs) yield state of the art re...
research
05/22/2015

Efficient Large Scale Video Classification

Video classification has advanced tremendously over the recent years. A ...
research
09/04/2015

Object Recognition from Short Videos for Robotic Perception

Deep neural networks have become the primary learning technique for obje...
research
12/18/2017

LSTM Pose Machines

We observed that recent state-of-the-art results on single image human p...
research
06/04/2018

CFCM: Segmentation via Coarse to Fine Context Memory

Recent neural-network-based architectures for image segmentation make ex...
