Incorporating Scalability in Unsupervised Spatio-Temporal Feature Learning

08/06/2018
by   Sujoy Paul, et al.
0

Deep neural networks are efficient learning machines which leverage upon a large amount of manually labeled data for learning discriminative features. However, acquiring substantial amount of supervised data, especially for videos can be a tedious job across various computer vision tasks. This necessitates learning of visual features from videos in an unsupervised setting. In this paper, we propose a computationally simple, yet effective, framework to learn spatio-temporal feature embedding from unlabeled videos. We train a Convolutional 3D Siamese network using positive and negative pairs mined from videos under certain probabilistic assumptions. Experimental results on three datasets demonstrate that our proposed framework is able to learn weights which can be used for same as well as cross dataset and tasks.

READ FULL TEXT
research
03/25/2015

Initialization Strategies of Spatio-Temporal Convolutional Neural Networks

We propose a new way of incorporating temporal information present in vi...
research
01/24/2018

Unsupervised learning from videos using temporal coherency deep networks

In this work we address the challenging problem of unsupervised learning...
research
10/22/2020

Spatio-temporal Features for Generalized Detection of Deepfake Videos

For deepfake detection, video-level detectors have not been explored as ...
research
08/01/2020

Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning

Traffic accident anticipation aims to predict accidents from dashcam vid...
research
08/14/2017

Kinship Verification from Videos using Spatio-Temporal Texture Features and Deep Learning

Automatic kinship verification using facial images is a relatively new a...
research
05/09/2017

Deep Spatio-temporal Manifold Network for Action Recognition

Visual data such as videos are often sampled from complex manifold. We p...
research
06/16/2020

Focus of Attention Improves Information Transfer in Visual Features

Unsupervised learning from continuous visual streams is a challenging pr...

Please sign up or login with your details

Forgot password? Click here to reset