Slow and steady feature analysis: higher order temporal coherence in video

06/15/2015
by Dinesh Jayaraman, et al.

How can unlabeled video augment visual learning? Existing methods perform "slow" feature analysis, encouraging the representations of temporally close frames to exhibit only small differences. While this standard approach captures the fact that high-level visual signals change slowly over time, it fails to capture *how* the visual content changes. We propose to generalize slow feature analysis to "steady" feature analysis. The key idea is to impose a prior that higher order derivatives in the learned feature space must be small. To this end, we train a convolutional neural network with a regularizer on tuples of sequential frames from unlabeled video. The regularizer encourages feature changes over time to be smooth, i.e., similar to the most recent changes. Using five diverse datasets, including unlabeled YouTube and KITTI videos, we demonstrate our method's impact on object, scene, and action recognition tasks. We further show that our features learned from unlabeled video can even surpass a standard heavily supervised pretraining approach.
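
To make the distinction between "slow" and "steady" concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the paper's actual loss: the abstract does not give the regularizer's form, so the sketch assumes simple mean squared finite differences over per-frame feature vectors. The function names `slowness_penalty` and `steadiness_penalty` and the toy trajectories are hypothetical.

```python
# Illustrative sketch only. Assumption: penalties are mean squared finite
# differences of per-frame feature vectors; the paper's loss may differ.

def _diff(a, b):
    """Element-wise difference a - b of two equal-length feature vectors."""
    return [x - y for x, y in zip(a, b)]

def _sq_norm(v):
    """Squared Euclidean norm of a vector."""
    return sum(x * x for x in v)

def slowness_penalty(feats):
    """First-order term: mean of ||z[t+1] - z[t]||^2 over consecutive frames."""
    terms = [_sq_norm(_diff(b, a)) for a, b in zip(feats[:-1], feats[1:])]
    return sum(terms) / len(terms)

def steadiness_penalty(feats):
    """Second-order term: mean of ||(z[t+2] - z[t+1]) - (z[t+1] - z[t])||^2.

    It is small when each feature change resembles the most recent change.
    """
    triples = zip(feats[:-2], feats[1:-1], feats[2:])
    terms = [_sq_norm(_diff(_diff(c, b), _diff(b, a))) for a, b, c in triples]
    return sum(terms) / len(terms)

# Toy 2-D feature trajectories (hypothetical data, five "frames" each).
linear = [[1.0 * t, 2.0 * t] for t in range(5)]          # constant velocity
curved = [[1.0 * t * t, 2.0 * t * t] for t in range(5)]  # accelerating

print(slowness_penalty(linear), steadiness_penalty(linear))
print(slowness_penalty(curved), steadiness_penalty(curved))
```

On the constant-velocity trajectory the second-order term vanishes even though the first-order term does not, which is the sense in which a steadiness prior constrains how features change rather than only how much they change.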


research
01/24/2018

Unsupervised learning from videos using temporal coherency deep networks

In this work we address the challenging problem of unsupervised learning...
research
12/01/2016

Object-Centric Representation Learning from Unlabeled Videos

Supervised (pre-)training currently yields state-of-the-art performance ...
research
01/19/2017

Higher-order Pooling of CNN Features via Kernel Linearization for Action Recognition

Most successful deep learning algorithms for action recognition extend m...
research
03/29/2021

No frame left behind: Full Video Action Recognition

Not all video frames are equally informative for recognizing an action. ...
research
12/13/2018

Dynamic Graph Modules for Modeling Higher-Order Interactions in Activity Recognition

Video action recognition, as a critical problem towards video understand...
research
02/04/2021

Semi-Supervised Action Recognition with Temporal Contrastive Learning

Learning to recognize actions from only a handful of labeled videos is a...
research
06/07/2022

Online Deep Clustering with Video Track Consistency

Several unsupervised and self-supervised approaches have been developed ...
