Disentangling Space and Time in Video with Hierarchical Variational Auto-encoders

12/14/2016
by   Will Grathwohl, et al.
0

There are many forms of feature information present in video data. Principle among them are object identity information which is largely static across multiple video frames, and object pose and style information which continuously transforms from frame to frame. Most existing models confound these two types of representation by mapping them to a shared feature space. In this paper we propose a probabilistic approach for learning separable representations of object identity and pose information using unsupervised video data. Our approach leverages a deep generative model with a factored prior distribution that encodes properties of temporal invariances in the hidden feature set. Learning is achieved via variational inference. We present results of learning identity and pose information on a dataset of moving characters as well as a dataset of rotating 3D objects. Our experimental results demonstrate our model's success in factoring its representation, and demonstrate that the model achieves improved performance in transfer learning tasks.

READ FULL TEXT

page 5

page 6

page 7

page 8

research
11/21/2021

Video Content Swapping Using GAN

Video generation is an interesting problem in computer vision. It is qui...
research
06/07/2022

ObPose: Leveraging Canonical Pose for Object-Centric Scene Inference in 3D

We present ObPose, an unsupervised object-centric generative model that ...
research
06/23/2020

Learning Disentangled Representations of Video with Missing Data

Missing data poses significant challenges while learning representations...
research
01/13/2018

Feature Space Transfer for Data Augmentation

The problem of data augmentation in feature space is considered. A new a...
research
10/24/2022

Occam learning

We discuss probabilistic neural network models for unsupervised learning...
research
10/13/2021

The deep generative decoder: Using MAP estimates of representations

A deep generative model is characterized by a representation space, its ...
research
12/13/2019

Identity Preserve Transform: Understand What Activity Classification Models Have Learnt

Activity classification has observed great success recently. The perform...

Please sign up or login with your details

Forgot password? Click here to reset