Disentangling Motion, Foreground and Background Features in Videos

07/13/2017
by   Xunyu Lin, et al.
0

This paper introduces an unsupervised framework to extract semantically rich features for video representation. Inspired by how the human visual system groups objects based on motion cues, we propose a deep convolutional neural network that disentangles motion, foreground and background information. The proposed architecture consists of a 3D convolutional feature encoder for blocks of 16 frames, which is trained for reconstruction tasks over the first and last frames of the sequence. A preliminary supervised experiment was conducted to verify the feasibility of proposed method by training the model with a fraction of videos from the UCF-101 dataset taking as ground truth the bounding boxes around the activity regions. Qualitative results indicate that the network can successfully segment foreground and background in videos as well as update the foreground appearance based on disentangled motion features. The benefits of these learned features are shown in a discriminative classification task, where initializing the network with the proposed pretraining method outperforms both random initialization and autoencoder pretraining. Our model and source code are publicly available at https://imatge-upc.github.io/unsupervised-2017-cvprw/ .

READ FULL TEXT

page 3

page 4

research
11/26/2018

Foreground Clustering for Joint Segmentation and Localization in Videos and Images

This paper presents a novel framework in which video/image segmentation ...
research
12/15/2021

Autoencoder-based background reconstruction and foreground segmentation with background noise estimation

Even after decades of research, dynamic scene background reconstruction ...
research
09/10/2021

Temporally Coherent Person Matting Trained on Fake-Motion Dataset

We propose a novel neural-network-based method to perform matting of vid...
research
06/14/2020

Adaptively Meshed Video Stabilization

Video stabilization is essential for improving visual quality of shaky v...
research
02/03/2019

DeepPBM: Deep Probabilistic Background Model Estimation from Video Sequences

This paper presents a novel unsupervised probabilistic model estimation ...
research
06/01/2018

A Classification approach towards Unsupervised Learning of Visual Representations

In this paper, we present a technique for unsupervised learning of visua...
research
11/13/2018

Deep Neural Network Concepts for Background Subtraction: A Systematic Review and Comparative Evaluation

Conventional neural networks show a powerful framework for background su...

Please sign up or login with your details

Forgot password? Click here to reset