An end-to-end generative framework for video segmentation and recognition

09/07/2015
by Hilde Kuehne, et al.

We describe an end-to-end generative approach for the segmentation and recognition of human activities. In this approach, a visual representation based on reduced Fisher Vectors is combined with a structured temporal model for recognition. We show that the statistical properties of Fisher Vectors make them an especially suitable front-end for generative models such as Gaussian mixtures. The system is evaluated on both the recognition of complex activities and their parsing into action units. Using a variety of video datasets ranging from human cooking activities to animal behaviors, our experiments demonstrate that the resulting architecture outperforms state-of-the-art approaches on larger datasets, i.e., when a sufficient amount of data is available for training structured generative models.
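To make the idea of pairing frame-level features with a generative temporal model more concrete, the following is a minimal sketch and not the authors' implementation. It assumes each frame is already described by a low-dimensional feature vector (a stand-in for reduced Fisher Vectors), models every action unit with a single diagonal Gaussian rather than a full Gaussian mixture, and aligns frames to a known left-to-right order of units with Viterbi decoding. All names (`fit_unit`, `segment`, the unit labels, the transition probabilities) are hypothetical and chosen only for illustration.

```python
import math


def gaussian_log_likelihood(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def fit_unit(frames, min_var=1e-3):
    """Estimate per-dimension mean and variance from the frames of one unit."""
    n, dim = len(frames), len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    var = [
        max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, min_var)
        for d in range(dim)
    ]
    return mean, var


def segment(frames, units, order, stay_prob=0.9):
    """Viterbi alignment of frames to a fixed left-to-right sequence of units.

    units maps a unit label to its (mean, var); order lists the labels in the
    sequence they are assumed to occur. Returns one label per frame.
    """
    n_frames, n_states = len(frames), len(order)
    if n_frames < n_states:
        raise ValueError("need at least one frame per unit")
    stay, move = math.log(stay_prob), math.log(1.0 - stay_prob)
    neg_inf = float("-inf")
    score = [[neg_inf] * n_states for _ in range(n_frames)]
    back = [[0] * n_states for _ in range(n_frames)]

    def emit(t, s):
        mean, var = units[order[s]]
        return gaussian_log_likelihood(frames[t], mean, var)

    score[0][0] = emit(0, 0)  # the sequence must start in the first unit
    for t in range(1, n_frames):
        for s in range(n_states):
            best, prev = score[t - 1][s] + stay, s
            if s > 0 and score[t - 1][s - 1] + move > best:
                best, prev = score[t - 1][s - 1] + move, s - 1
            score[t][s] = best + emit(t, s)
            back[t][s] = prev

    # Backtrack from the last unit, which the sequence is assumed to end in.
    s = n_states - 1
    path = [s]
    for t in range(n_frames - 1, 0, -1):
        s = back[t][s]
        path.append(s)
    path.reverse()
    return [order[s] for s in path]


# Toy data: two hypothetical action units with well-separated 2-D features.
units = {
    "pour": fit_unit([[0.0, 0.1], [0.2, -0.1], [-0.1, 0.0]]),
    "stir": fit_unit([[5.0, 5.1], [4.8, 5.0], [5.1, 4.9]]),
}
video = [[0.1, -0.1], [0.0, 0.2], [4.9, 5.1], [5.2, 4.8]]
labels = segment(video, units, order=["pour", "stir"])
print(labels)
```

Under these assumptions, segmentation falls out of decoding: the unit boundaries are simply the frames where the best path changes state. Recognition of a whole activity could then be sketched as running `segment` once per candidate unit order and keeping the order with the highest final score, though the abstract does not say how the authors organize that step.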


