Relaxed Spatio-Temporal Deep Feature Aggregation for Real-Fake Expression Prediction

08/24/2017
by   Savas Ozkan, et al.

Frame-level visual features are generally aggregated over time with techniques such as LSTM, Fisher Vectors, and NetVLAD to produce a robust video-level representation. We introduce a learnable aggregation technique whose primary objective is to retain the short-time temporal structure between frame-level features and their spatial interdependencies in the representation. It can also be easily adapted to cases where training samples are very scarce. We evaluate the method on a real-fake expression prediction dataset to demonstrate its superiority. Our method obtains 65% on the test dataset in the official MAP evaluation, only one misclassified decision behind the best reported result in the Chalearn Challenge (i.e. 66.7%). Lastly, we believe that this method can be extended to other problems such as action/event recognition in the future.
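
To make the idea of retaining short-time temporal structure concrete, the following is a minimal sketch and not the authors' method. It has no learnable parameters, and the function names `mean_pool` and `window_pool`, as well as the toy feature values, are assumptions made purely for illustration. It contrasts plain temporal averaging, which ignores frame order, with averaging over concatenated runs of consecutive frames, which keeps the order inside each short window.

```python
from typing import List

Vector = List[float]


def mean_pool(frames: List[Vector]) -> Vector:
    """Order-invariant baseline: average the frame features over time."""
    n = len(frames)
    return [sum(column) / n for column in zip(*frames)]


def window_pool(frames: List[Vector], window: int = 2) -> Vector:
    """Average concatenated runs of `window` consecutive frames.

    Hypothetical illustration only: concatenating neighbouring frames
    before averaging keeps their order inside each short window, so the
    result has length window * feature_dim and changes when the frame
    order changes.
    """
    if window < 1 or window > len(frames):
        raise ValueError("window must be between 1 and the number of frames")
    stacked = [
        [value for frame in frames[i:i + window] for value in frame]
        for i in range(len(frames) - window + 1)
    ]
    return mean_pool(stacked)


# Toy frame-level features: 3 frames, 2 dimensions each (made-up values).
frames = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
reversed_frames = frames[::-1]

# Plain averaging cannot tell the two clips apart; windowed pooling can.
same_under_mean = mean_pool(frames) == mean_pool(reversed_frames)
same_under_window = window_pool(frames) == window_pool(reversed_frames)
```

Here `same_under_mean` is true while `same_under_window` is false, which is the property a short-time, order-aware aggregation is meant to provide. A learnable variant would replace the fixed averaging with trained weights.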


